arXiv Open Access 2025

SafeTab-P: Disclosure Avoidance for the 2020 Census Detailed Demographic and Housing Characteristics File A (Detailed DHC-A)

Sam Haney Skye Berghel Bayard Carlson Ryan Cumings-Menon Luke Hartman +9 lainnya
Lihat Sumber

Abstrak

This article describes the disclosure avoidance algorithm that the U.S. Census Bureau used to protect the Detailed Demographic and Housing Characteristics File A (Detailed DHC-A) of the 2020 Census. The tabulations contain statistics (counts) of demographic characteristics of the entire population of the United States, crossed with detailed races and ethnicities at varying levels of geography. The article describes the SafeTab-P algorithm, which is based on adding noise drawn to statistics of interest from a discrete Gaussian distribution. A key innovation in SafeTab-P is the ability to adaptively choose how many statistics and at what granularity to release them, depending on the size of a population group. We prove that the algorithm satisfies a well-studied variant of differential privacy, called zero-concentrated differential privacy (zCDP). We then describe how the algorithm was implemented on Tumult Analytics and briefly outline the parameterization and tuning of the algorithm.

Topik & Kata Kunci

Penulis (14)

S

Sam Haney

S

Skye Berghel

B

Bayard Carlson

R

Ryan Cumings-Menon

L

Luke Hartman

M

Michael Hay

A

Ashwin Machanavajjhala

G

Gerome Miklau

A

Amritha Pai

S

Simran Rajpal

D

David Pujol

W

William Sexton

R

Ruchit Shrestha

D

Daniel Simmons-Marengo

Format Sitasi

Haney, S., Berghel, S., Carlson, B., Cumings-Menon, R., Hartman, L., Hay, M. et al. (2025). SafeTab-P: Disclosure Avoidance for the 2020 Census Detailed Demographic and Housing Characteristics File A (Detailed DHC-A). https://arxiv.org/abs/2505.01472

Akses Cepat

Lihat di Sumber
Informasi Jurnal
Tahun Terbit
2025
Bahasa
en
Sumber Database
arXiv
Akses
Open Access ✓