arXiv Open Access 2025

SafeTab-H: Disclosure Avoidance for the 2020 Census Detailed Demographic and Housing Characteristics File B (Detailed DHC-B)

William Sexton Skye Berghel Bayard Carlson Sam Haney Luke Hartman +8 lainnya
Lihat Sumber

Abstrak

This article describes SafeTab-H, a disclosure avoidance algorithm applied to the release of the U.S. Census Bureau's Detailed Demographic and Housing Characteristics File B (Detailed DHC-B) as part of the 2020 Census. The tabulations contain household statistics about household type and tenure iterated by the householder's detailed race, ethnicity, or American Indian and Alaska Native tribe and village at varying levels of geography. We describe the algorithmic strategy which is based on adding noise from a discrete Gaussian distribution and show that the algorithm satisfies a well-studied variant of differential privacy, called zero-concentrated differential privacy. We discuss how the implementation of the SafeTab-H codebase relies on the Tumult Analytics privacy library. We also describe the theoretical expected error properties of the algorithm and explore various aspects of its parameter tuning.

Topik & Kata Kunci

Penulis (13)

W

William Sexton

S

Skye Berghel

B

Bayard Carlson

S

Sam Haney

L

Luke Hartman

M

Michael Hay

A

Ashwin Machanavajjhala

G

Gerome Miklau

A

Amritha Pai

S

Simran Rajpal

D

David Pujol

R

Ruchit Shrestha

D

Daniel Simmons-Marengo

Format Sitasi

Sexton, W., Berghel, S., Carlson, B., Haney, S., Hartman, L., Hay, M. et al. (2025). SafeTab-H: Disclosure Avoidance for the 2020 Census Detailed Demographic and Housing Characteristics File B (Detailed DHC-B). https://arxiv.org/abs/2505.03072

Akses Cepat

Lihat di Sumber
Informasi Jurnal
Tahun Terbit
2025
Bahasa
en
Sumber Database
arXiv
Akses
Open Access ✓