Find in Library
Search millions of books, articles, and more
Indexed Open Access Databases
A Latent Class Modeling Approach for Differentially Private Synthetic Data for Contingency Tables
oleh: Michelle Nixon, Andres Barrientos, Jerome Reiter, Aleksandra Slavkovic
Format: | Article |
---|---|
Diterbitkan: | Labor Dynamics Institute 2022-07-01 |
Deskripsi
We present an approach to construct differentially private synthetic data for contingency tables. The algorithm achieves privacy by adding noise to selected summary counts, e.g., two-way margins of the contingency table, via the Geometric mechanism. We posit an underlying latent class model for the counts, estimate the parameters of the model based on the noisy counts, and generate synthetic data using the estimated model. This approach allows the agency to create multiple imputations of synthetic data with no additional privacy loss, thereby facilitating estimation of uncertainty in downstream analyses. We illustrate the approach using a subset of the 2016 American Community Survey Public Use Microdata Sets.