Advantages of imputation vs. data swapping for statistical disclosure control

Satkartar K. Kinney; Charlotte B. Looby; Feng Yu

Advantages of imputation vs. data swapping for statistical disclosure control

Kinney, S. K., Looby, C. B., & Yu, F. (2020). Advantages of imputation vs. data swapping for statistical disclosure control. Lecture Notes in Computer Science, 12276, 281-296. https://doi.org/10.1007/978-3-030-57521-2_20

Copy citation

Abstract

Data swapping is an approach long-used by public agencies to protect respondent confidentiality in which values of some variables are swapped with similar records for a small portion of respondents. Synthetic data is a newer method in which many if not all values are replaced with multiple imputations. Synthetic data can be difficult to implement for complex data; however, when the portion of data replaced is similar to data swapping, it becomes simple to implement using publicly available software. This paper describes how this simplification of synthetic data can be used to provide a better balance of data quality and disclosure protection compared to data swapping. This is illustrated via an empirical comparison using data from the Survey of Earned Doctorates.

Publications Info

To contact an RTI author, request a report, or for additional information about publications by our experts, send us your request.

publications@rti.org

RTI shares its evidence-based research - through peer-reviewed publications and media - to ensure that it is accessible for others to build on, in line with our mission and scientific standards.

Recent Publications

Article

Target trial emulation for regulatory and clinical decision making in cancer

April 2026

Article

A systematic process to accurately link large-scale research consents to state public health newborn screening samples

April 2026

Article

The acute and chronic pharmacokinetics and pharmacodynamics of oral cannabidiol with and without low doses of delta-9-tetrahydrocannabinol

April 2026

Article

Newborn screening for type 1 diabetes using genome-based risk scores in the Early Check program

April 2026

Article

Policing as a Structural Determinant of Health

April 2026

Article

Grocery store workers’ knowledge, attitudes, and barriers influencing uptake of COVID-19 vaccine in the United States: A qualitative study

April 2026

Article

A comparative analysis of pediatric pneumococcal vaccination strategies: A dynamic model of PCV20 vs. PCV15 and PCV13

April 2026

Article

"She clearly thought that something bad had happened to her": How military lawyers construct narratives of victim legitimacy and perceived harm in sexual assault cases

April 2026

View All Publications