Simultaneous Edit-Imputation for Continuous Microdata

HJ Kim; LH Cox; Alan Karr; JP Reiter; QL Wang

Simultaneous Edit-Imputation for Continuous Microdata

Kim, HJ., Cox, LH., Karr, A., Reiter, JP., & Wang, QL. (2015). Simultaneous Edit-Imputation for Continuous Microdata. Journal of the American Statistical Association, 110(511), 987-999. https://doi.org/10.1080/01621459.2015.1040881

Copy citation

Abstract

Many statistical organizations collect data that are expected to satisfy linear constraints; as examples, component variables should sum to total variables, and ratios of pairs of variables should be bounded by expert-specified constants. When reported data violate constraints, organizations identify and replace values potentially in error in a process known as edit-imputation. To date, most approaches separate the error localization and imputation steps, typically using optimization methods to identify the variables to change followed by hot deck imputation. We present an approach. that fully integrates editing and imputation for continuous microdata under linear constraints. Our approach relies on a Bayesian hierarchical model that includes (i) a flexible joint probability model for the underlying true values of the data with support only on the set of values that satisfy all editing constraints, (ii) a model for latent indicators of the variables that are in error, and (iii) a model for the reported responses for variables in error. We illustrate the potential advantages of the Bayesian editing approach over existing approaches using simulation studies. We apply the model to edit faulty data from the 2007 U.S. Census of Manufactures. Supplementary materials for this article are available online.

Publications Info

To contact an RTI author, request a report, or for additional information about publications by our experts, send us your request.

publications@rti.org

RTI shares its evidence-based research - through peer-reviewed publications and media - to ensure that it is accessible for others to build on, in line with our mission and scientific standards.

Recent Publications

Article

Factors influencing wasting in children under 5 in arid regions of Kenya

March 2026

Article

Psychometric evaluation of the weekly version of the PTSD checklist for DSM-5

March 2026

Article

Uptake of newly licensed influenza vaccine formulations among patients receiving chronic hemodialysis during the 2010/2011 to 2021/2022 influenza seasons

March 2026

Article

Health care providers' perceptions of screening for risk of type 1 diabetes in newborns using genetic risk scores

February 2026

Article

Housing age and sociodemographic characteristics as predictors of residential lead exposure and modeled child blood lead levels

February 2026

Article

Systematic examination of gene expression and proteomic evidence across tissues supports the role of mitochondrial dysregulation in me/cfs

February 2026

Article

The mortality and economic benefits of achieving air pollution standards in India

February 2026

Article

EpiSmokEr2: A robust epigenetic classifier for smoking status inference using Illumina EPIC methylation data

February 2026

View All Publications