Machine learning for medical coding in healthcare surveys

Emily Catherine Hadley; Rob Chew; Jason Matthew Nance; Peter Michael Baumgartner; M. Rita Thissen; David M. Plotner; Christine Marie Carr


Hadley, E. C., Chew, R., Nance, J. M., Baumgartner, P. M., Thissen, M. R., Plotner, D. M., Carr, C. M., & National Center for Health Statistics (U.S.) (2021). Machine learning for medical coding in healthcare surveys. Vital and health statistics. Series 2, Data evaluation and methods research, 2021(189), 1-22. https://doi.org/10.15620/cdc:109828


Abstract

Objective
Medical coding, the translation of healthcare information into numeric codes, is expensive and time-intensive. This exploratory study evaluates the use of machine learning classifiers to perform automated medical coding for large statistical healthcare surveys.
Methods
This research used medically coded data from the Emergency Department portion of the 2016 and 2017 National Hospital Ambulatory Medical Care Survey (NHAMCS-ED). Natural language processing classifiers were developed to assign medical codes using verbatim text from patient visits as inputs. The medical codes assigned were codes from the 10th Revision of the International Statistical Classification of Diseases and Related Health Problems, Clinical Modification (ICD-10-CM), truncated to three digits, for diagnoses (DIAG) and cause of injury (CAUSE), as well as the full-length NCHS reason for visit (RFV) classification codes.
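As an illustration of the three-digit truncation described above, the following is a minimal sketch, not the authors' preprocessing. The function name `truncate_icd10cm` and the sample codes are hypothetical, and the sketch assumes that truncation means keeping the first three characters of the code after removing the decimal point.

```python
def truncate_icd10cm(code: str) -> str:
    """Return the first three characters of an ICD-10-CM code.

    Assumption: truncation keeps the leading three characters (the category
    part of the code) and ignores the decimal point and anything after it.
    """
    cleaned = code.strip().upper().replace(".", "")
    return cleaned[:3]


# Hypothetical inputs chosen for illustration only.
for full_code in ["S52.501A", "j45.909", "R05"]:
    print(full_code, "->", truncate_icd10cm(full_code))
```

Truncating this way reduces the number of distinct labels a classifier must learn, at the cost of losing the detail carried by the later characters.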
Results
Of the machine learning models assessed, the best-performing was a multilabel logistic regression. The Jaccard coefficient was used to compare model-to-human agreement with human-to-human agreement on the same set of codes. Human-to-human agreement consistently exceeded model-to-human agreement, though both were highest on diagnosis codes (human-to-human: 0.88, model-to-human: 0.78) and lowest on injury codes (human-to-human: 0.50, model-to-human: 0.28). The model outperformed the human coders on 7.7% of the unique codes assigned by both the model and a human, with strong performance on specific truncated ICD-10-CM diagnosis codes.
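The Jaccard coefficient mentioned above is the size of the intersection of two code sets divided by the size of their union. The sketch below shows one way it could be computed for a pair of coders. The function names, the sample codes, the treatment of two empty code sets, and the simple averaging across visits are all assumptions made for illustration; the abstract does not state how the study aggregated scores.

```python
def jaccard(codes_a, codes_b):
    """Jaccard coefficient between two collections of codes."""
    a, b = set(codes_a), set(codes_b)
    if not a and not b:
        # Assumption: two empty code sets count as full agreement.
        return 1.0
    return len(a & b) / len(a | b)


def mean_jaccard(pairs):
    """Average the per-visit Jaccard coefficient over (coder_1, coder_2) pairs."""
    scores = [jaccard(first, second) for first, second in pairs]
    return sum(scores) / len(scores)


# Hypothetical codes for a single visit, for illustration only.
human_codes = ["J45", "R05", "R06"]
model_codes = ["J45", "R05"]

# Two of the three distinct codes are shared, so the coefficient is 2/3.
print(jaccard(human_codes, model_codes))
```

The same function can score a human against a second human or a model against a human, which is what makes the two kinds of agreement directly comparable.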
Conclusion
This case study demonstrates the potential of machine learning for medical coding in the context of large statistical healthcare surveys. While trained medical coders outperformed the assessed models across the medical coding tasks of assigning correct diagnosis, injury, and RFV codes, machine learning models showed promise in assisting with medical coding projects, particularly if used as an adjunct to human coding.



