Differential diagnosis generators: An evaluation of currently available computer programs

WF Bond; LM Schwartz; KR Weaver; D Levick; M Giuliano; Mark Graber

Differential diagnosis generators

An evaluation of currently available computer programs

Bond, WF., Schwartz, LM., Weaver, KR., Levick, D., Giuliano, M., & Graber, M. (2012). Differential diagnosis generators: An evaluation of currently available computer programs. Journal of General Internal Medicine, 27(2), 213-219. https://doi.org/10.1007/s11606-011-1804-8

Copy citation

Abstract

Background
Differential diagnosis (DDX) generators are computer programs that generate a DDX based on various clinical data.
Objective
We identified evaluation criteria through consensus, applied these criteria to describe the features of DDX generators, and tested performance using cases from the New England Journal of Medicine (NEJM©) and the Medical Knowledge Self Assessment Program (MKSAP©).
Methods
We first identified evaluation criteria by consensus. Then we performed Google® and Pubmed searches to identify DDX generators. To be included, DDX generators had to do the following: generate a list of potential diagnoses rather than text or article references; rank or indicate critical diagnoses that need to be considered or eliminated; accept at least two signs, symptoms or disease characteristics; provide the ability to compare the clinical presentations of diagnoses; and provide diagnoses in general medicine. The evaluation criteria were then applied to the included DDX generators. Lastly, the performance of the DDX generators was tested with findings from 20 test cases. Each case performance was scored one through five, with a score of five indicating presence of the exact diagnosis. Mean scores and confidence intervals were calculated.
Key Results
Twenty three programs were initially identified and four met the inclusion criteria. These four programs were evaluated using the consensus criteria, which included the following: input method; mobile access; filtering and refinement; lab values, medications, and geography as diagnostic factors; evidence based medicine (EBM) content; references; and drug information content source. The mean scores (95% Confidence Interval) from performance testing on a five-point scale were Isabel© 3.45 (2.53, 4.37), DxPlain® 3.45 (2.63–4.27), Diagnosis Pro® 2.65 (1.75–3.55) and PEPID™ 1.70 (0.71–2.69). The number of exact matches paralleled the mean score finding.
Conclusions
Consensus criteria for DDX generator evaluation were developed. Application of these criteria as well as performance testing supports the use of DxPlain® and Isabel© over the other currently available DDX generators.

Publications Info

To contact an RTI author, request a report, or for additional information about publications by our experts, send us your request.

publications@rti.org

RTI shares its evidence-based research - through peer-reviewed publications and media - to ensure that it is accessible for others to build on, in line with our mission and scientific standards.

Recent Publications

Article

Newborn screening for type 1 diabetes using genome-based risk scores in the Early Check program

April 2026

Article

Policing as a Structural Determinant of Health

April 2026

Article

Grocery store workers’ knowledge, attitudes, and barriers influencing uptake of COVID-19 vaccine in the United States: A qualitative study

April 2026

Article

A comparative analysis of pediatric pneumococcal vaccination strategies: A dynamic model of PCV20 vs. PCV15 and PCV13

April 2026

Article

"She clearly thought that something bad had happened to her": How military lawyers construct narratives of victim legitimacy and perceived harm in sexual assault cases

April 2026

Article

The breastfeeding experiences of mother-infant dyads and the effects of an fmr1 mutation

April 2026

Article

Identifying strategies to leverage electronic health records and health information technology in colorectal cancer screening in primary care clinics

April 2026

Article

Initial evaluation of the pulmonary hypertension functional classification self-report (PH-FC-SR) measurement properties: A patient-focused measure

April 2026

View All Publications