Machine-learning-based classification of research grant award records

Christina A. Freyman; John J. Byrnes; Jeffrey Alexander

Machine-learning-based classification of research grant award records

Freyman, C. A., Byrnes, J. J., & Alexander, J. (2016). Machine-learning-based classification of research grant award records. Research Evaluation, 25(4), 442-450. https://doi.org/10.1093/reseval/rvw016

Copy citation

Abstract

Policy makers frequently ask agencies to report how much money they are spending on research and development activities in specific fields or topics; however, records are rarely classified in ways that will inform policy and budget decisions. This work explores how topic co-clustering, an approach to text analysis based on machine learning, might be used to tag National Science Foundation (NSF) grant awards automatically with terms referring to scientific disciplines or to socioeconomic objectives. This approach is an alternative approach to the Latent Dirichlet Allocation topic model produced by the NSF for an experimental Portfolio Explorer (Nichols 2014). We use metadata in the grant records to validate the results, and do not access the metadata as part of the automated tagging process. The results show that in the case of scientific disciplines, where our language models were well-formed and we had a valid comparison set for manual classification, the machine-assigned tags were a reasonable and valid means for describing the research conducted under each grant. In assigning socioeconomic objectives to grants, we saw relatively poor precision and recall in classification, due to the poorly formed and sparse language models available for those terms. Our analysis suggests that this approach can be used to classify large corpora of scientific awards into desired categories, which may be of use for monitoring R&D trends and for identifying portfolios of grant projects for evaluation.

Publications Info

To contact an RTI author, request a report, or for additional information about publications by our experts, send us your request.

publications@rti.org

RTI shares its evidence-based research - through peer-reviewed publications and media - to ensure that it is accessible for others to build on, in line with our mission and scientific standards.

Meet the Experts

Navigate to Jeffrey M. Alexander

Jeffrey M. Alexander

Recent Publications

Article

Factors influencing wasting in children under 5 in arid regions of Kenya

March 2026

Article

Psychometric evaluation of the weekly version of the PTSD checklist for DSM-5

March 2026

Article

Uptake of newly licensed influenza vaccine formulations among patients receiving chronic hemodialysis during the 2010/2011 to 2021/2022 influenza seasons

March 2026

Article

Multi-ancestry genome-wide association study and meta-analysis of lung function decline

February 2026

Article

A microsimulation model to assess the cost-effectiveness of physical activity policies among US adults: The physical activity, diabetes, and cardiovascular disease model

February 2026

Article

Estimating lifetime drinking trajectories for alcohol use from adolescence to older adulthood in the United States: A three-step approach

February 2026

Article

Challenging behaviors across COVID-19 in young children with rare neurogenetic conditions: A seven-year, cross-syndrome analysis

February 2026

Article

Hypocretin receptor 1 blockade early in abstinence reduces future demand for cocaine

February 2026

View All Publications