Reproducible clusters from microarray research: whither?

Nikhil Garge; Grier Page; AP Sprague; BS Gorman; DB Allison

Reproducible clusters from microarray research

whither?

Garge, N., Page, G., Sprague, AP., Gorman, BS., & Allison, DB. (2005). Reproducible clusters from microarray research: whither? BMC Bioinformatics, 6(Suppl 2), S10. https://doi.org/10.1186/1471-2105-6-S2-S10

Copy citation

Abstract

MOTIVATION : In cluster analysis, the validity of specific solutions, algorithms, and procedures present significant challenges because there is no null hypothesis to test and no 'right answer'. It has been noted that a replicable classification is not necessarily a useful one, but a useful one that characterizes some aspect of the population must be replicable. By replicable we mean reproducible across multiple samplings from the same population. Methodologists have suggested that the validity of clustering methods should be based on classifications that yield reproducible findings beyond chance levels. We used this approach to determine the performance of commonly used clustering algorithms and the degree of replicability achieved using several microarray datasets. METHODS: We considered four commonly used iterative partitioning algorithms (Self Organizing Maps (SOM), K-means, Clutsering LARge Applications (CLARA), and Fuzzy C-means) and evaluated their performances on 37 microarray datasets, with sample sizes ranging from 12 to 172. We assessed reproducibility of the clustering algorithm by measuring the strength of relationship between clustering outputs of subsamples of 37 datasets. Cluster stability was quantified using Cramer's v2 from a kXk table. Cramer's v2 is equivalent to the squared canonical correlation coefficient between two sets of nominal variables. Potential scores range from 0 to 1, with 1 denoting perfect reproducibility. RESULTS: All four clustering routines show increased stability with larger sample sizes. K-means and SOM showed a gradual increase in stability with increasing sample size. CLARA and Fuzzy C-means, however, yielded low stability scores until sample sizes approached 30 and then gradually increased thereafter. Average stability never exceeded 0.55 for the four clustering routines, even at a sample size of 50. These findings suggest several plausible scenarios: (1) microarray datasets lack natural clustering structure thereby producing low stability scores on all four methods; (2) the algorithms studied do not produce reliable results and/or (3) sample sizes typically used in microarray research may be too small to support derivation of reliable clustering results. Further research should be directed towards evaluating stability performances of more clustering algorithms on more datasets specially having larger sample sizes with larger numbers of clusters considered

Publications Info

To contact an RTI author, request a report, or for additional information about publications by our experts, send us your request.

publications@rti.org

RTI shares its evidence-based research - through peer-reviewed publications and media - to ensure that it is accessible for others to build on, in line with our mission and scientific standards.

Meet the Experts

Navigate to Grier Page

Grier Page

Recent Publications

Article

baysc: An R package for Bayesian survey clustering

March 2026

Article

Newborn screening for type 1 diabetes using genome-based risk scores in the Early Check program

March 2026

Article

Uptake of newly licensed influenza vaccine formulations among patients receiving chronic hemodialysis during the 2010/2011 to 2021/2022 influenza seasons

March 2026

Article

Multi-ancestry genome-wide association study and meta-analysis of lung function decline

February 2026

Article

A microsimulation model to assess the cost-effectiveness of physical activity policies among US adults: The physical activity, diabetes, and cardiovascular disease model

February 2026

View All Publications

Reproducible clusters from microarray research

Abstract

Meet the Experts

Grier Page

Recent Publications

baysc: An R package for Bayesian survey clustering

Newborn screening for type 1 diabetes using genome-based risk scores in the Early Check program

Factors influencing wasting in children under 5 in arid regions of Kenya

Test-optional admissions policies and student success metrics

Psychometric evaluation of the weekly version of the PTSD checklist for DSM-5

Uptake of newly licensed influenza vaccine formulations among patients receiving chronic hemodialysis during the 2010/2011 to 2021/2022 influenza seasons

Multi-ancestry genome-wide association study and meta-analysis of lung function decline

A microsimulation model to assess the cost-effectiveness of physical activity policies among US adults: The physical activity, diabetes, and cardiovascular disease model