RTI uses cookies to offer you the best experience online. By clicking “accept” on this website, you opt in and you agree to the use of cookies. If you would like to know more about how RTI uses cookies and how to manage them please view our Privacy Policy here. You can “opt out” or change your mind by visiting: http://optout.aboutads.info/. Click “accept” to agree.
"Friends don't let friends use NSE or KGE for hydrologic model accuracy evaluation
Clark, M. P., Knoben, W. J. M., Spieler, D., Grundemann, G. J., Thebault, C., Vasqueza, N. A., Wood, A. W., Song, Y., Shen, C., Carney, S., & van Werkhoven, K. (2026). Comment on Williams (2025): "Friends don't let friends use NSE or KGE for hydrologic model accuracy evaluation: A rant with data and suggestions for better practice". Environmental Modelling & Software, 197, Article 106869. https://doi.org/10.1016/j.envsoft.2026.106869
Williams (2025), hereafter W25, raises valid concerns about the widespread use of the Nash-Sutcliffe Efficiency (NSE) and Kling-Gupta Efficiency (KGE) metrics in hydrologic model evaluation, arguing that these skill scores confound model accuracy with flow variability and should be replaced by error-based metrics such as the Root Mean Squared Error (RMSE) and the Mean Absolute Error (MAE). While we agree that model evaluation often lacks critical interpretation, we disagree that abandoning skill scores offers a constructive path forward. In this commentary, we discuss three main limitations in the W25 paper. First, we contend that W25 gives little attention to the broader literature on hydrologic model evaluation, leaving its recommendations weakly grounded in existing research. Second, we note that W25's recommendation to replace skill scores with error-based metrics such as RMSE does not resolve the underlying issue: both skill scores and error-based metrics conflate spatial variations in model accuracy with variations in flow variability. Third, we suggest that W25 overlooks the value of NSE and KGE in supporting standardized test environments that enable consistent model comparison. More generally, we argue that the W25 paper points the field in less productive directions for future research-simply replacing NSE and KGE with error-based metrics does not help the community address the core challenges in hydrologic model evaluation.
RTI shares its evidence-based research - through peer-reviewed publications and media - to ensure that it is accessible for others to build on, in line with our mission and scientific standards.