Generalisability through local validation: overcoming barriers due to data disparity in healthcare
Author(s)
Mitchell, William G.; Dee, Edward C.; Celi, Leo Anthony G.
Download12886_2021_Article_1992.pdf (465.1Kb)
Publisher with Creative Commons License
Publisher with Creative Commons License
Creative Commons Attribution
Terms of use
Metadata
Show full item recordAbstract
Abstract
Cho et al. report deep learning model accuracy for tilted myopic disc detection in a South Korean population. Here we explore the importance of generalisability of machine learning (ML) in healthcare, and we emphasise that recurrent underrepresentation of data-poor regions may inadvertently perpetuate global health inequity.
Creating meaningful ML systems is contingent on understanding how, when, and why different ML models work in different settings. While we echo the need for the diversification of ML datasets, such a worthy effort would take time and does not obviate uses of presently available datasets if conclusions are validated and re-calibrated for different groups prior to implementation.
The importance of external ML model validation on diverse populations should be highlighted where possible – especially for models built with single-centre data.
Date issued
2021-05-21Department
Massachusetts Institute of Technology. Institute for Medical Engineering & SciencePublisher
BioMed Central
Citation
BMC Ophthalmology. 2021 May 21;21(1):228
Version: Final published version