Generalisability through local validation: overcoming barriers due to data disparity in healthcare

Mitchell, William G.; Dee, Edward C.; Celi, Leo Anthony G.

Author(s)

Mitchell, William G.; Dee, Edward C.; Celi, Leo Anthony G.

Download12886_2021_Article_1992.pdf (465.1Kb)

Publisher with Creative Commons License

Terms of use

Creative Commons Attribution https://creativecommons.org/licenses/by/4.0/

Metadata

Show full item record

Abstract

Abstract Cho et al. report deep learning model accuracy for tilted myopic disc detection in a South Korean population. Here we explore the importance of generalisability of machine learning (ML) in healthcare, and we emphasise that recurrent underrepresentation of data-poor regions may inadvertently perpetuate global health inequity. Creating meaningful ML systems is contingent on understanding how, when, and why different ML models work in different settings. While we echo the need for the diversification of ML datasets, such a worthy effort would take time and does not obviate uses of presently available datasets if conclusions are validated and re-calibrated for different groups prior to implementation. The importance of external ML model validation on diverse populations should be highlighted where possible – especially for models built with single-centre data.

Date issued

2021-05-21

URI

https://hdl.handle.net/1721.1/136795.2

Department

Massachusetts Institute of Technology. Institute for Medical Engineering & Science

Publisher

BioMed Central

Citation

BMC Ophthalmology. 2021 May 21;21(1):228

Version: Final published version

Collections

MIT Open Access Articles

Version	Item	Date	Summary
2	1721.1/136795.2*	2021-12-02T15:02:35Z	Authority information verified/added.
1	1721.1/136795	2021-11-01T14:33:26Z

MIT Libraries homeDSpace@MIT