TY - JOUR
T1 - Diagnosis and risk stratification in hypertrophic cardiomyopathy using machine learning wall thickness measurement
T2 - a comparison with human test-retest performance
AU - Augusto, João B.
AU - Davies, Rhodri H.
AU - Bhuva, Anish N.
AU - Knott, Kristopher D.
AU - Seraphim, Andreas
AU - Alfarih, Mashael
AU - Lau, Clement
AU - Hughes, Rebecca K.
AU - Lopes, Luís R.
AU - Shiwani, Hunain
AU - Treibel, Thomas A.
AU - Gerber, Bernhard L.
AU - Hamilton-Craig, Christian
AU - Ntusi, Ntobeko A.B.
AU - Pontone, Gianluca
AU - Desai, Milind Y.
AU - Greenwood, John P.
AU - Swoboda, Peter P.
AU - Captur, Gabriella
AU - Cavalcante, João
AU - Bucciarelli-Ducci, Chiara
AU - Petersen, Steffen E.
AU - Schelbert, Erik
AU - Manisty, Charlotte
AU - Moon, James C.
N1 - Funding Information:
RHD was funded through the CAP-AI programme by a grant from the European Regional Development Fund and Barts Charity. AS declares a doctoral research fellowship from the British Heart Foundation (FS/18/83/34025). LRL is funded by a Medical Research Council UK Clinical Academic Partnership Award. CB-D is in part supported by the National Institute for Health Research (NIHR) Biomedical Research Centre at University Hospitals Bristol NHS Foundation Trust and the University of Bristol; the views expressed in this publication are those of the authors and not necessarily those of the NHS, the NIHR, or the Department of Health and Social Care. JCM and CM are directly and indirectly supported by the University College London Hospitals and Barts Health NIHR Biomedical Research Centres.
Funding Information:
SEP reports personal fees from Circle Cardiovascular Imaging, outside of the submitted work. JC reports research support from Siemens Healthineers and Circle Cardiovascular Imaging, outside of the submitted work. GP declares receiving honorarium as a speaker and research grants from GE Healthcare, Bracco, and Heartflow, outside of the submitted work. MYD is the principal investigator of the VALOR-HCM trial ( NCT04349072 ), sponsored by Myokardia, outside of the submitted work. CB-D is the part-time Chief Executive Officer of the Society for Cardiovascular Magnetic Resonance. All other authors declare no competing interests.
Publisher Copyright:
© 2021 The Author(s). Published by Elsevier Ltd. This is an Open Access article under the CC BY-NC-ND 4.0 license
PY - 2021/1
Y1 - 2021/1
N2 - Background: Left ventricular maximum wall thickness (MWT) is central to diagnosis and risk stratification of hypertrophic cardiomyopathy, but human measurement is prone to variability. We developed an automated machine learning algorithm for MWT measurement and compared precision (reproducibility) with that of 11 international experts, using a dataset of patients with hypertrophic cardiomyopathy. Methods: 60 adult patients with hypertrophic cardiomyopathy, including those carrying hypertrophic cardiomyopathy gene mutations, were recruited at three institutes in the UK from August, 2018, to September, 2019: Barts Heart Centre, University College London Hospital (The Heart Hospital), and Leeds Teaching Hospitals NHS Trust. Participants had two cardiovascular magnetic resonance scans (test and retest) on the same day, ensuring no biological variability, using four cardiac MRI scanner models represented across two manufacturers and two field strengths. End-diastolic short-axis MWT was measured in test and retest by 11 international experts (from nine centres in six countries) and an automated machine learning method, which was trained to segment endocardial and epicardial contours on an independent, multicentre, multidisease dataset of 1923 patients. Machine learning MWT measurement was done with a method based on solving Laplace's equation. To assess test–retest reproducibility, we estimated the absolute test–retest MWT difference (precision), the coefficient of variation (CoV) for duplicate measurements, and the number of patients reclassified between test and retest according to different thresholds (MWT >15 mm and >30 mm). We calculated the sample size required to detect a prespecified MWT change between pairs of scans for machine learning and each expert. Findings: 1440 MWT measurements were analysed, corresponding to two scans from 60 participants by 12 observers (11 experts and machine learning). Experts differed in the MWT they measured, ranging from 14·9 mm (SD 4·2) to 19·0 mm (4·7; p<0·0001 for trend). Machine learning-measured mean MWT was 16·8 mm (4·1). Machine learning precision was superior, with a test–retest difference of 0·7 mm (0·6) compared with experts, who ranged from 1·1 mm (0·9) to 3·7 mm (2·0; p values for machine learning vs expert comparison ranging from <0·0001 to 0·0073) and a significantly lower CoV than for all experts (4·3% [95% CI 3·3–5·1] vs 5·7–12·1% across experts). On average, 38 (64%) patients were designated as having MWT greater than 15 mm by machine learning compared with 27 (45%) to 50 (83%) patients by experts; five (8%) patients were reclassified in test–retest by machine learning compared with four (7%) to 12 (20%) by experts. With a cutoff point of more than 30 mm for implantable cardioverter-defibrillator, three experts would have changed recommendations between tests a total of four times, but machine learning was consistent. Using machine learning, a clinical trial to detect a 2 mm MWT change would need 2·3 times (range 1·6–4·6) fewer patients. Interpretation: In this preliminary study, machine learning MWT measurement in hypertrophic cardiomyopathy is superior to human experts with potential implications for diagnosis, risk stratification, and clinical trials. Funding: European Regional Development Fund and Barts Charity.
AB - Background: Left ventricular maximum wall thickness (MWT) is central to diagnosis and risk stratification of hypertrophic cardiomyopathy, but human measurement is prone to variability. We developed an automated machine learning algorithm for MWT measurement and compared precision (reproducibility) with that of 11 international experts, using a dataset of patients with hypertrophic cardiomyopathy. Methods: 60 adult patients with hypertrophic cardiomyopathy, including those carrying hypertrophic cardiomyopathy gene mutations, were recruited at three institutes in the UK from August, 2018, to September, 2019: Barts Heart Centre, University College London Hospital (The Heart Hospital), and Leeds Teaching Hospitals NHS Trust. Participants had two cardiovascular magnetic resonance scans (test and retest) on the same day, ensuring no biological variability, using four cardiac MRI scanner models represented across two manufacturers and two field strengths. End-diastolic short-axis MWT was measured in test and retest by 11 international experts (from nine centres in six countries) and an automated machine learning method, which was trained to segment endocardial and epicardial contours on an independent, multicentre, multidisease dataset of 1923 patients. Machine learning MWT measurement was done with a method based on solving Laplace's equation. To assess test–retest reproducibility, we estimated the absolute test–retest MWT difference (precision), the coefficient of variation (CoV) for duplicate measurements, and the number of patients reclassified between test and retest according to different thresholds (MWT >15 mm and >30 mm). We calculated the sample size required to detect a prespecified MWT change between pairs of scans for machine learning and each expert. Findings: 1440 MWT measurements were analysed, corresponding to two scans from 60 participants by 12 observers (11 experts and machine learning). Experts differed in the MWT they measured, ranging from 14·9 mm (SD 4·2) to 19·0 mm (4·7; p<0·0001 for trend). Machine learning-measured mean MWT was 16·8 mm (4·1). Machine learning precision was superior, with a test–retest difference of 0·7 mm (0·6) compared with experts, who ranged from 1·1 mm (0·9) to 3·7 mm (2·0; p values for machine learning vs expert comparison ranging from <0·0001 to 0·0073) and a significantly lower CoV than for all experts (4·3% [95% CI 3·3–5·1] vs 5·7–12·1% across experts). On average, 38 (64%) patients were designated as having MWT greater than 15 mm by machine learning compared with 27 (45%) to 50 (83%) patients by experts; five (8%) patients were reclassified in test–retest by machine learning compared with four (7%) to 12 (20%) by experts. With a cutoff point of more than 30 mm for implantable cardioverter-defibrillator, three experts would have changed recommendations between tests a total of four times, but machine learning was consistent. Using machine learning, a clinical trial to detect a 2 mm MWT change would need 2·3 times (range 1·6–4·6) fewer patients. Interpretation: In this preliminary study, machine learning MWT measurement in hypertrophic cardiomyopathy is superior to human experts with potential implications for diagnosis, risk stratification, and clinical trials. Funding: European Regional Development Fund and Barts Charity.
UR - http://www.scopus.com/inward/record.url?scp=85098873583&partnerID=8YFLogxK
U2 - 10.1016/S2589-7500(20)30267-3
DO - 10.1016/S2589-7500(20)30267-3
M3 - Article
C2 - 33735065
AN - SCOPUS:85098873583
SN - 2589-7500
VL - 3
SP - e20-e28
JO - The Lancet Digital Health
JF - The Lancet Digital Health
IS - 1
ER -