Predicting incident dementia in cerebral small vessel disease: comparison of machine learning and traditional statistical models

Rui Li; Eric L. Harshfield; Steven Bell; Michael Burkhart; Anil M. Tuladhar; Saima Hilal; Daniel J. Tozer; Francesca M. Chappell; Stephen D.J. Makin; Jessica W. Lo; Joanna M. Wardlaw; Frank Erik de Leeuw; Christopher Chen; Zoe Kourtzi; Hugh S. Markus

doi:10.1016/j.cccb.2023.100179

Predicting incident dementia in cerebral small vessel disease: comparison of machine learning and traditional statistical models

Rui Li^*, Eric L. Harshfield, Steven Bell, Michael Burkhart, Anil M. Tuladhar, Saima Hilal, Daniel J. Tozer, Francesca M. Chappell, Stephen D.J. Makin, Jessica W. Lo, Joanna M. Wardlaw, Frank Erik de Leeuw, Christopher Chen, Zoe Kourtzi, Hugh S. Markus

^*Corresponding author for this work

Applied Health Sciences

Research output: Contribution to journal › Article › peer-review

1 Citation (Scopus)

Abstract

Background: Cerebral small vessel disease (SVD) contributes to 45% of dementia cases worldwide, yet we lack a reliable model for predicting dementia in SVD. Past attempts largely relied on traditional statistical approaches. Here, we investigated whether machine learning (ML) methods improved prediction of incident dementia in SVD from baseline SVD-related features over traditional statistical methods. Methods: We included three cohorts with varying SVD severity (RUN DMC, n = 503; SCANS, n = 121; HARMONISATION, n = 265). Baseline demographics, vascular risk factors, cognitive scores, and magnetic resonance imaging (MRI) features of SVD were used for prediction. We conducted both survival analysis and classification analysis predicting 3-year dementia risk. For each analysis, several ML methods were evaluated against standard Cox or logistic regression. Finally, we compared the feature importance ranked by different models. Results: We included 789 participants without missing data in the survival analysis, amongst whom 108 (13.7%) developed dementia during a median follow-up of 5.4 years. Excluding those censored before three years, we included 750 participants in the classification analysis, amongst whom 48 (6.4%) developed dementia by year 3. Comparing statistical and ML models, only regularised Cox/logistic regression outperformed their statistical counterparts overall, but not significantly so in survival analysis. Baseline cognition was highly predictive, and global cognition was the most important feature. Conclusions: When using baseline SVD-related features to predict dementia in SVD, the ML survival or classification models we evaluated brought little improvement over traditional statistical approaches. The benefits of ML should be evaluated with caution, especially given limited sample size and features.

Original language	English
Article number	100179
Number of pages	8
Journal	Cerebral Circulation - Cognition and Behavior
Volume	5
Early online date	11 Aug 2023
DOIs	https://doi.org/10.1016/j.cccb.2023.100179
Publication status	Published - 2023

Bibliographical note

Funding Information:
The authors disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This research was funded by a British Heart Foundation (BHF) programme grant [grant number RG/F/22/110052 ] and infrastructural support was provided by the Cambridge British Heart Foundation Centre of Research Excellence [grant number RE/18/1/34212 ] and the Cambridge University Hospitals NIHR Biomedical Research Centre [grant number BRC-1215–20014 ].

Funding Information:
HSM is supported by an NIHR Senior Investigator Award, and a number of peer reviewed funders including Medical Research Council, EU, Alzheimer's Society, Stroke Association, BHF. The views expressed are those of the authors and not necessarily those of the NIHR or the Department of Health and Social Care. RL is supported by a PhD scholarship awarded by Trinity College, University of Cambridge. ELH is supported by Cambridge BHF Centre of Research Excellence [grant number RE/18/1/34212 ]; Alzheimer's Society [grant number AS-RF-21–017 ]; BHF programme grant [grant number RG/F/22/110052 ]; Cambridge NIHR Biomedical Research Centre [grant number BRC-1215–20014 ]. SB is supported by BHF . AMT is supported by Dutch Heart Foundation [grant number 2016T044 ]. Wellcome Trust [grant number 081589 ] provided initial funding for SCANS study. SDJM is supported by Wellcome Trust [grant number WT088134/Z/09/A ]. JMW is supported by Wellcome Trust , Row Fogo Trust , and Medical Research Council . CC is supported by National Medical Research Council of Singapore. ZK is supported by Wellcome Trust and Alan Turing Institute.

Keywords

Cerebral small vessel disease
Dementia
Machine learning
Prediction

Access to Document

10.1016/j.cccb.2023.100179Licence: CC BY

Li_etal_CCCB_Predicting_incident_Dementia_VoR
© 2023 The Authors. Published by Elsevier B.V. This is an open access article under the CC BY license (http://creativecommons.org/licenses/
Final published version, 2.08 MBLicence: CC BY

Cite this

Li, R., Harshfield, E. L., Bell, S., Burkhart, M., Tuladhar, A. M., Hilal, S., Tozer, D. J., Chappell, F. M., Makin, S. D. J., Lo, J. W., Wardlaw, J. M., de Leeuw, F. E., Chen, C., Kourtzi, Z., & Markus, H. S. (2023). Predicting incident dementia in cerebral small vessel disease: comparison of machine learning and traditional statistical models. Cerebral Circulation - Cognition and Behavior, 5, Article 100179. https://doi.org/10.1016/j.cccb.2023.100179

Li, R, Harshfield, EL, Bell, S, Burkhart, M, Tuladhar, AM, Hilal, S, Tozer, DJ, Chappell, FM, Makin, SDJ, Lo, JW, Wardlaw, JM, de Leeuw, FE, Chen, C, Kourtzi, Z & Markus, HS 2023, 'Predicting incident dementia in cerebral small vessel disease: comparison of machine learning and traditional statistical models', Cerebral Circulation - Cognition and Behavior, vol. 5, 100179. https://doi.org/10.1016/j.cccb.2023.100179

@article{85ad18ca3b114bc29fec6a28af05b5a9,

title = "Predicting incident dementia in cerebral small vessel disease: comparison of machine learning and traditional statistical models",

abstract = "Background: Cerebral small vessel disease (SVD) contributes to 45% of dementia cases worldwide, yet we lack a reliable model for predicting dementia in SVD. Past attempts largely relied on traditional statistical approaches. Here, we investigated whether machine learning (ML) methods improved prediction of incident dementia in SVD from baseline SVD-related features over traditional statistical methods. Methods: We included three cohorts with varying SVD severity (RUN DMC, n = 503; SCANS, n = 121; HARMONISATION, n = 265). Baseline demographics, vascular risk factors, cognitive scores, and magnetic resonance imaging (MRI) features of SVD were used for prediction. We conducted both survival analysis and classification analysis predicting 3-year dementia risk. For each analysis, several ML methods were evaluated against standard Cox or logistic regression. Finally, we compared the feature importance ranked by different models. Results: We included 789 participants without missing data in the survival analysis, amongst whom 108 (13.7%) developed dementia during a median follow-up of 5.4 years. Excluding those censored before three years, we included 750 participants in the classification analysis, amongst whom 48 (6.4%) developed dementia by year 3. Comparing statistical and ML models, only regularised Cox/logistic regression outperformed their statistical counterparts overall, but not significantly so in survival analysis. Baseline cognition was highly predictive, and global cognition was the most important feature. Conclusions: When using baseline SVD-related features to predict dementia in SVD, the ML survival or classification models we evaluated brought little improvement over traditional statistical approaches. The benefits of ML should be evaluated with caution, especially given limited sample size and features.",

keywords = "Cerebral small vessel disease, Dementia, Machine learning, Prediction",

author = "Rui Li and Harshfield, {Eric L.} and Steven Bell and Michael Burkhart and Tuladhar, {Anil M.} and Saima Hilal and Tozer, {Daniel J.} and Chappell, {Francesca M.} and Makin, {Stephen D.J.} and Lo, {Jessica W.} and Wardlaw, {Joanna M.} and {de Leeuw}, {Frank Erik} and Christopher Chen and Zoe Kourtzi and Markus, {Hugh S.}",

note = "Funding Information: The authors disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This research was funded by a British Heart Foundation (BHF) programme grant [grant number RG/F/22/110052 ] and infrastructural support was provided by the Cambridge British Heart Foundation Centre of Research Excellence [grant number RE/18/1/34212 ] and the Cambridge University Hospitals NIHR Biomedical Research Centre [grant number BRC-1215–20014 ]. Funding Information: HSM is supported by an NIHR Senior Investigator Award, and a number of peer reviewed funders including Medical Research Council, EU, Alzheimer's Society, Stroke Association, BHF. The views expressed are those of the authors and not necessarily those of the NIHR or the Department of Health and Social Care. RL is supported by a PhD scholarship awarded by Trinity College, University of Cambridge. ELH is supported by Cambridge BHF Centre of Research Excellence [grant number RE/18/1/34212 ]; Alzheimer's Society [grant number AS-RF-21–017 ]; BHF programme grant [grant number RG/F/22/110052 ]; Cambridge NIHR Biomedical Research Centre [grant number BRC-1215–20014 ]. SB is supported by BHF . AMT is supported by Dutch Heart Foundation [grant number 2016T044 ]. Wellcome Trust [grant number 081589 ] provided initial funding for SCANS study. SDJM is supported by Wellcome Trust [grant number WT088134/Z/09/A ]. JMW is supported by Wellcome Trust , Row Fogo Trust , and Medical Research Council . CC is supported by National Medical Research Council of Singapore. ZK is supported by Wellcome Trust and Alan Turing Institute. ",

year = "2023",

doi = "10.1016/j.cccb.2023.100179",

language = "English",

volume = "5",

journal = "Cerebral Circulation - Cognition and Behavior",

issn = "2666-2450",

publisher = "Elsevier",

}

TY - JOUR

T1 - Predicting incident dementia in cerebral small vessel disease

T2 - comparison of machine learning and traditional statistical models

AU - Li, Rui

AU - Harshfield, Eric L.

AU - Bell, Steven

AU - Burkhart, Michael

AU - Tuladhar, Anil M.

AU - Hilal, Saima

AU - Tozer, Daniel J.

AU - Chappell, Francesca M.

AU - Makin, Stephen D.J.

AU - Lo, Jessica W.

AU - Wardlaw, Joanna M.

AU - de Leeuw, Frank Erik

AU - Chen, Christopher

AU - Kourtzi, Zoe

AU - Markus, Hugh S.

N1 - Funding Information: The authors disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This research was funded by a British Heart Foundation (BHF) programme grant [grant number RG/F/22/110052 ] and infrastructural support was provided by the Cambridge British Heart Foundation Centre of Research Excellence [grant number RE/18/1/34212 ] and the Cambridge University Hospitals NIHR Biomedical Research Centre [grant number BRC-1215–20014 ]. Funding Information: HSM is supported by an NIHR Senior Investigator Award, and a number of peer reviewed funders including Medical Research Council, EU, Alzheimer's Society, Stroke Association, BHF. The views expressed are those of the authors and not necessarily those of the NIHR or the Department of Health and Social Care. RL is supported by a PhD scholarship awarded by Trinity College, University of Cambridge. ELH is supported by Cambridge BHF Centre of Research Excellence [grant number RE/18/1/34212 ]; Alzheimer's Society [grant number AS-RF-21–017 ]; BHF programme grant [grant number RG/F/22/110052 ]; Cambridge NIHR Biomedical Research Centre [grant number BRC-1215–20014 ]. SB is supported by BHF . AMT is supported by Dutch Heart Foundation [grant number 2016T044 ]. Wellcome Trust [grant number 081589 ] provided initial funding for SCANS study. SDJM is supported by Wellcome Trust [grant number WT088134/Z/09/A ]. JMW is supported by Wellcome Trust , Row Fogo Trust , and Medical Research Council . CC is supported by National Medical Research Council of Singapore. ZK is supported by Wellcome Trust and Alan Turing Institute.

PY - 2023

Y1 - 2023

N2 - Background: Cerebral small vessel disease (SVD) contributes to 45% of dementia cases worldwide, yet we lack a reliable model for predicting dementia in SVD. Past attempts largely relied on traditional statistical approaches. Here, we investigated whether machine learning (ML) methods improved prediction of incident dementia in SVD from baseline SVD-related features over traditional statistical methods. Methods: We included three cohorts with varying SVD severity (RUN DMC, n = 503; SCANS, n = 121; HARMONISATION, n = 265). Baseline demographics, vascular risk factors, cognitive scores, and magnetic resonance imaging (MRI) features of SVD were used for prediction. We conducted both survival analysis and classification analysis predicting 3-year dementia risk. For each analysis, several ML methods were evaluated against standard Cox or logistic regression. Finally, we compared the feature importance ranked by different models. Results: We included 789 participants without missing data in the survival analysis, amongst whom 108 (13.7%) developed dementia during a median follow-up of 5.4 years. Excluding those censored before three years, we included 750 participants in the classification analysis, amongst whom 48 (6.4%) developed dementia by year 3. Comparing statistical and ML models, only regularised Cox/logistic regression outperformed their statistical counterparts overall, but not significantly so in survival analysis. Baseline cognition was highly predictive, and global cognition was the most important feature. Conclusions: When using baseline SVD-related features to predict dementia in SVD, the ML survival or classification models we evaluated brought little improvement over traditional statistical approaches. The benefits of ML should be evaluated with caution, especially given limited sample size and features.

AB - Background: Cerebral small vessel disease (SVD) contributes to 45% of dementia cases worldwide, yet we lack a reliable model for predicting dementia in SVD. Past attempts largely relied on traditional statistical approaches. Here, we investigated whether machine learning (ML) methods improved prediction of incident dementia in SVD from baseline SVD-related features over traditional statistical methods. Methods: We included three cohorts with varying SVD severity (RUN DMC, n = 503; SCANS, n = 121; HARMONISATION, n = 265). Baseline demographics, vascular risk factors, cognitive scores, and magnetic resonance imaging (MRI) features of SVD were used for prediction. We conducted both survival analysis and classification analysis predicting 3-year dementia risk. For each analysis, several ML methods were evaluated against standard Cox or logistic regression. Finally, we compared the feature importance ranked by different models. Results: We included 789 participants without missing data in the survival analysis, amongst whom 108 (13.7%) developed dementia during a median follow-up of 5.4 years. Excluding those censored before three years, we included 750 participants in the classification analysis, amongst whom 48 (6.4%) developed dementia by year 3. Comparing statistical and ML models, only regularised Cox/logistic regression outperformed their statistical counterparts overall, but not significantly so in survival analysis. Baseline cognition was highly predictive, and global cognition was the most important feature. Conclusions: When using baseline SVD-related features to predict dementia in SVD, the ML survival or classification models we evaluated brought little improvement over traditional statistical approaches. The benefits of ML should be evaluated with caution, especially given limited sample size and features.

KW - Cerebral small vessel disease

KW - Dementia

KW - Machine learning

KW - Prediction

UR - http://www.scopus.com/inward/record.url?scp=85167790782&partnerID=8YFLogxK

U2 - 10.1016/j.cccb.2023.100179

DO - 10.1016/j.cccb.2023.100179

M3 - Article

C2 - 37593075

AN - SCOPUS:85167790782

SN - 2666-2450

VL - 5

JO - Cerebral Circulation - Cognition and Behavior

JF - Cerebral Circulation - Cognition and Behavior

M1 - 100179

ER -

Predicting incident dementia in cerebral small vessel disease: comparison of machine learning and traditional statistical models

Abstract

Bibliographical note

Keywords

Access to Document

Other files and links

Fingerprint

Cite this