Automated classification of depression from structural brain measures across two independent community‐based cohorts

Aleks Stolicyn; Mathew A. Harris; Xueyi Shen; Miruna C. Barbu; Mark J. Adams; Emma L. Hawkins; Laura de Nooij; Hon Wah Yeung; Alison D Murray; Stephen M. Lawrie; J. Douglas Steele; Andrew M. McIntosh; Heather C. Whalley

doi:10.1002/hbm.25095

Aleks Stolicyn^* (Corresponding Author), Mathew A. Harris, Xueyi Shen, Miruna C. Barbu, Mark J. Adams, Emma L. Hawkins, Laura de Nooij, Hon Wah Yeung, Alison D Murray, Stephen M. Lawrie, J. Douglas Steele, Andrew M. McIntosh, Heather C. Whalley

^*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

Abstract

Major depressive disorder (MDD) has been the subject of many neuroimaging case–control classification studies. Although some studies report accuracies ≥80%, most have investigated relatively small samples of clinically‐ascertained, currently symptomatic cases, and did not attempt replication in larger samples. We first aimed to replicate previously reported classification accuracies in a small, well‐phenotyped community‐based group of current MDD cases with clinical interview‐based diagnoses (from the STratifying Resilience and Depression Longitudinally cohort, ‘STRADL’). We performed a set of exploratory predictive classification analyses with measures related to brain morphometry and white matter integrity. We applied three classifier types (support vector machine [SVM], penalised logistic regression, or decision tree), each with or without optimisation, and with or without feature selection. We then determined whether similar accuracies could be replicated in a larger independent population‐based sample with self‐reported current depression (UK Biobank cohort). Additional analyses extended to lifetime MDD diagnoses: remitted MDD in STRADL, and lifetime‐experienced MDD in UK Biobank. The highest cross‐validation accuracy (75%) was achieved in the initial current MDD sample with a decision tree classifier and cortical surface area features. The most frequently selected decision tree split variables included surface areas of bilateral caudal anterior cingulate, left lingual gyrus, left superior frontal, right precentral and paracentral regions. High accuracy was not achieved in the larger samples with self‐reported current depression (53.73%), with remitted MDD (57.48%), or with lifetime‐experienced MDD (52.68–60.29%). Our results indicate that high predictive classification accuracies may not immediately translate to larger samples with broader criteria for depression, and may not be robust across different classification approaches.
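To make the notion of cross‐validation accuracy in the abstract concrete, the following is a minimal sketch of stratified k‐fold cross‐validation around a very simple tree classifier. It is not the authors' pipeline: the data are synthetic, a depth‐1 decision stump stands in for the decision tree, there is no optimisation or feature selection step, and every function name and parameter (make_synthetic, fit_stump, stratified_folds, cross_val_accuracy, effect, k) is a hypothetical choice made for illustration only.

```python
import random


def make_synthetic(n_per_class, n_features, effect, seed=0):
    """Toy data (assumption): class 1 has feature 0 shifted by `effect`."""
    rng = random.Random(seed)
    X, y = [], []
    for label in (0, 1):
        for _ in range(n_per_class):
            row = [rng.gauss(0.0, 1.0) for _ in range(n_features)]
            row[0] += effect * label
            X.append(row)
            y.append(label)
    return X, y


def predict_stump(stump, row):
    """Predict `polarity` when the chosen feature exceeds the threshold."""
    feature, threshold, polarity = stump
    return polarity if row[feature] > threshold else 1 - polarity


def fit_stump(X, y):
    """Depth-1 tree: pick the split with the best training accuracy."""
    best_acc, best_stump = -1.0, None
    for feature in range(len(X[0])):
        for threshold in sorted({row[feature] for row in X}):
            for polarity in (0, 1):
                stump = (feature, threshold, polarity)
                hits = sum(predict_stump(stump, r) == lab for r, lab in zip(X, y))
                acc = hits / len(y)
                if acc > best_acc:
                    best_acc, best_stump = acc, stump
    return best_stump


def stratified_folds(y, k, seed=0):
    """Split indices into k folds with similar class proportions."""
    rng = random.Random(seed)
    folds = [[] for _ in range(k)]
    for label in sorted(set(y)):
        idx = [i for i, lab in enumerate(y) if lab == label]
        rng.shuffle(idx)
        for pos, i in enumerate(idx):
            folds[pos % k].append(i)
    return folds


def cross_val_accuracy(X, y, k=5, seed=0):
    """Fit on k-1 folds, score on the held-out fold, pool over all folds."""
    correct = 0
    for test_idx in stratified_folds(y, k, seed):
        held_out = set(test_idx)
        train_X = [X[i] for i in range(len(X)) if i not in held_out]
        train_y = [y[i] for i in range(len(y)) if i not in held_out]
        stump = fit_stump(train_X, train_y)
        correct += sum(predict_stump(stump, X[i]) == y[i] for i in test_idx)
    return correct / len(y)


X, y = make_synthetic(n_per_class=20, n_features=5, effect=1.0)
accuracy = cross_val_accuracy(X, y, k=5)
print(f"cross-validated accuracy: {accuracy:.2f}")
```

The key design point is that the split is chosen using the training folds only, so each held‐out case is scored by a model that never saw it. Rerunning the sketch with different values of seed or n_per_class gives a rough feel for how much such an estimate can move in small samples.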

Original language	English
Pages (from-to)	3922-3937
Number of pages	16
Journal	Human Brain Mapping
Volume	41
Issue number	14
Early online date	19 Jun 2020
DOIs	https://doi.org/10.1002/hbm.25095
Publication status	Published - 1 Oct 2020

Bibliographical note

ACKNOWLEDGEMENTS:
This study was supported and funded by the Wellcome Trust Strategic Award ‘Stratifying Resilience and Depression Longitudinally’ (STRADL) (Reference 104036/Z/14/Z), and the Medical Research Council Mental Health Pathfinder Award ‘Leveraging routinely collected and linked research data to study the causes and consequences of common mental disorders’ (Reference MRC-MC_PC_17209). MAH is supported by research funding from the Dr Mortimer and Theresa Sackler Foundation. The research was conducted using the UK Biobank resource, with application number 4844. Structural brain imaging data from the UK Biobank was processed at the University of Edinburgh Centre for Cognitive Ageing and Cognitive Epidemiology (CCACE; http://www.ccace.ed.ac.uk/), which is part of the cross-council Lifelong Health and Wellbeing Initiative (MR/K026992/1). CCACE received funding from the Biotechnology and Biological Sciences Research Council (BBSRC) and the Medical Research Council (MRC), and was also supported by Age UK as part of The Disconnected Mind project. This work has made use of the resources provided by the Edinburgh Compute and Data Facility (ECDF) (http://www.ecdf.ed.ac.uk/).

Keywords

brain structure
classical twin
depression
diffusion MRI
machine learning
major depressive disorder
structural MRI
METAANALYSIS
GRAY-MATTER VOLUME
ABNORMALITIES
MRI
SCANS
ALGORITHM
classification
BIPOLAR DISORDER
FEATURES
INDIVIDUALS
SELECTION

Access to Document

10.1002/hbm.25095 (Licence: CC BY)

Stolicyn_et_al_HBM_AutomatedClassificationOf_VOr
This is an open access article under the terms of the Creative Commons Attribution License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited. © 2020 The Authors. Human Brain Mapping published by Wiley Periodicals, Inc. https://creativecommons.org/licenses/by/4.0/
Final published version, 1.8 MB (Licence: CC BY)

Cite this

Stolicyn, A., Harris, M. A., Shen, X., Barbu, M. C., Adams, M. J., Hawkins, E. L., de Nooij, L., Yeung, H. W., Murray, A. D., Lawrie, S. M., Steele, J. D., McIntosh, A. M., & Whalley, H. C. (2020). Automated classification of depression from structural brain measures across two independent community‐based cohorts. Human Brain Mapping, 41(14), 3922-3937. https://doi.org/10.1002/hbm.25095

Stolicyn, A, Harris, MA, Shen, X, Barbu, MC, Adams, MJ, Hawkins, EL, de Nooij, L, Yeung, HW, Murray, AD, Lawrie, SM, Steele, JD, McIntosh, AM & Whalley, HC 2020, 'Automated classification of depression from structural brain measures across two independent community‐based cohorts', Human Brain Mapping, vol. 41, no. 14, pp. 3922-3937. https://doi.org/10.1002/hbm.25095

@article{4d5b9736218a410db637ee1dc30a91b6,

title = "Automated classification of depression from structural brain measures across two independent community‐based cohorts",

keywords = "brain structure, classical twin, depression, diffusion MRI, machine learning, major depressive disorder, structural MRI, METAANALYSIS, GRAY-MATTER VOLUME, ABNORMALITIES, MRI, SCANS, ALGORITHM, classification, BIPOLAR DISORDER, FEATURES, INDIVIDUALS, SELECTION",

author = "Aleks Stolicyn and Harris, {Mathew A.} and Xueyi Shen and Barbu, {Miruna C.} and Adams, {Mark J.} and Hawkins, {Emma L.} and {de Nooij}, Laura and Yeung, {Hon Wah} and Murray, {Alison D} and Lawrie, {Stephen M.} and Steele, {J. Douglas} and McIntosh, {Andrew M.} and Whalley, {Heather C.}",

year = "2020",

month = oct,

day = "1",

doi = "10.1002/hbm.25095",

language = "English",

volume = "41",

pages = "3922--3937",

journal = "Human Brain Mapping",

issn = "1065-9471",

publisher = "Wiley-Liss Inc.",

number = "14",

}

TY - JOUR

T1 - Automated classification of depression from structural brain measures across two independent community‐based cohorts

AU - Stolicyn, Aleks

AU - Harris, Mathew A.

AU - Shen, Xueyi

AU - Barbu, Miruna C.

AU - Adams, Mark J.

AU - Hawkins, Emma L.

AU - de Nooij, Laura

AU - Yeung, Hon Wah

AU - Murray, Alison D

AU - Lawrie, Stephen M.

AU - Steele, J. Douglas

AU - McIntosh, Andrew M.

AU - Whalley, Heather C.

PY - 2020/10/1

Y1 - 2020/10/1

KW - brain structure

KW - classical twin

KW - depression

KW - diffusion MRI

KW - machine learning

KW - major depressive disorder

KW - structural MRI

KW - METAANALYSIS

KW - GRAY-MATTER VOLUME

KW - ABNORMALITIES

KW - MRI

KW - SCANS

KW - ALGORITHM

KW - classification

KW - BIPOLAR DISORDER

KW - FEATURES

KW - INDIVIDUALS

KW - SELECTION

UR - http://www.scopus.com/inward/record.url?scp=85087179913&partnerID=8YFLogxK

U2 - 10.1002/hbm.25095

DO - 10.1002/hbm.25095

M3 - Article

SN - 1065-9471

VL - 41

SP - 3922

EP - 3937

JO - Human Brain Mapping

JF - Human Brain Mapping

IS - 14

ER -
