TY - JOUR
T1 - Artificial intelligence-based decision support for HRCT stratification in fibrotic lung disease
T2 - an international study of 116 observers from 37 countries
AU - Calandriello, Lucio
AU - Mackintosh, John
AU - Felder, Federico
AU - Agrawal , Aditya
AU - Alamoudi, Omer
AU - Alberti, Laura
AU - Aquaro, Giuseppe
AU - Arenas-Jiménez, Juan
AU - Au-Yong, Iain
AU - Avdeev, Sergey
AU - Balbi, Maurizio
AU - Baldi, Bruno
AU - Yu-Lin Ban, Andrea
AU - Belaconi, Ionela-Nicoleta
AU - Bendstrup, Elisabeth
AU - Bennett, David
AU - Blum, Hans-Christian
AU - Bariga, Nicola Boscolo
AU - Bozovic, Gracijela
AU - Bruzzi, John
AU - Broqi, Marcel
AU - Buendia-Roldan, Ivette
AU - Calaras, Diana
AU - Campainha, Sergio
AU - Carbone, Roberto G.
AU - Carvalho, André
AU - Cereser, Lorenzo
AU - Chai, Gin Tsen
AU - Chaudhary, Sachin
AU - Chaudhuri, Nazia
AU - Cheong, Patrick Alain Chui Wan
AU - Cooper, Wendy
AU - Cutaia, Giuseppe
AU - D'Abronzo, Rosa
AU - De Kruif, Martijn D.
AU - Delgado-García, Diemen
AU - Dhooria, Sahajal
AU - Diaz-Castanon, Jesus J
AU - Eiger, Glenn
AU - Ellis, Samantha
AU - Estrada-Y-Martin, Rosa
AU - Fang, Yingying
AU - Morrissey, Brian
PY - 2023/10/27
Y1 - 2023/10/27
N2 - Methods: We evaluated a deep learning algorithm (DL), for classifying HRCT based on ATS/ERS/JRS/ALAT IPF guideline criteria (SOFIA), among an international group of radiologists and pulmonologists. Participants evaluated HRCTs from 203 suspected IPF patients, assigning a likelihood score for each of the guideline-based HRCT categories (each 0-100%, summing to 100%). SOFIA scores were then provided, and participants were given the opportunity to revise their scores. Agreement on (weighted kappa) and prognostic accuracy (Cox regression and C-index) of 1) UIP scores, 2) guideline-based diagnosis and 3) INBUILD categorisation (UIP/probable UIP vs indeterminate/alternative diagnosis – i.e., trial screening mode) were evaluated. Results: 116 participants completed the study, including 20 ILD trained radiologists. The majority opinion of ILD radiologists on each HRCT was used as a diagnostic reference standard. SOFIA improved agreement for UIP probability scores among all participants, excluding the ILD radiologists, (0.67 [IQR 0.57-0.73] vs 0.71 [IQR, 0.65-0.76], p=2.1x10-5) and guideline-based diagnoses (0.50 [IQR 0.43-0.54] vs 0.61 [IQR, 0.56-0.66], p=2.8x10-16) and INBUILD categorisation (0.42 [IQR 0.35-0.47] vs 0.56 [IQR, 0.49-0.62], p=7.1x10-19). Prognostic accuracy for UIP probability scores (mortality) were good for radiologist scoring (n=116, C-index=0.60 [IQR 0.58-0.62]), and these improved with the addition of SOFIA (C-index=0.63 [IQR 0.61-0.65], p=3.6x10-12). Conclusion: In pulmonary fibrosis, DL support may improve accuracy of HRCT diagnoses, provide prognostic information and faciliate screening in clinical trials.
AB - Methods: We evaluated a deep learning algorithm (DL), for classifying HRCT based on ATS/ERS/JRS/ALAT IPF guideline criteria (SOFIA), among an international group of radiologists and pulmonologists. Participants evaluated HRCTs from 203 suspected IPF patients, assigning a likelihood score for each of the guideline-based HRCT categories (each 0-100%, summing to 100%). SOFIA scores were then provided, and participants were given the opportunity to revise their scores. Agreement on (weighted kappa) and prognostic accuracy (Cox regression and C-index) of 1) UIP scores, 2) guideline-based diagnosis and 3) INBUILD categorisation (UIP/probable UIP vs indeterminate/alternative diagnosis – i.e., trial screening mode) were evaluated. Results: 116 participants completed the study, including 20 ILD trained radiologists. The majority opinion of ILD radiologists on each HRCT was used as a diagnostic reference standard. SOFIA improved agreement for UIP probability scores among all participants, excluding the ILD radiologists, (0.67 [IQR 0.57-0.73] vs 0.71 [IQR, 0.65-0.76], p=2.1x10-5) and guideline-based diagnoses (0.50 [IQR 0.43-0.54] vs 0.61 [IQR, 0.56-0.66], p=2.8x10-16) and INBUILD categorisation (0.42 [IQR 0.35-0.47] vs 0.56 [IQR, 0.49-0.62], p=7.1x10-19). Prognostic accuracy for UIP probability scores (mortality) were good for radiologist scoring (n=116, C-index=0.60 [IQR 0.58-0.62]), and these improved with the addition of SOFIA (C-index=0.63 [IQR 0.61-0.65], p=3.6x10-12). Conclusion: In pulmonary fibrosis, DL support may improve accuracy of HRCT diagnoses, provide prognostic information and faciliate screening in clinical trials.
KW - Chronic diseases
KW - DIAGNOSIS
KW - Idiopathic pulmonary fibrosis
U2 - 10.1183/13993003.congress-2023.OA4848
DO - 10.1183/13993003.congress-2023.OA4848
M3 - Abstract
SN - 0903-1936
VL - 62
JO - European Respiratory Journal
JF - European Respiratory Journal
IS - suppl. 67
M1 - OA4848
ER -