TY - JOUR
T1 - Enhancing systematic reviews
T2 - An in-depth analysis on the impact of active learning parameter combinations for biomedical abstract screening
AU - Ofori-Boateng, Regina
AU - Trujillo-Escobar, Tamy Goretty
AU - Aceves-Martins, Magaly
AU - Wiratunga, Nirmalie
AU - Moreno-Garcia, Carlos Francisco
PY - 2024/11
Y1 - 2024/11
N2 - Systematic Review (SR) are foundational to influencing policies and decision-making in healthcare and beyond. SRs thoroughly synthesise primary research on a specific topic while maintaining reproducibility and transparency. However, the rigorous nature of SRs introduces two main challenges: significant time involved and the continuously growing literature, resulting in potential data omission, making most SRs become outmoded even before they are published. As a solution, AI techniques have been leveraged to simplify the SR process, especially the abstract screening phase. Active learning (AL) has emerged as a preferred method among these AI techniques, allowing interactive learning through human input. Several AL software have been proposed for abstract screening. Despite its prowess, how the various parameters involved in AL influence the software's efficacy is still unclear. This research seeks to demystify this by exploring how different AL strategies, such as initial training set, query strategies etc. impact SR automation. Experimental evaluations were conducted on five complex medical SR datasets, and the GLM model was used to interpret the findings statistically. Some AL variables, such as the feature extractor, initial training size, and classifiers, showed notable observations and practical conclusions were drawn within the context of SR and beyond where AL is deployed.
AB - Systematic Review (SR) are foundational to influencing policies and decision-making in healthcare and beyond. SRs thoroughly synthesise primary research on a specific topic while maintaining reproducibility and transparency. However, the rigorous nature of SRs introduces two main challenges: significant time involved and the continuously growing literature, resulting in potential data omission, making most SRs become outmoded even before they are published. As a solution, AI techniques have been leveraged to simplify the SR process, especially the abstract screening phase. Active learning (AL) has emerged as a preferred method among these AI techniques, allowing interactive learning through human input. Several AL software have been proposed for abstract screening. Despite its prowess, how the various parameters involved in AL influence the software's efficacy is still unclear. This research seeks to demystify this by exploring how different AL strategies, such as initial training set, query strategies etc. impact SR automation. Experimental evaluations were conducted on five complex medical SR datasets, and the GLM model was used to interpret the findings statistically. Some AL variables, such as the feature extractor, initial training size, and classifiers, showed notable observations and practical conclusions were drawn within the context of SR and beyond where AL is deployed.
KW - Abstract screening
KW - Active learning
KW - Evidence-based medicine
KW - Human-in-the-loop
KW - Machine learning
KW - Systematic reviews
UR - http://www.scopus.com/inward/record.url?scp=85204958840&partnerID=8YFLogxK
U2 - 10.1016/j.artmed.2024.102989
DO - 10.1016/j.artmed.2024.102989
M3 - Review article
AN - SCOPUS:85204958840
SN - 0933-3657
VL - 157
JO - Artificial Intelligence in Medicine
JF - Artificial Intelligence in Medicine
M1 - 102989
ER -