A novel application of machine learning and zero-shot classification methods for automated abstract screening in systematic reviews

Carlos Francisco Moreno-Garcia; Chrisina Jayne; Eyad Elyan; Magaly Aceves Martins

doi:10.1016/j.dajour.2023.100162

A novel application of machine learning and zero-shot classification methods for automated abstract screening in systematic reviews

Carlos Francisco Moreno-Garcia^* (Corresponding Author), Chrisina Jayne, Eyad Elyan, Magaly Aceves Martins

^*Corresponding author for this work

The Rowett Institute of Nutrition and Health

Research output: Contribution to journal › Article › peer-review

8 Citations (Scopus)

3 Downloads (Pure)

Abstract

Zero-shot classification refers to assigning a label to a text (sentence, paragraph, whole paper) without prior training. This is possible by teaching the system how to codify a question and find its answer in the text. In many domains, especially health sciences, systematic reviews are evidence-based syntheses of information related to a specific topic. Producing them is demanding and time-consuming in terms of collecting, filtering, evaluating and synthesising large volumes of literature, which require significant effort performed by experts. One of its most demanding steps is abstract screening, which requires scientists to sift through various abstracts of relevant papers and include or exclude papers based on pre-established criteria. This process is time-consuming and subjective and requires a consensus between scientists, which may not always be possible. With the recent advances in machine learning and deep learning research, especially in natural language processing, it becomes possible to automate or semi-automate this task. This paper proposes a novel application of traditional machine learning and zero-shot classification methods for automated abstract screening for systematic reviews. Extensive experiments were carried out using seven public datasets. Competitive results were obtained in terms of accuracy, precision and recall across all datasets, which indicate that the burden and the human mistake in the abstract screening process might be reduced.

Original language	English
Article number	100162
Number of pages	9
Journal	Decision Analytics Journal
Volume	6
Early online date	12 Jan 2023
DOIs	https://doi.org/10.1016/j.dajour.2023.100162
Publication status	Published - Mar 2023

Data Availability Statement

Data and code are available in one of the references of the manuscript [39], which points to a GitHub site.

Keywords

bladder cancer
early detection
biomarkers
diagnostic performance
primary care
community

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

Access to Document

10.1016/j.dajour.2023.100162Licence: CC BY

Moreno-Garcia_etal_DAJ_A_Novel_Application_VOR
: © 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/license s/by/4.0/).
Final published version, 892 KBLicence: CC BY

Cite this

@article{c91ab211b9234407af250ee4ab276f06,

title = "A novel application of machine learning and zero-shot classification methods for automated abstract screening in systematic reviews",

abstract = "Zero-shot classification refers to assigning a label to a text (sentence, paragraph, whole paper) without prior training. This is possible by teaching the system how to codify a question and find its answer in the text. In many domains, especially health sciences, systematic reviews are evidence-based syntheses of information related to a specific topic. Producing them is demanding and time-consuming in terms of collecting, filtering, evaluating and synthesising large volumes of literature, which require significant effort performed by experts. One of its most demanding steps is abstract screening, which requires scientists to sift through various abstracts of relevant papers and include or exclude papers based on pre-established criteria. This process is time-consuming and subjective and requires a consensus between scientists, which may not always be possible. With the recent advances in machine learning and deep learning research, especially in natural language processing, it becomes possible to automate or semi-automate this task. This paper proposes a novel application of traditional machine learning and zero-shot classification methods for automated abstract screening for systematic reviews. Extensive experiments were carried out using seven public datasets. Competitive results were obtained in terms of accuracy, precision and recall across all datasets, which indicate that the burden and the human mistake in the abstract screening process might be reduced.",

keywords = "bladder cancer, early detection, biomarkers, diagnostic performance, primary care, community",

author = "Moreno-Garcia, {Carlos Francisco} and Chrisina Jayne and Eyad Elyan and {Aceves Martins}, Magaly",

year = "2023",

month = mar,

doi = "10.1016/j.dajour.2023.100162",

language = "English",

volume = "6",

journal = "Decision Analytics Journal",

issn = "2772-6622",

publisher = "Elsevier Inc.",

}

TY - JOUR

T1 - A novel application of machine learning and zero-shot classification methods for automated abstract screening in systematic reviews

AU - Moreno-Garcia, Carlos Francisco

AU - Jayne, Chrisina

AU - Elyan, Eyad

AU - Aceves Martins, Magaly

PY - 2023/3

Y1 - 2023/3

N2 - Zero-shot classification refers to assigning a label to a text (sentence, paragraph, whole paper) without prior training. This is possible by teaching the system how to codify a question and find its answer in the text. In many domains, especially health sciences, systematic reviews are evidence-based syntheses of information related to a specific topic. Producing them is demanding and time-consuming in terms of collecting, filtering, evaluating and synthesising large volumes of literature, which require significant effort performed by experts. One of its most demanding steps is abstract screening, which requires scientists to sift through various abstracts of relevant papers and include or exclude papers based on pre-established criteria. This process is time-consuming and subjective and requires a consensus between scientists, which may not always be possible. With the recent advances in machine learning and deep learning research, especially in natural language processing, it becomes possible to automate or semi-automate this task. This paper proposes a novel application of traditional machine learning and zero-shot classification methods for automated abstract screening for systematic reviews. Extensive experiments were carried out using seven public datasets. Competitive results were obtained in terms of accuracy, precision and recall across all datasets, which indicate that the burden and the human mistake in the abstract screening process might be reduced.

AB - Zero-shot classification refers to assigning a label to a text (sentence, paragraph, whole paper) without prior training. This is possible by teaching the system how to codify a question and find its answer in the text. In many domains, especially health sciences, systematic reviews are evidence-based syntheses of information related to a specific topic. Producing them is demanding and time-consuming in terms of collecting, filtering, evaluating and synthesising large volumes of literature, which require significant effort performed by experts. One of its most demanding steps is abstract screening, which requires scientists to sift through various abstracts of relevant papers and include or exclude papers based on pre-established criteria. This process is time-consuming and subjective and requires a consensus between scientists, which may not always be possible. With the recent advances in machine learning and deep learning research, especially in natural language processing, it becomes possible to automate or semi-automate this task. This paper proposes a novel application of traditional machine learning and zero-shot classification methods for automated abstract screening for systematic reviews. Extensive experiments were carried out using seven public datasets. Competitive results were obtained in terms of accuracy, precision and recall across all datasets, which indicate that the burden and the human mistake in the abstract screening process might be reduced.

KW - bladder cancer

KW - early detection

KW - biomarkers

KW - diagnostic performance

KW - primary care

KW - community

U2 - 10.1016/j.dajour.2023.100162

DO - 10.1016/j.dajour.2023.100162

M3 - Article

SN - 2772-6622

VL - 6

JO - Decision Analytics Journal

JF - Decision Analytics Journal

M1 - 100162

ER -

A novel application of machine learning and zero-shot classification methods for automated abstract screening in systematic reviews

Abstract

Data Availability Statement

Keywords

UN SDGs

Access to Document

Fingerprint

Cite this