Zero-shot Generative Large Language Models for Systematic Review Screening Automation

Wang, Shuai; Scells, Harrisen; Zhuang, Shengyao; Potthast, Martin; Koopman, Bevan; Zuccon, Guido

Computer Science > Information Retrieval

arXiv:2401.06320 (cs)

[Submitted on 12 Jan 2024 (v1), last revised 1 Feb 2024 (this version, v2)]

Title:Zero-shot Generative Large Language Models for Systematic Review Screening Automation

Authors:Shuai Wang, Harrisen Scells, Shengyao Zhuang, Martin Potthast, Bevan Koopman, Guido Zuccon

View PDF HTML (experimental)

Abstract:Systematic reviews are crucial for evidence-based medicine as they comprehensively analyse published research findings on specific questions. Conducting such reviews is often resource- and time-intensive, especially in the screening phase, where abstracts of publications are assessed for inclusion in a review. This study investigates the effectiveness of using zero-shot large language models~(LLMs) for automatic screening. We evaluate the effectiveness of eight different LLMs and investigate a calibration technique that uses a predefined recall threshold to determine whether a publication should be included in a systematic review. Our comprehensive evaluation using five standard test collections shows that instruction fine-tuning plays an important role in screening, that calibration renders LLMs practical for achieving a targeted recall, and that combining both with an ensemble of zero-shot models saves significant screening time compared to state-of-the-art approaches.

Comments:	Accepted to ECIR2024 full paper (findings)
Subjects:	Information Retrieval (cs.IR); Computation and Language (cs.CL)
Cite as:	arXiv:2401.06320 [cs.IR]
	(or arXiv:2401.06320v2 [cs.IR] for this version)
	https://doi.org/10.48550/arXiv.2401.06320

Submission history

From: Shuai Wang [view email]
[v1] Fri, 12 Jan 2024 01:54:08 UTC (1,290 KB)
[v2] Thu, 1 Feb 2024 02:08:28 UTC (1,290 KB)

Computer Science > Information Retrieval

Title:Zero-shot Generative Large Language Models for Systematic Review Screening Automation

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Information Retrieval

Title:Zero-shot Generative Large Language Models for Systematic Review Screening Automation

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators