DOI: 10.1145/3688868.3689200

BURExtract-Llama: An LLM for Clinical Concept Extraction in Breast Ultrasound Reports

Published: 31 October 2024

Abstract

Breast ultrasound plays a pivotal role in detecting and diagnosing breast abnormalities. Radiology reports summarize key findings from these examinations, highlighting lesion characteristics and malignancy assessments. However, extracting this critical information is challenging because radiology reports are unstructured and often exhibit varied linguistic styles and inconsistent formatting. While proprietary LLMs such as GPT-4 retrieve information effectively, they are costly and raise privacy concerns when handling protected health information. This study presents a pipeline for developing an in-house LLM to extract clinical information from these reports. We first use GPT-4 to label a small subset of reports and then fine-tune a Llama3-8B model on this dataset. Evaluated on a subset of reports annotated by clinicians, the proposed model achieves an average F1 score of 84.6%, on par with GPT-4. Our findings demonstrate that it is feasible to develop an in-house LLM that matches the performance of GPT-4 while offering cost reductions and enhanced data privacy.
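
The pipeline described above has two stages: GPT-4 first labels a small subset of reports, and Llama3-8B is then fine-tuned on those labels. The following is a minimal sketch of what the fine-tuning stage could look like using parameter-efficient LoRA adapters; the base checkpoint, prompt template, JSON schema, example records, and hyperparameters are illustrative assumptions rather than details taken from the paper.

import torch
from datasets import Dataset
from peft import LoraConfig, get_peft_model
from torch.utils.data import DataLoader
from transformers import (AutoModelForCausalLM, AutoTokenizer,
                          DataCollatorForSeq2Seq)

MODEL_ID = "meta-llama/Meta-Llama-3-8B-Instruct"  # assumed base checkpoint

# Stage 1 output: GPT-4-labeled examples mapping a free-text report to
# structured findings. These two records are synthetic placeholders,
# not real patient data.
examples = [
    {"report": "Irregular hypoechoic mass at 10 o'clock, 1.2 cm. BI-RADS 4.",
     "extraction": '{"shape": "irregular", "echo_pattern": "hypoechoic", "bi_rads": "4"}'},
    {"report": "Oval circumscribed mass, parallel orientation. BI-RADS 3.",
     "extraction": '{"shape": "oval", "margin": "circumscribed", "bi_rads": "3"}'},
]

PROMPT = ("Extract the lesion characteristics from the breast ultrasound "
          "report below as JSON.\nReport: {report}\nJSON: ")

tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
tokenizer.pad_token = tokenizer.eos_token
model = AutoModelForCausalLM.from_pretrained(MODEL_ID)  # in practice: quantized, on GPU

# Stage 2: wrap the base model with LoRA adapters so that only a small
# number of weights are updated during fine-tuning.
model = get_peft_model(model, LoraConfig(
    r=16, lora_alpha=32, lora_dropout=0.05, task_type="CAUSAL_LM",
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"]))

def to_features(ex):
    # Supervise the full prompt+answer sequence (simplified; prompt tokens
    # could instead be masked out of the loss).
    text = PROMPT.format(report=ex["report"]) + ex["extraction"] + tokenizer.eos_token
    enc = tokenizer(text, truncation=True, max_length=512)
    enc["labels"] = enc["input_ids"].copy()
    return enc

train_ds = Dataset.from_list(examples).map(
    to_features, remove_columns=["report", "extraction"])

# Standard causal-LM fine-tuning loop (one pass over the toy data).
loader = DataLoader(train_ds, batch_size=2,
                    collate_fn=DataCollatorForSeq2Seq(tokenizer, padding=True))
optimizer = torch.optim.AdamW(model.parameters(), lr=2e-4)
model.train()
for batch in loader:
    loss = model(**batch).loss
    loss.backward()
    optimizer.step()
    optimizer.zero_grad()

In practice the training records would come from the GPT-4 labeling stage, the base model would typically be loaded in 4-bit precision on a GPU to keep memory manageable, and the prompt tokens would usually be excluded from the loss so that only the JSON answer is supervised.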


Cited By

(2024) Multi-modal large language models in radiology: principles, applications, and potential. Abdominal Radiology. https://doi.org/10.1007/s00261-024-04708-8. Online publication date: 2-Dec-2024.


    Published In

    MCHM'24: Proceedings of the 1st International Workshop on Multimedia Computing for Health and Medicine
    October 2024
    85 pages
    ISBN:9798400711954
DOI: 10.1145/3688868

    Publisher

    Association for Computing Machinery

    New York, NY, United States


    Author Tags

    1. breast ultrasound
    2. clinical information extraction
    3. fine-tuning
    4. llm
    5. radiology reports

    Qualifiers

    • Research-article

    Conference

MM '24: The 32nd ACM International Conference on Multimedia
October 28 - November 1, 2024
Melbourne VIC, Australia

    Article Metrics

    • Downloads (last 12 months): 129
    • Downloads (last 6 weeks): 37

    Reflects downloads up to 15 Jan 2025.

