Paying Per-Label Attention for Multi-label Extraction from Radiology Reports

Schrempf, Patrick; Watson, Hannah; Mikhael, Shadia; Pajak, Maciej; Falis, Matúš; Lisowska, Aneta; Muir, Keith W.; Harris-Birtill, David; O’Neil, Alison Q.

doi:10.1007/978-3-030-61166-8_29

Patrick Schrempf^27,28,
Hannah Watson²⁷,
Shadia Mikhael²⁷,
Maciej Pajak²⁷,
Matúš Falis²⁷,
Aneta Lisowska²⁷,
Keith W. Muir²⁹,
David Harris-Birtill²⁸ &
…
Alison Q. O’Neil^27,30

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 12446))

Included in the following conference series:

International Workshop on Interpretability of Machine Intelligence in Medical Image Computing
International Workshop on Medical Image Learning with Less Labels and Imperfect Data
International Workshop on Large-scale Annotation of Biomedical data and Expert Label Synthesis

1565 Accesses
1 Citations
3 Altmetric

Abstract

Training medical image analysis models requires large amounts of expertly annotated data which is time-consuming and expensive to obtain. Images are often accompanied by free-text radiology reports which are a rich source of information. In this paper, we tackle the automated extraction of structured labels from head CT reports for imaging of suspected stroke patients, using deep learning. Firstly, we propose a set of 31 labels which correspond to radiographic findings (e.g. hyperdensity) and clinical impressions (e.g. haemorrhage) related to neurological abnormalities. Secondly, inspired by previous work, we extend existing state-of-the-art neural network models with a label-dependent attention mechanism. Using this mechanism and simple synthetic data augmentation, we are able to robustly extract many labels with a single model, classified according to the radiologist’s reporting (positive, uncertain, negative). This approach can be used in further research to effectively extract many labels from medical text.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Multi-label annotation of text reports from computed tomography of the chest, abdomen, and pelvis using deep learning

Article Open access 15 April 2022

Efficient labeling of french mammogram reports with MammoBERT

Article Open access 22 October 2024

ChestX-ray: Hospital-Scale Chest X-ray Database and Benchmarks on Weakly Supervised Classification and Localization of Common Thorax Diseases

Notes

1.
iCAIRD project number: 104690; University of St Andrews: CS14871.
2.
https://github.com/google-research/bert.
3.
https://github.com/th0mi/clinicalBERT.

References

Alsentzer, E., et al.: Publicly available clinical BERT embeddings. In: Proceedings of the 2nd Clinical Natural Language Processing Workshop, pp. 72–78. Association for Computational Linguistics, Minneapolis, Minnesota, USA, Jun 2019. https://doi.org/10.18653/v1/W19-1909
Bahdanau, D., Cho, K., Bengio, Y.: Neural machine translation by jointly learning to align and translate. In: Bengio, Y., LeCun, Y. (eds.) 3rd International Conference on Learning Representations, ICLR, San Diego, CA, USA, 7–9 May 2015, Conference Track Proceedings (2015)
Google Scholar
Banerjee, S., Akkaya, C., Perez-Sorrosal, F., Tsioutsiouliklis, K.: Hierarchical transfer learning for multi-label text classification. In: Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pp. 6295–6300 (2019)
Google Scholar
Bodenreider, O.: The unified medical language system (UMLS): integrating biomedical terminology. Nucleic Acids Res. 32(90001), 267D–270 (2004). https://doi.org/10.1093/nar/gkh061
Article Google Scholar
Cho, K., van Merriënboer, B., Bahdanau, D., Bengio, Y.: On the properties of neural machine translation: Encoder-decoder approaches. In: Proceedings of SSST-8, Eighth Workshop on Syntax, Semantics and Structure in Statistical Translation, pp. 103–111. Association for Computational Linguistics, Doha, Qatar, October 2014.https://doi.org/10.3115/v1/W14-4012
Devlin, J., Chang, M.W., Lee, K., Toutanova, K.: BERT: pre-training of deep bidirectional transformers for language understanding. In: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), pp. 4171–4186. Association for Computational Linguistics, Minneapolis, Minnesota, June 2019. https://doi.org/10.18653/v1/N19-1423
Drozdov, I., et al.: Supervised and unsupervised language modelling in chest x-ray radiological reports. Plos One 15(3), e0229963 (2020)
Article Google Scholar
Gorinski, P.J., et al.: Named entity recognition for electronic health records: a comparison of rule-based and machine learning approaches. arXiv preprint arXiv:1903.03985 (2019)
Irvin, J., et al.: CheXpert: a large chest radiograph dataset with uncertainty labels and expert comparison. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 33, pp. 590–597 (2019)
Google Scholar
IST-3 collaborative group: Association between brain imaging signs, early and late outcomes, and response to intravenous alteplase after acute ischaemic stroke in the third International Stroke Trial (IST-3): secondary analysis of a randomised controlled trial. Lancet Neurol. 14, pp. 485–496 (2015). https://doi.org/10.1016/S1474-4422(15)00012-5
Johnson, A.E., et al.: MIMIC-III, a freely accessible critical care database. Sci. Data 3, 160035 (2016)
Article Google Scholar
Kingma, D.P., Ba, J.: Adam: a method for stochastic optimization. In: Bengio, Y., LeCun, Y. (eds.) 3rd International Conference on Learning Representations, ICLR, San Diego, CA, USA, 7–9 May 2015, Conference Track Proceedings (2015)
Google Scholar
Loper, E., Bird, S.: NLTK: the natural language toolkit. In: Proceedings of the ACL Workshop on Effective Tools and Methodologies for Teaching Natural Language Processing and Computational Linguistics. Association for Computational Linguistics, Philadelphia (2002)
Google Scholar
Mikolov, T., Sutskever, I., Chen, K., Corrado, G.S., Dean, J.: Distributed representations of words and phrases and their compositionality. In: Advances in Neural Information Processing Systems, pp. 3111–3119 (2013)
Google Scholar
Mullenbach, J., Wiegreffe, S., Duke, J., Sun, J., Eisenstein, J.: Explainable prediction of medical codes from clinical text. In: Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers), pp. 1101–1111. Association for Computational Linguistics, New Orleans, Louisiana, Jun 2018. https://doi.org/10.18653/v1/N18-1100
Pedregosa, F., et al.: Scikit-learn: machine learning in Python. J. Mach. Learn. Res. 12, 2825–2830 (2011)
MathSciNet MATH Google Scholar
Radiological Society of North America: RSNA Intracranial Hemorrhage Detection (Kaggle challenge). https://www.kaggle.com/c/rsna-intracranial-hemorrhage-detection/overview
Řehůřek, R., Sojka, P.: Software framework for topic modelling with large corpora. In: Proceedings of the LREC 2010 Workshop on New Challenges for NLP Frameworks, ELRA, Valletta, Malta, pp. 45–50, May 2010
Google Scholar
Smit, A., Jain, S., Rajpurkar, P., Pareek, A., Ng, A.Y., Lungren, M.P.: CheXbert: combining automatic labelers and expert annotations for accurate radiology report labeling using BERT. arXiv preprint arXiv:2004.09167 (2020)
Wolf, T., et al.: HuggingFace’s transformers: state-of-the-art natural language processing. ArXiv abs/1910.03771 (2019)
Google Scholar
Wood, D., et al.: Automated labelling using an attention model for radiology reports of MRI scans (ALARM). In: Medical Imaging with Deep Learning (2020). https://openreview.net/forum?id=UFnWZTbM5t
Yadav, K., Sarioglu, E., Choi, H., Cartwright IV, W.B., Hinds, P.S., Chamberlain, J.M.: Automated outcome classification of computed tomography imaging reports for pediatric traumatic brain injury. Acad. Emerg. Med. 23(2), 171–178 (2016). https://doi.org/10.1111/acem.12859
Article Google Scholar
Yang, Z., Yang, D., Dyer, C., He, X., Smola, A., Hovy, E.: Hierarchical attention networks for document classification. In: Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp. 1480–1489 (2016)
Google Scholar
Yetisgen-Yildiz, M., Gunn, M.L., Xia, F., Payne, T.H.: A text processing pipeline to extract recommendations from radiology reports. J. Biomed. Inf. 46(2), 354–362 (2013)
Article Google Scholar
Zech, J., et al.: Natural language-based machine learning models for the annotation of clinical radiology reports. Radiology 287(2), 570–580 (2018)
Article MathSciNet Google Scholar

Download references

Acknowledgements

This work is part of the Industrial Centre for AI Research in digital Diagnostics (iCAIRD) which is funded by Innovate UK on behalf of UK Research and Innovation (UKRI) [project number: 104690]. We would like to thank the Glasgow Safe Haven for assistance in creating and providing this dataset. Thanks also to The Data Lab for support and funding.

Author information

Authors and Affiliations

Canon Medical Research Europe, Edinburgh, UK
Patrick Schrempf, Hannah Watson, Shadia Mikhael, Maciej Pajak, Matúš Falis, Aneta Lisowska & Alison Q. O’Neil
University of St Andrews, St Andrews, UK
Patrick Schrempf & David Harris-Birtill
Institute of Neuroscience & Psychology, University of Glasgow, Glasgow, UK
Keith W. Muir
University of Edinburgh, Edinburgh, UK
Alison Q. O’Neil

Authors

Patrick Schrempf
View author publications
You can also search for this author in PubMed Google Scholar
Hannah Watson
View author publications
You can also search for this author in PubMed Google Scholar
Shadia Mikhael
View author publications
You can also search for this author in PubMed Google Scholar
Maciej Pajak
View author publications
You can also search for this author in PubMed Google Scholar
Matúš Falis
View author publications
You can also search for this author in PubMed Google Scholar
Aneta Lisowska
View author publications
You can also search for this author in PubMed Google Scholar
Keith W. Muir
View author publications
You can also search for this author in PubMed Google Scholar
David Harris-Birtill
View author publications
You can also search for this author in PubMed Google Scholar
Alison Q. O’Neil
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Patrick Schrempf .

Editor information

Editors and Affiliations

University of Porto, Porto, Portugal
Jaime Cardoso
University of Houston, Houston, TX, USA
Hien Van Nguyen
University of Minnesota, Minneapolis, MN, USA
Nicholas Heller
University of Coimbra, Coimbra, Portugal
Pedro Henriques Abreu
Amsterdam University Medical Center, Amsterdam, The Netherlands
Ivana Isgum
University of Porto, Porto, Portugal
Wilson Silva
University of Porto, Porto, Portugal
Ricardo Cruz
University of Coimbra, Coimbra, Portugal
Jose Pereira Amorim
Johns Hopkins University, Baltimore, MD, USA
Vishal Patel
University of Houston, Houston, TX, USA
Badri Roysam
Chinese Academy of Sciences, Beijing, China
Kevin Zhou
UT Southwestern Medical Center, Dallas, TX, USA
Steve Jiang
University of Arkansas, Fayetteville, AR, USA
Ngan Le
University of Arkansas, Fayetteville, AR, USA
Khoa Luu
University of Bern, Bern, Switzerland
Raphael Sznitman
Eindhoven University of Technology, Eindhoven, The Netherlands
Veronika Cheplygina
Technical University of Munich, Nantes, Germany
Diana Mateus
University of Dundee, Dundee, UK
Emanuele Trucco
Eindhoven University of Technology, Eindhoven, The Netherlands
Samaneh Abbasi

1 Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary material 1 (pdf 149 KB)

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Schrempf, P. et al. (2020). Paying Per-Label Attention for Multi-label Extraction from Radiology Reports. In: Cardoso, J., et al. Interpretable and Annotation-Efficient Learning for Medical Image Computing. IMIMIC MIL3ID LABELS 2020 2020 2020. Lecture Notes in Computer Science(), vol 12446. Springer, Cham. https://doi.org/10.1007/978-3-030-61166-8_29

Download citation

DOI: https://doi.org/10.1007/978-3-030-61166-8_29
Published: 02 October 2020
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-61165-1
Online ISBN: 978-3-030-61166-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Societies and partnerships

The Medical Image Computing and Computer Assisted Intervention Society (opens in a new tab)

Paying Per-Label Attention for Multi-label Extraction from Radiology Reports

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Multi-label annotation of text reports from computed tomography of the chest, abdomen, and pelvis using deep learning

Efficient labeling of french mammogram reports with MammoBERT

ChestX-ray: Hospital-Scale Chest X-ray Database and Benchmarks on Weakly Supervised Classification and Localization of Common Thorax Diseases

Notes

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

1 Electronic supplementary material

Supplementary material 1 (pdf 149 KB)

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Societies and partnerships

Subscribe and save

Buy Now

Navigation

Paying Per-Label Attention for Multi-label Extraction from Radiology Reports

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Multi-label annotation of text reports from computed tomography of the chest, abdomen, and pelvis using deep learning

Efficient labeling of french mammogram reports with MammoBERT

ChestX-ray: Hospital-Scale Chest X-ray Database and Benchmarks on Weakly Supervised Classification and Localization of Common Thorax Diseases

Notes

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

1 Electronic supplementary material

Supplementary material 1 (pdf 149 KB)

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Societies and partnerships

Search

Navigation