Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article
Free access
Just Accepted

MedNER: Enhanced Named Entity Recognition in Medical Corpus via Optimized Balanced and Deep Active Learning

Online AM: 17 July 2024 Publication History

Abstract

Ever-growing electronic medical corpora provide unprecedented opportunities for researchers to analyze patient conditions and drug effects. Meanwhile, severe challenges emerged in the large-scale electronic medical records process phase. Primarily, emerging words for medical terms, including informal descriptions, are difficult to recognize. Moreover, although deep models can help in entity extraction on medical texts, it requires large-scale labels which are time-intensive to obtain and not always available in the medical domain. However, when encountering a situation where massive unseen concepts appear, or labeled data is insufficient, the performance of existing algorithms will suffer an intolerable decline. In this paper, we propose a balanced and deep active learning framework (MedNER) for Named Entity Recognition in the medical corpus to alleviate above problems. Specifically, to describe our selection strategy precisely, we first define the uncertainty of a medical sentence as a labeling loss predicted by a loss-prediction module and define diversity as the least text distance between pairs of sentences in a sample batch computed based on word-morpheme embeddings. Furthermore, aiming to make a trade-off between uncertainty and diversity, we formulate a Distinct-K optimization problem to maximize the slightest uncertainty and diversity of chosen sentences. Finally, we propose a threshold-based approximation selection algorithm, Distinct-K Filter, which selects the most beneficial training samples by balancing diversity and uncertainty. Extensive experimental results on real datasets demonstrate that MedNER significantly outperforms existing approaches.

References

[1]
Dzmitry Bahdanau, Kyunghyun Cho, and Yoshua Bengio. 2015. Neural Machine Translation by Jointly Learning to Align and Translate. In ICLR.
[2]
William H. Beluch, Tim Genewein, Andreas Nürnberger, and Jan M. Köhler. 2018. The Power of Ensembles for Active Learning in Image Classification. In CVPR. 9368–9377.
[3]
Piotr Bojanowski, Edouard Grave, Armand Joulin, and Tomas Mikolov. 2017. Enriching Word Vectors with Subword Information. TACL 5 (2017), 135–146.
[4]
Shayok Chakraborty, Vineeth Nallure Balasubramanian, Qian Sun, Sethuraman Panchanathan, and Jieping Ye. 2015. Active Batch Selection via Convex Relaxations with Guaranteed Solution Bounds. IEEE Trans. Pattern Anal. Mach. Intell. 37, 10 (2015), 1945–1958.
[5]
Xinxiong Chen, Lei Xu, Zhiyuan Liu, Maosong Sun, and Huan-Bo Luan. 2015. Joint Learning of Character and Word Embeddings. In IJCAI. 1236–1242.
[6]
Jason P. C. Chiu and Eric Nichols. 2016. Named Entity Recognition with Bidirectional LSTM-CNNs. TACL 4 (2016), 357–370.
[7]
Junghwan Cho, Kyewook Lee, Ellie Shin, Garry Choy, and Synho Do. 2015. How much data is needed to train a medical image deep learning system to achieve necessary high accuracy? arXiv preprint arXiv:1511.06348 (2015).
[8]
Kyunghyun Cho, Bart van Merrienboer, Çaglar Gülçehre, Dzmitry Bahdanau, Fethi Bougares, Holger Schwenk, and Yoshua Bengio. 2014. Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation. In EMNLP. ACL, 1724–1734.
[9]
Shaika Chowdhury, C Zhang, and Philip S. Yu. 2018. Multi-Task Pharmacovigilance Mining from Social Media Posts. In WWW. 117–126.
[10]
Fenia Christopoulou, Thy Thy Tran, Sunil Kumar Sahu, Makoto Miwa, and Sophia Ananiadou. 2020. Adverse drug events and medication relation extraction in electronic health records with ensemble deep learning methods. Journal of the American Medical Informatics Association 27, 1 (2020), 39–46.
[11]
Pinar Donmez, Jaime G. Carbonell, and Paul N. Bennett. 2007. Dual Strategy Active Learning. In ECML. 116–127.
[12]
Joseph Gatto, Parker Seegmiller, Garrett Johnston, and Sarah M Preum. 2022. HealthE: Classifying Entities in Online Textual Health Advice. arXiv preprint arXiv:2210.03246 (2022).
[13]
Alex Graves, Abdel-rahman Mohamed, and Geoffrey E. Hinton. 2013. Speech recognition with deep recurrent neural networks. In ICASSP. 6645–6649.
[14]
Yuhong Guo. 2010. Active instance sampling via matrix partition. Advances in Neural Information Processing Systems (2010), 802–810.
[15]
M. M. Halldórsson and J. Radhakrishnan. 1997. Greed is good: Approximating independent sets in sparse and bounded-degree graphs. In Algorithmica. 145–163.
[16]
Daniel Hanisch, Katrin Fundel, Heinz-Theodor Mevissen, Ralf Zimmer, and Juliane Fluck. 2005. ProMiner: rule-based protein and gene entity recognition. BMC Bioinform. 6, S-1 (2005).
[17]
Steven C. H. Hoi, Rong Jin, and Michael R. Lyu. 2009. Batch Mode Active Learning with Applications to Text Categorization and Image Retrieval. TKDE 21, 9 (2009), 1233–1248.
[18]
Yan Hu, Iqra Ameer, Xu Zuo, Xueqing Peng, Yujia Zhou, Zehan Li, Yiming Li, Jianfu Li, Xiaoqian Jiang, and Hua Xu. 2023. Zero-shot Clinical Entity Recognition using ChatGPT. arXiv preprint arXiv:2303.16416v2 (2023).
[19]
Sheng-Jun Huang, Jia-Wei Zhao, and Zhao-Yang Liu. 2018. Cost-Effective Training of Deep CNNs with Active Model Adaptation. In SIGKDD. 1580–1588.
[20]
Adina R Kern-Goldberger, Sindhu K Srinivas, Elizabeth A Howell, Michael Harhay, and Lisa D Levine. 2023. Validation of maternal co-morbidity diagnoses using differential data extraction strategies across a large health system. American Journal of Obstetrics & Gynecology 228, 1 (2023), S247–S248.
[21]
Matt J. Kusner, Yu Sun, N Kolkin, and Kilian Q. Weinberger. 2015. From Word Embeddings To Document Distances. In ICML. 957–966.
[22]
John D. Lafferty, Andrew McCallum, and Fernando C. N. Pereira. 2001. Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data. In ICML. 282–289.
[23]
Chenliang Li, Aixin Sun, Jianshu Weng, and Qi He. 2015. Tweet Segmentation and Its Application to Named Entity Recognition. TKDE 27, 2 (2015), 558–570.
[24]
Huayu Li, Martin Renqiang Min, Yong Ge, and Asim Kadav. 2017. A Context-aware Attention Network for Interactive Question Answering. In SIGKDD. 927–935.
[25]
Fenglong Ma, Radha Chitta, Jing Zhou, Quanzeng You, Tong Sun, and Jing Gao. 2017. Dipole: Diagnosis Prediction in Healthcare via Attention-based Bidirectional Recurrent Neural Networks. In SIGKDD. 1903–1911.
[26]
Christopher D. Manning, Prabhakar Raghavan, and H Schütze. 2008. Introduction to information retrieval. Cambridge University Press.
[27]
Tomas Mikolov, Martin Karafiát, Lukás Burget, Jan Cernocký, and Sanjeev Khudanpur. 2010. Recurrent neural network based language model. In INTERSPEECH. 1045–1048.
[28]
Tomas Mikolov, Ilya Sutskever, Kai Chen, Gregory S. Corrado, and Jeffrey Dean. 2013. Distributed Representations of Words and Phrases and their Compositionality. In NIPS. 3111–3119.
[29]
Priyanka C Nair and B Indira Devi. 2022. Automatic Symptom Extraction from Unstructured Web Data for Designing Healthcare Systems. In Emerging Research in Computing, Information, Communication and Applications: ERCICA 2020, Volume 2. Springer, 599–608.
[30]
Jeffrey Pennington, Richard Socher, and Christopher D. Manning. 2014. Glove: Global Vectors for Word Representation. In EMNLP. 1532–1543.
[31]
Sarah Riepenhausen, Cornelia Mertens, and Martin Dugas. 2021. Comparing SDTM and FHIR® for Real World Data from Electronic Health Records for Clinical Trial Submissions. In MIE. 585–589.
[32]
Abeed Sarker, Azadeh Nikfarjam, and Graciela Gonzalez-Hernandez. 2016. Social Media Mining Shared Task Workshop. In Biocomputing 2016: Proceedings of the Pacific Symposium. 581–592.
[33]
Max Schumm, Ming-Yeah Hu, Vivek Sant, Jiyoon Kim, Chi-Hong Tseng, Javier Sanz, Steven Raman, Run Yu, and Masha Livhits. 2023. Automated extraction of incidental adrenal nodules from electronic health records. Surgery 173, 1 (2023), 52–58.
[34]
Ozan Sener and Silvio Savarese. 2018. Active Learning for Convolutional Neural Networks: A Core-Set Approach. In ICLR.
[35]
Yanyao Shen, Hyokun Yun, Zachary C. Lipton, Yakov Kronrod, and Animashree Anandkumar. 2018. Deep Active Learning for Named Entity Recognition. In ICLR.
[36]
Joseph P. Turian, Lev-Arie Ratinov, and Yoshua Bengio. 2010. Word Representations: A Simple and General Method for Semi-Supervised Learning. In ACL 2010. 384–394.
[37]
Liqin Wang, Sheril Varghese, Sonam Bassir, Kimberly G Blumenthal, Elizabeth J Phillips, and Li Zhou. 2022. Stevens-Johnson syndrome and toxic epidermal necrolysis: A systematic review of /MEDLINE case reports from 1980 to 2020. Frontiers in Medicine 9 (2022).
[38]
Shuhe Wang, Xiaofei Sun, Xiaoya Li, Rongbin Ouyang, Fei Wu, Tianwei Zhang, Jiwei Li, and Guoyin Wang. 2023. GPT-NER: Named Entity Recognition via Large Language Models. arXiv preprint arXiv:2304.10428 (2023).
[39]
Jun Wen, Xiang Zhang, Everett Rush, Vidul A Panickan, Xingyu Li, Tianrun Cai, Doudou Zhou, Yuk-Lam Ho, Lauren Costa, Edmon Begoli, et al. 2023. Multimodal representation learning for predicting molecule–disease relations. Bioinformatics 39, 2 (2023), btad085.
[40]
Mingyu Xiao and Hiroshi Nagamochi. 2017. Exact algorithms for maximum independent set. Inf. Comput. 255 (2017), 126–146.
[41]
Christopher C. Yang, Haodong Yang, Ling Jiang, and Mi Zhang. 2012. Social media mining for drug safety signal detection. In SHB. 33–40.
[42]
Yi Yang, Zhigang Ma, Feiping Nie, Xiaojun Chang, and Alexander G. Hauptmann. 2015. Multi-Class Active Learning by Uncertainty Sampling with Diversity Maximization. International Journal of Computer Vision 113, 2 (2015), 113–127.
[43]
Donggeun Yoo and In So Kweon. 2019. Learning Loss for Active Learning. In CVPR. 93–102.
[44]
Tongxuan Zhang, Hongfei Lin, Yuqi Ren, Liang Yang, Bo Xu, Zhihao Yang, Jian Wang, and Yijia Zhang. 2019. Adverse drug reaction detection via a multihop self-attention mechanism. BMC Bioinformatics 20, 1 (2019), 479:1–479:11.
[45]
Zongwei Zhou, Jae Y. Shin, Lei Zhang, Suryakanth R. Gurudu, Michael B. Gotway, and Jianming Liang. 2017. Fine-Tuning Convolutional Neural Networks for Biomedical Image Analysis: Actively and Incrementally. In CVPR. 4761–4772.

Index Terms

  1. MedNER: Enhanced Named Entity Recognition in Medical Corpus via Optimized Balanced and Deep Active Learning

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Transactions on Intelligent Systems and Technology
    ACM Transactions on Intelligent Systems and Technology Just Accepted
    EISSN:2157-6912
    Table of Contents
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Online AM: 17 July 2024
    Accepted: 27 April 2024
    Revised: 19 April 2024
    Received: 10 April 2023

    Check for updates

    Author Tags

    1. Named Entity Recognition
    2. Active Learning
    3. Medical Text Mining

    Qualifiers

    • Research-article

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • 0
      Total Citations
    • 123
      Total Downloads
    • Downloads (Last 12 months)123
    • Downloads (Last 6 weeks)46
    Reflects downloads up to 17 Oct 2024

    Other Metrics

    Citations

    View Options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Get Access

    Login options

    Full Access

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media