Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/2512410.2512427acmconferencesArticle/Chapter ViewAbstractPublication PagesdareConference Proceedingsconference-collections
research-article

Challenges in understanding clinical notes: why NLP engines fall short and where background knowledge can help

Published: 01 November 2013 Publication History

Abstract

Understanding of Electronic Medical Records(EMRs) plays a crucial role in improving healthcare outcomes. However, the unstructured nature of EMRs poses several technical challenges for structured information extraction from clinical notes leading to automatic analysis. Natural Language Processing(NLP) techniques developed to process EMRs are effective for variety of tasks, they often fail to preserve the semantics of original information expressed in EMRs, particularly in complex scenarios. This paper illustrates the complexity of the problems involved and deals with conflicts created due to the shortcomings of NLP techniques and demonstrates where domain specific knowledge bases can come to rescue in resolving conflicts that can significantly improve the semantic annotation and structured information extraction. We discuss various insights gained from our study on real world dataset.

References

[1]
Alan R Aronson. Metamap: Mapping text to the umls metathesaurus. Bethesda, MD: NLM, NIH, DHHS, 2006.
[2]
Olivier Bodenreider. The unified medical language system (umls): integrating biomedical terminology. Nucleic acids research, 32(suppl 1):D267--D270, 2004.
[3]
Carol Friedman, Philip O Alderson, John HM Austin, James J Cimino, and Stephen B Johnson. A general natural-language text processor for clinical radiology. Journal of the American Medical Informatics Association, 1(2):161--174, 1994.
[4]
Sujan Perera, Cory Henson, Krishnaprasad Thirunarayan, Amit Sheth, and Suhas Nair. Data driven knowledge acquisition method for domain knowledge enrichment in the healthcare. In IEEE International Conference on Bioinformatics and Biomedicine, pages 1--8, October 2012.
[5]
Guergana K Savova, James J Masanz, Philip V Ogren, Jiaping Zheng, Sunghwan Sohn, Karin C Kipper-Schuler, and Christopher G Chute. Mayo clinical text analysis and knowledge extraction system (ctakes): architecture, component evaluation and applications. Journal of the American Medical Informatics Association, 17(5):507--513, 2010.

Cited By

View all
  • (2023)Review of Natural Language Processing in PharmacologyPharmacological Reviews10.1124/pharmrev.122.00071575:4(714-738)Online publication date: 17-Mar-2023
  • (2023)Identifying Alcohol-Related Information From Unstructured Bilingual Clinical Notes With Multilingual TransformersIEEE Access10.1109/ACCESS.2023.324552311(16066-16075)Online publication date: 2023
  • (2022)Comparison of Pretraining Models and Strategies for Health-Related Social Media Text ClassificationHealthcare10.3390/healthcare1008147810:8(1478)Online publication date: 5-Aug-2022
  • Show More Cited By

Index Terms

  1. Challenges in understanding clinical notes: why NLP engines fall short and where background knowledge can help

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    DARE '13: Proceedings of the 2013 international workshop on Data management & analytics for healthcare
    November 2013
    34 pages
    ISBN:9781450324250
    DOI:10.1145/2512410
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 01 November 2013

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. knowledge base
    2. natural language processing
    3. negation detection

    Qualifiers

    • Research-article

    Conference

    CIKM'13
    Sponsor:

    Acceptance Rates

    DARE '13 Paper Acceptance Rate 5 of 7 submissions, 71%;
    Overall Acceptance Rate 5 of 7 submissions, 71%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)35
    • Downloads (Last 6 weeks)5
    Reflects downloads up to 30 Aug 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2023)Review of Natural Language Processing in PharmacologyPharmacological Reviews10.1124/pharmrev.122.00071575:4(714-738)Online publication date: 17-Mar-2023
    • (2023)Identifying Alcohol-Related Information From Unstructured Bilingual Clinical Notes With Multilingual TransformersIEEE Access10.1109/ACCESS.2023.324552311(16066-16075)Online publication date: 2023
    • (2022)Comparison of Pretraining Models and Strategies for Health-Related Social Media Text ClassificationHealthcare10.3390/healthcare1008147810:8(1478)Online publication date: 5-Aug-2022
    • (2022)Identifying Caregiver Availability Using Medical Notes With Rule-Based Natural Language Processing: Retrospective Cohort StudyJMIR Aging10.2196/402415:3(e40241)Online publication date: 22-Sep-2022
    • (2022)Semantic Reasoning with NLI for Assertion Detection in Medical Text2022 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)10.1109/BIBM55620.2022.9995166(744-748)Online publication date: 6-Dec-2022
    • (2021)Medical Entity Recognition Based on BiLSTM with Knowledge Graph and Attention Mechanism2021 4th International Conference on Intelligent Autonomous Systems (ICoIAS)10.1109/ICoIAS53694.2021.00035(149-157)Online publication date: May-2021
    • (2021)Improving the accuracy of stroke clinical coding with open-source software and natural language processingJournal of Clinical Neuroscience10.1016/j.jocn.2021.10.02494(233-236)Online publication date: Dec-2021
    • (2021)Tensions in Representing Behavioral Data in an Electronic Health RecordComputer Supported Cooperative Work (CSCW)10.1007/s10606-021-09402-7Online publication date: 22-Jun-2021
    • (2020)Bio-semantic relation extraction with attention-based external knowledge reinforcementBMC Bioinformatics10.1186/s12859-020-3540-821:1Online publication date: 24-May-2020
    • (2020)DsOn: Ontology-Driven Model for Symptom and Drug Knowledge Extraction on Social Media2020 International Conference on Computation, Automation and Knowledge Management (ICCAKM)10.1109/ICCAKM46823.2020.9051527(552-559)Online publication date: Jan-2020
    • Show More Cited By

    View Options

    Get Access

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media