Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
article

Extraction of events and temporal expressions from clinical narratives

Published: 01 December 2013 Publication History

Abstract

This paper addresses an important task of event and timex extraction from clinical narratives in context of the i2b2 2012 challenge. State-of-the-art approaches for event extraction use a multi-class classifier for finding the event types. However, such approaches consider each event in isolation. In this paper, we present a sentence-level inference strategy which enforces consistency constraints on attributes of those events which appear close to one another. Our approach is general and can be used for other tasks as well. We also design novel features like clinical descriptors (from medical ontologies) which encode a lot of useful information about the concepts. For timex extraction, we adapt a state-of-the-art system, HeidelTime, for use in clinical narratives and also develop several rules which complement HeidelTime. We also give a robust algorithm for date extraction. For the event extraction task, we achieved an overall F1 score of 0.71 for determining span of the events along with their attributes. For the timex extraction task, we achieved an F1 score of 0.79 for determining span of the temporal expressions. We present detailed error analysis of our system and also point out some factors which can help to improve its accuracy.

References

[1]
Sun, W., Rumshisky, A. and Uzuner, O., Evaluating temporal relations in clinical text: 2012 i2b2 challenge. J Am Med Inform Assoc.
[2]
de Bruijn, B., Cherry, C., Kiritchenko, S., Martin, J. and Zhu, X., Machine-learned solutions for three stages of clinical information extraction: the state of the art at i2b2 2010. J Am Med Inform Assoc. v18 i5. 557-562.
[3]
Jiang, M., Chen, Y., Liu, M., Rosenbloom, S., Mani, S. and Denny, J., A study of machine-learning-based approaches to extract clinical entities and their assertions from discharge summaries. J Am Med Inform Assoc. v18 i5. 601-606.
[4]
Roberts, K. and Harabagiu, S., A flexible framework for deriving assertions from electronic medical records. J Am Med Inform Assoc. v18 i5. 568-573.
[5]
Uzuner, Ö, Zhang, X. and Sibanda, T., Machine learning and rule-based approaches to assertion classification. J Am Med Inform Assoc. v16 i1. 109-115.
[6]
D'Avolio, L., Nguyen, T., Goryachev, S. and Fiore, L., Automated concept-level information extraction to reduce the need for custom software and rules development. J Am Med Inform Assoc. v18 i5. 607-613.
[7]
Torii, M., Wagholikar, K. and Liu, H., Using machine learning for concept extraction on clinical documents from multiple data sources. J Am Med Inform Assoc. v18 i5. 580-587.
[8]
Uzuner, O., South, B., Shen, S. and DuVall, S., 2010 i2b2/va challenge on concepts, assertions, and relations in clinical text. J Am Med Inform Assoc.
[9]
Strötgen, J. and Gertz, M., Multilingual and cross-domain temporal tagging. Lang Resour Eval. 1-30.
[10]
Chang AX, Manning CD. SUTIME: a library for recognizing and normalizing time expressions. In: 8th International conference on language resources and evaluation (LREC 2012); 2012. <http://nlp.stanford.edu/pubs/lrec2012-sutime.pdf>.
[11]
Verhagen MSauri, R., Caselli, T. and Pustejovsky, J., Semeval-2010 task 13: Tempeval-2. In: Proceedings of the 5th international workshop on semantic evaluation, Association for Computational Linguistics. pp. 57-62.
[12]
Angeli G, Manning CD, Jurafsky D. Parsing time: learning to interpret time expressions. In: NAACL-HLT; 2012. <http://nlp.stanford.edu/pubs/2012-naacl-temporal.pdf>.
[13]
Sun, W., Rumshisky, A. and Uzuner, O., Temporal reasoning over clinical text: the state of the art. J Am Med Inform Assoc.
[14]
JodaTime; 2013. <http://joda-time.sourceforge.net/> {accessed 10.06.13}.
[15]
Chang, C.C. and Lin, C.J., LIBSVM: a library for support vector machines. ACM Trans Intel Syst Technol. v2. 27:1-27:27.
[16]
Chapman, W., Bridewell, W., Hanbury, P., Cooper, G. and Buchanan, B., A simple algorithm for identifying negated findings and diseases in discharge summaries. J Biomed Inform. v34 i5. 301-310.
[17]
Gurobi Optimization I. Gurobi optimizer reference manual; 2013. <http://www.gurobi.com>.
[18]
Roth D, Yih W. Global inference for entity and relation identification via a linear programming formulation; 2007. <http://cogcomp.cs.illinois.edu/papers/RothYi07.pdf>.
[19]
Koehn P. Statistical significance tests for machine translation evaluation. In: Proceedings of EMNLP, vol. 4; 2004. p. 388-95.
[20]
Sohn, S., Wagholikar, K.B., Li, D., Jonnalagadda, S.R., Tao, C. and Elayavilli, R.K., Comprehensive temporal information detection from clinical text: medical events, time, and TLINK identification. J Am Med Inform Assoc.
[21]
Tang, B., Wu, Y., Jiang, M., Chen, Y., Denny, J.C. and Xu, H., A hybrid system for temporal information extraction from clinical text. J Am Med Inform Assoc.
[22]
Xu, Y., Wang, Y., Liu, T., Tsujii, J., Eric, I. and Chang, C., An end-to-end system to identify temporal relation in discharge summaries: 2012 i2b2 challenge. J Am Med Inform Assoc.
[23]
Cherry, C., Zhu, X., Martin, J. and de Bruijn, B., í la recherche du temps perdu: extracting temporal relations from medical text in the 2012 i2b2 NLP challenge. J Am Med Inform Assoc.
[24]
Roberts, K., Rink, B. and Harabagiu, S.M., A flexible framework for recognizing events, temporal expressions, and temporal relations in clinical text. J Am Med Inform Assoc.
[25]
Grouin, C., Grabar, N., Hamon, T., Rosset, S., Tannier, X. and Zweigenbaum, P., Eventual situations for timeline extraction from clinical reports. J Am Med Inform Assoc.
[26]
Kovačević, A., Dehghan, A., Filannino, M., Keane, J.A. and Nenadic, G., Combining rules and machine learning for extraction of temporal expressions and events from clinical narratives. J Am Med Inform Assoc.

Cited By

View all

Index Terms

  1. Extraction of events and temporal expressions from clinical narratives
    Index terms have been assigned to the content through auto-classification.

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image Journal of Biomedical Informatics
    Journal of Biomedical Informatics  Volume 46, Issue
    December, 2013
    62 pages

    Publisher

    Elsevier Science

    San Diego, CA, United States

    Publication History

    Published: 01 December 2013

    Author Tags

    1. Electronic health records
    2. Information extraction
    3. Integer quadratic programmming
    4. Named entity recognition
    5. Natural language processing
    6. Temporal extraction

    Qualifiers

    • Article

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)0
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 30 Aug 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2023)Time expression recognition and normalization: a surveyArtificial Intelligence Review10.1007/s10462-023-10400-y56:9(9115-9140)Online publication date: 24-Jan-2023
    • (2019)Time Recognition of Chinese Electronic Medical Record of Depression Based on Conditional Random FieldBrain Informatics10.1007/978-3-030-37078-7_15(149-158)Online publication date: 13-Dec-2019
    • (2016)As Time Goes ByProceedings of the 25th International Conference on World Wide Web10.1145/2872427.2883055(915-925)Online publication date: 11-Apr-2016
    • (2014)Joint inference for end-to-end coreference resolution for clinical notesProceedings of the 5th ACM Conference on Bioinformatics, Computational Biology, and Health Informatics10.1145/2649387.2649437(192-201)Online publication date: 20-Sep-2014

    View Options

    View options

    Get Access

    Login options

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media