Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.5555/1870658.1870669dlproceedingsArticle/Chapter ViewAbstractPublication PagesemnlpConference Proceedingsconference-collections
research-article
Free access

Positional language models for clinical information retrieval

Published: 09 October 2010 Publication History

Abstract

The PECO framework is a knowledge representation for formulating clinical questions. Queries are decomposed into four aspects, which are Patient-Problem (P), Exposure (E), Comparison (C) and Outcome (O). However, no test collection is available to evaluate such framework in information retrieval. In this work, we first present the construction of a large test collection extracted from systematic literature reviews. We then describe an analysis of the distribution of PECO elements throughout the relevant documents and propose a language modeling approach that uses these distributions as a weighting strategy. In our experiments carried out on a collection of 1.5 million documents and 423 queries, our method was found to lead to an improvement of 28% in MAP and 50% in P@5, as compared to the state-of-the-art method.

References

[1]
Florian Boudin, Jian-Yun Nie, and Martin Dawes. 2010a. Clinical Information Retrieval using Document and PICO Structure. In Proceedings of the HLT-NAACL 2010 conference, pages 822--830.
[2]
Florian Boudin, Lixin Shi, and Jian-Yun Nie. 2010b. Improving Medical Information Retrieval with PICO Element Detection. In Proceedings of the ECIR 2010 conference, pages 50--61.
[3]
Grace Y. Chung. 2009. Sentence retrieval for abstracts of randomized controlled trials. BMC Medical Informatics and Decision Making, 9(1).
[4]
Thomas Owens Sheri Keitz Connie Schardt, Martha B Adams and Paul Fontelo. 2007. Utilization of the PICO framework to improve searching for clinical questions. BMC Medical Informatics and Decision Making, 7(1).
[5]
Martin Dawes, Pierre Pluye, Laura Shea, Roland Grad, Arlene Greenberg, and Jian-Yun Nie. 2007. The identification of clinically important elements within medical journal abstracts: PatientPopulationProblem, ExposureIntervention, Comparison, Outcome, Duration and Results (PECODR). Informatics in Primary care, 15(1):9--16.
[6]
D. Demner-Fushman and J. Lin. 2007. Answering clinical questions with knowledge-based and statistical techniques. Computational Linguistics, 33(1):63--103.
[7]
William R. Hersh, Katherine Crabtree, David H. Hickam, Lynetta Sacherek, Linda Rose, and Charles P. Friedman. 2000. Factors associated with successful answering of clinical questions using an information retrieval system. Bulletin of the Medical Library Association, 88(4):323--331.
[8]
Larry McKnight and Padmini Srinivasan. 2003. Categorization of sentence types in medical abstracts. Proceedings of the AMIA annual symposium.
[9]
Donald Metzler and W. Bruce Croft. 2005. A Markov random field model for term dependencies. In Proceedings of the SIGIR conference, pages 472--479.
[10]
Yun Niu, Graeme Hirst, Gregory McArthur, and Patricia Rodriguez-Gianolli. 2003. Answering clinical questions with role identification. In Proceedings of the ACL 2003 Workshop on Natural Language Processing in Biomedicine, pages 73--80.
[11]
Pierre Pluye, Roland M. Grad, Lynn G. Dunikowski, and Randolph Stephenson. 2005. Impact of clinical information-retrieval technology on physicians: a literature review of quantitative, qualitative and mixed methods studies. International Journal of Medical Informatics, 74(9):745--768.
[12]
Jay M. Ponte and W. Bruce Croft. 1998. A language modeling approach to information retrieval. In Proceedings of the SIGIR conference, pages 275--281.
[13]
Scott W. Richardson, Mark C. Wilson, Jim Nishikawa, and Robert S. Hayward. 1995. The well-built clinical question: a key to evidence-based decisions. ACP Journal Club, 123(3):A12--13.
[14]
David L. Sackett, William Rosenberg, J. A. Muir Gray, Brian Haynes, and W. Scott Richardson. 1996. Evidence based medicine: what it is and what it isn't. British medical journal, 312:71--72.

Cited By

View all
  • (2018)SIGIR 2018 Tutorial on Health Search (HS2018)The 41st International ACM SIGIR Conference on Research & Development in Information Retrieval10.1145/3209978.3210188(1391-1394)Online publication date: 27-Jun-2018
  • (2016)Aggregating semantic information nuggets for answering clinical queriesProceedings of the 31st Annual ACM Symposium on Applied Computing10.1145/2851613.2851694(1041-1047)Online publication date: 4-Apr-2016
  • (2012)Using a medical thesaurus to predict query difficultyProceedings of the 34th European conference on Advances in Information Retrieval10.1007/978-3-642-28997-2_46(480-484)Online publication date: 1-Apr-2012
  1. Positional language models for clinical information retrieval

        Recommendations

        Comments

        Information & Contributors

        Information

        Published In

        cover image DL Hosted proceedings
        EMNLP '10: Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing
        October 2010
        1332 pages

        Publisher

        Association for Computational Linguistics

        United States

        Publication History

        Published: 09 October 2010

        Qualifiers

        • Research-article

        Acceptance Rates

        Overall Acceptance Rate 73 of 234 submissions, 31%

        Contributors

        Other Metrics

        Bibliometrics & Citations

        Bibliometrics

        Article Metrics

        • Downloads (Last 12 months)42
        • Downloads (Last 6 weeks)4
        Reflects downloads up to 08 Mar 2025

        Other Metrics

        Citations

        Cited By

        View all
        • (2018)SIGIR 2018 Tutorial on Health Search (HS2018)The 41st International ACM SIGIR Conference on Research & Development in Information Retrieval10.1145/3209978.3210188(1391-1394)Online publication date: 27-Jun-2018
        • (2016)Aggregating semantic information nuggets for answering clinical queriesProceedings of the 31st Annual ACM Symposium on Applied Computing10.1145/2851613.2851694(1041-1047)Online publication date: 4-Apr-2016
        • (2012)Using a medical thesaurus to predict query difficultyProceedings of the 34th European conference on Advances in Information Retrieval10.1007/978-3-642-28997-2_46(480-484)Online publication date: 1-Apr-2012

        View Options

        View options

        PDF

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader

        Login options

        Figures

        Tables

        Media

        Share

        Share

        Share this Publication link

        Share on social media