Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/1277741.1277853acmconferencesArticle/Chapter ViewAbstractPublication PagesirConference Proceedingsconference-collections
Article

Knowledge-intensive conceptual retrieval and passage extraction of biomedical literature

Published: 23 July 2007 Publication History
  • Get Citation Alerts
  • Abstract

    This paper presents a study of incorporating domain-specific knowledge (i.e., information about concepts and relationships between concepts in a certain domain) in an information retrieval (IR) system to improve its effectiveness in retrieving biomedical literature. The effects of different types of domain-specific knowledge in performance contribution are examined. Based on the TREC platform, we show that appropriate use of domain-specific knowledge in a proposed conceptual retrieval model yields about 23% improvement over the best reported result in passage retrieval in the Genomics Track of TREC 2006.

    References

    [1]
    Aronson A.R., Rindflesch T.C. Query expansion using the UMLS Metathesaurus. Proc AMIA Annu Fall Symp. 1997. 485--9.
    [2]
    Baeza-Yates R., Ribeiro-Neto B. Modern Information Retrieval. Addison-Wesley, 1999, 129--131.
    [3]
    Buttcher S., Clarke C.L.A., Cormack G.V. Domain-specific synonym expansion and validation for biomedical information retrieval (MultiText experiments for TREC 2004). TREC'04.
    [4]
    Chang J.T., Schutze H., Altman R.B. Creating an online dictionary of abbreviations from MEDLINE. Journal of the American Medical Informatics Association. 2002 9(6).
    [5]
    Church K.W., Hanks P. Word association norms, mutual information and lexicography. Computational Linguistics. 1990;16:22, C29.
    [6]
    Fontelo P., Liu F., Ackerman M. askMEDLINE: a free-text, natural language query tool for MEDLINE/ . BMC Med Inform Decis Mak. 2005 Mar 10;5(1):5.
    [7]
    Fukuda K., Tamura A., Tsunoda T., Takagi T. Toward information extraction: identifying protein names from biological papers. Pac Symp Biocomput. 1998;:707--18.
    [8]
    Hersh W.R., and etc. TREC 2006 Genomics Track Overview. TREC'06.
    [9]
    Hersh W.R., and etc. TREC 2005 Genomics Track Overview. In TREC'05.
    [10]
    Hersh W.R., and etc. TREC 2004 Genomics Track Overview. In TREC'04.
    [11]
    Hersh W.R., Price S., Donohoe L. Assessing thesaurus-based query expansion using the UMLS Metathesaurus. Proc AMIA Symp. 344--8. 2000.
    [12]
    Levenshtein, V. Binary codes capable of correcting deletions, insertions, and reversals. Soviet Physics -- Doklady 10, 10 (1996), 707--710.
    [13]
    Lin J., Demner-Fushman D. The Role of Knowledge in Conceptual Retrieval: A Study in the Domain of Clinical Medicine. SIGIR'06. 99--06.
    [14]
    Lindberg D., Humphreys B., and McCray A. The Unified Medical Language System. Methods of Information in Medicine. 32(4):281--291, 1993.
    [15]
    Liu S., Liu F., Yu C., and Meng W.Y. An Effective Approach to Document Retrieval via Utilizing WordNet and Recognizing Phrases. SIGIR'04. 266--272.
    [16]
    Proux D., Rechenmann F., Julliard L., Pillet V.V., Jacq B. Detecting Gene Symbols and Names in Biological Texts: A First Step toward Pertinent Information Extraction. Genome Inform Ser Workshop Genome Inform. 1998;9:72--80.
    [17]
    Robertson S.E., Walker S. Okapi/Keenbow at TREC-8. NIST Special Publication 500--246: TREC 8.
    [18]
    Sackett D.L., and etc. Evidence-Based Medicine: How to Practice and Teach EBM. Churchill Livingstone. Second edition, 2000.
    [19]
    Swanson,D.R., Smalheiser,N.R. An interactive system for finding complementary literatures: a stimulus to scientific discovery. Artificial Intelligence, 1997; 91,183--203.
    [20]
    Voorhees E. Query expansion using lexical-semantic relations. SIGIR 1994. 61--9.
    [21]
    Zhong M., Huang X.J. Concept-based biomedical text retrieval. SIGIR'06. 723--4.
    [22]
    Zhou W., Torvik V.I., Smalheiser N.R. ADAM: Another Database of Abbreviations in MEDLINE. Bioinformatics. 2006; 22(22): 2813--2818.

    Cited By

    View all
    • (2023)Deep Learning–Based Named Entity Recognition and Resolution of Referential Ambiguities for Enhanced Information Extraction from Construction Safety RegulationsJournal of Computing in Civil Engineering10.1061/(ASCE)CP.1943-5487.000106437:5Online publication date: Sep-2023
    • (2020)Improving Document Relevant accuracy by distinguish Doc2query Matching Mechanisms on Biomedical Literature2020 10th International Conference on Cloud Computing, Data Science & Engineering (Confluence)10.1109/Confluence47617.2020.9058299(727-732)Online publication date: Jan-2020
    • (2019)Part Name Normalization2019 IEEE International Conference on Prognostics and Health Management (ICPHM)10.1109/ICPHM.2019.8819386(1-6)Online publication date: Jun-2019
    • Show More Cited By

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    SIGIR '07: Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
    July 2007
    946 pages
    ISBN:9781595935977
    DOI:10.1145/1277741
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 23 July 2007

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. biomedical documents
    2. document retrieval
    3. passage extraction

    Qualifiers

    • Article

    Conference

    SIGIR07
    Sponsor:
    SIGIR07: The 30th Annual International SIGIR Conference
    July 23 - 27, 2007
    Amsterdam, The Netherlands

    Acceptance Rates

    Overall Acceptance Rate 792 of 3,983 submissions, 20%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)0
    • Downloads (Last 6 weeks)0

    Other Metrics

    Citations

    Cited By

    View all
    • (2023)Deep Learning–Based Named Entity Recognition and Resolution of Referential Ambiguities for Enhanced Information Extraction from Construction Safety RegulationsJournal of Computing in Civil Engineering10.1061/(ASCE)CP.1943-5487.000106437:5Online publication date: Sep-2023
    • (2020)Improving Document Relevant accuracy by distinguish Doc2query Matching Mechanisms on Biomedical Literature2020 10th International Conference on Cloud Computing, Data Science & Engineering (Confluence)10.1109/Confluence47617.2020.9058299(727-732)Online publication date: Jan-2020
    • (2019)Part Name Normalization2019 IEEE International Conference on Prognostics and Health Management (ICPHM)10.1109/ICPHM.2019.8819386(1-6)Online publication date: Jun-2019
    • (2019)Document/Query Expansion based on Selecting Significant Concepts for Context Based Retrieval of Medical ImagesJournal of Biomedical Informatics10.1016/j.jbi.2019.103210(103210)Online publication date: May-2019
    • (2018)Automatic quality measurement for health information on the internetInternational Journal of Intelligent Information and Database Systems10.1504/IJIIDS.2014.0683408:4(340-358)Online publication date: 14-Dec-2018
    • (2018)Semantic Sequential Query Expansion for Biomedical Article SearchIEEE Access10.1109/ACCESS.2018.28618696(45448-45457)Online publication date: 2018
    • (2018)Semantic concept-enriched dependence model for medical information retrievalJournal of Biomedical Informatics10.1016/j.jbi.2013.08.01347:C(18-27)Online publication date: 27-Dec-2018
    • (2018)Mining and modeling linkage information from citation context for improving biomedical literature retrievalInformation Processing and Management: an International Journal10.1016/j.ipm.2010.03.01047:1(53-67)Online publication date: 29-Dec-2018
    • (2018)Passage extraction and result combination for genomics information retrievalJournal of Intelligent Information Systems10.1007/s10844-009-0097-434:3(249-274)Online publication date: 28-Dec-2018
    • (2018)Unsupervised Named Entity Normalization for Supporting Information Fusion for Big Bridge Data AnalyticsAdvanced Computing Strategies for Engineering10.1007/978-3-319-91638-5_7(130-149)Online publication date: 19-May-2018
    • Show More Cited By

    View Options

    Get Access

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media