DOI: 10.1145/2110363.2110396
research-article

An up-to-date knowledge-based literature search and exploration framework for focused bioscience domains

Published: 28 January 2012

Abstract

In domain-specific search systems, knowledge of the domain of interest is embedded as a backbone that guides the search process. However, the knowledge used in most such systems (1) exists only for a few well-known broad domains; (2) is of a basic nature, either purely hierarchical or involving only a few relationship types; and (3) is not always kept up to date, and so misses insights from recently published results. In this paper we present a framework and implementation of a focused and up-to-date knowledge-based search system, called Scooner, that utilizes domain-specific knowledge extracted from recent bioscience abstracts. To our knowledge, this is the first attempt in the field to address all three of these shortcomings. Since its recent introduction for operational use at the Applied Biotechnology Branch of AFRL, some biologists have been using Scooner on a regular basis, and it is being made available to many more. Initial evaluations point to the promise of the approach in meeting the challenge we set out to address.

Published In

IHI '12: Proceedings of the 2nd ACM SIGHIT International Health Informatics Symposium
January 2012
914 pages
ISBN: 9781450307819
DOI: 10.1145/2110363
Publisher

Association for Computing Machinery, New York, NY, United States

Author Tags

1. domain models
2. hypothesis generation
3. information extraction
4. knowledge-based systems
5. text mining

Qualifiers

• Research-article

Conference

IHI '12: ACM International Health Informatics Symposium
January 28 - 30, 2012
Miami, Florida, USA

Cited By

• (2019) Automatic Knowledge Extraction to Build Semantic Web of Things Applications. IEEE Internet of Things Journal, 6(5):8447-8454. DOI: 10.1109/JIOT.2019.2918327. Online publication date: Oct-2019
• (2017) Knowledge-Based Biomedical Word Sense Disambiguation with Neural Concept Embeddings. 2017 IEEE 17th International Conference on Bioinformatics and Bioengineering (BIBE), pages 163-170. DOI: 10.1109/BIBE.2017.00-61. Online publication date: Oct-2017
• (2013) PREDOSE. Journal of Biomedical Informatics, 46(6):985-997. DOI: 10.1016/j.jbi.2013.07.007. Online publication date: 1-Dec-2013
• (2013) A graph-based recovery and decomposition of Swanson's hypothesis using semantic predications. Journal of Biomedical Informatics, 46(2):238-251. DOI: 10.1016/j.jbi.2012.09.004. Online publication date: 1-Apr-2013
• (2011) Semantic Predications for Complex Information Needs in Biomedical Literature. Proceedings of the 2011 IEEE International Conference on Bioinformatics and Biomedicine, pages 512-519. DOI: 10.1109/BIBM.2011.23. Online publication date: 12-Nov-2011
