Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/2166896.2166922acmotherconferencesArticle/Chapter ViewAbstractPublication Pagesswat4lsConference Proceedingsconference-collections
research-article

Exploiting text-rich descriptions for faceted discovery of web resources

Published: 07 December 2011 Publication History
  • Get Citation Alerts
  • Abstract

    Open metadata registries are a fundamental tool for researchers in the Life Sciences trying to locate resources such as web services or databases. While sophisticated standards have been produced for annotating these resources with rich, well-structured metadata, evidence shows that in open registries a majority of annotations simply consists of informal free text descriptions. This reality must be taken into account in order to develop effective techniques for resource discovery in the Life Sciences.
    In this work we propose a method for resource discovery which is able to exploit such textual descriptions to find relevant resources. It is a requirement-driven approach, in which the user specifies informational needs as a target task and a set of facets of interest, expressed using free text.
    We have conducted several experiments on resources extracted from the BioCatalogue registry. For a sample set of queries that reflect common Bioinformatics-related research questions, the results show that our method is effective and provides useful answers.

    References

    [1]
    N. Y. Ayadi, Z. Lacroix, and M.-E. Vidal. BiOnMap: a deductive approach for resource discovery. In iiWAS, pages 477--482, 2008.
    [2]
    R. Berlanga, V. Nebot, and E. Jimenez-Ruiz. Semantic annotation of biomedical texts through concept retrieval. In Procesamiento del Lenguaje Natural, 2010.
    [3]
    J. Bhagat, F. Tanoh, E. Nzuobontane, T. Laurent, J. Orlowski, M. Roos, K. Wolstencroft, S. Aleksejevs, R. Stevens, S. Pettifer, R. Lopez, and C. Globe. BioCatalogue: a universal catalogue of web services for the life sciences. NAR, 2010.
    [4]
    D. M. Blei, A. Y. Ng, and M. I. Jordan. Latent dirichlet allocation. J. Mach. Learn. Res., 3:993--1022, March 2003.
    [5]
    B. Carterette and P. Chandar. Probabilistic Models of Novel Document Rankings for Faceted Topic Retrieval. In CIKM'09, 2009.
    [6]
    F. M. Couto, M. J. Silva, and P. Coutinho. Finding genomic ontology terms in text using evidence content. BMC Bioinformatics, 6(S-1), 2005.
    [7]
    T. L. Griffiths and M. Steyvers. Finding scientific topics. Proceedings of the National Academy of Sciences of the United States of America, 101(Suppl 1):5228--5235, Apr. 2004.
    [8]
    A. Jimeno-Yepes, R. Berlanga, and D. Rebholz-Schuhmann. Ontology refinement for improved information retrieval. Inf. Process. Manage., 46(4):426--435, 2010.
    [9]
    A. Jimeno-Yepes, E. Jiménez-Ruiz, R. Berlanga, and D. Rebholz-Schuhmann. Reuse of terminological resources for efficient ontological engineering in life sciences. BMC Bioinformatics, 10(S-10):4, 2009.
    [10]
    J. Kahan, M.-R. Koivunen, E. Prud'Hommeaux, and R. R. Swick. Annotea: An open rdf infrastructure for shared web annotations, 2001.
    [11]
    A. Kiryakov, B. Popov, I. Terziev, D. Manov, and D. Ognyanoff. Semantic annotation, indexing, and retrieval. Web Semantics: Science, Services and Agents on the World Wide Web, 2(1), 2011.
    [12]
    P. Lord, P. Alper, C. Wroe, and C. Goble. Feta: A Light-Weight Architecture for User Oriented Semantic Service Discovery. pages 17--31. 2005.
    [13]
    D. J. MacKay and L. C. B. Peto. A hierarchical dirichlet language model. Natural Language Engineering, 1:1--19, 1994.
    [14]
    P. Missier, S. Soiland-Reyes, S. Owen, W. Tan, A. Nenadic, I. Dunlop, A. Williams, T. Oinn, and C. Goble. Taverna, reloaded. In M. Gertz, T. Hey, and B. Ludaescher, editors, SSDBM 2010, Heidelberg, Germany, June 2010.
    [15]
    A. Mottaz, Y. L. Yip, P. Ruch, and A.-L. Veuthey. Mapping proteins to disease terminologies: from UniProt to MeSH. BMC Bioinformatics, 9(S-5), 2008.
    [16]
    J. M. Pérez, R. B. Llavori, and M. J. A. Cabo. A relevance model for a data warehouse contextualized with documents. Inf. Process. Manage., 45(3):356--367, 2009.
    [17]
    M. Pérez, R. Berlanga, and I. Sanz. A semantic approach for the requirement-driven discovery of web services in the Life Sciences. In Proceedings of the 3rd International Workshop on SWAT4LS., 2010.
    [18]
    S. Pettifer, D. Thorne, P. McDermott, T. Attwood, J. Baran, J. Bryne, Hupponen, T., D. Mowbray, and G. Vriend. An active registry for bioinformatics web services. Bioinformatics, 25:2090--2091, 2010.
    [19]
    J. Rios, J. Karlsson, and O. Trelles. Magallanes: a web services discovery and automatic workflow composition tool. BMC Bioinformatics, 10(1):334, 2009.
    [20]
    G. M. Sacco and Y. Tzitzikas. Dynamic Taxonomies and Faceted Search: Theory, Practice, and Experience, volume 25. Springer, 2009.
    [21]
    M. Steyvers and T. Griffiths. Probabilistic Topic Models. Lawrence Erlbaum Associates, 2007.
    [22]
    The BioMoby Consortium. Interoperability with Moby 1.0. It's better than sharing your toothbrush! Briefings in Bioinformatics, 2008.
    [23]
    D. Tran, C. Dubay, P. Gorman, and W. Hersh. Applying Task Analysis to Describe and Facilitate Bioinformatics Tasks. MEDINFO, 107(Pt 2):818, 2004.
    [24]
    V. Uren, P. Cimiano, J. Iria, S. Handschuh, M. Vargas-Vera, E. Motta, and F. Ciravegna. Semantic annotation for knowledge management: Requirements and a survey of the state of the art. Web Semantics: Science, Services and Agents on the World Wide Web, 4(1):14--28, Jan. 2006.
    [25]
    K. Wolstencroft, P. Alper, D. Hull, C. Wroe, P. W. Lord, R. D. Stevens, and C. A. Goble. The myGrid ontology: bioinformatics service discovery. Int. J. Bioinformatics Res. Appl., 3(3):303--325, 2007.

    Index Terms

    1. Exploiting text-rich descriptions for faceted discovery of web resources

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Other conferences
      SWAT4LS '11: Proceedings of the 4th International Workshop on Semantic Web Applications and Tools for the Life Sciences
      December 2011
      139 pages
      ISBN:9781450310765
      DOI:10.1145/2166896
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Sponsors

      • Ontotext
      • Corporate Semantic Web: Corporate Semantic Web
      • BBRC: Biotechnology and Biological Sciences Research Council
      • NCBO: National Center for BioMedical Ontology
      • BioMed Central: BioMed Central

      In-Cooperation

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 07 December 2011

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. life sciences
      2. semantics
      3. user-driven
      4. web resources faceted discovery

      Qualifiers

      • Research-article

      Funding Sources

      Conference

      SWAT4LS '11
      Sponsor:
      • Corporate Semantic Web
      • BBRC
      • NCBO
      • BioMed Central

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • 0
        Total Citations
      • 66
        Total Downloads
      • Downloads (Last 12 months)0
      • Downloads (Last 6 weeks)0
      Reflects downloads up to 10 Aug 2024

      Other Metrics

      Citations

      View Options

      Get Access

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media