Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/2487788.2488159acmotherconferencesArticle/Chapter ViewAbstractPublication PageswebconfConference Proceedingsconference-collections
research-article

Using SKOS vocabularies for improving web search

Published: 13 May 2013 Publication History
  • Get Citation Alerts
  • Abstract

    Knowledge organization systems such as thesauri or taxonomies are increasingly being expressed using the Simple Knowledge Organization System (SKOS) and published as structured data on the Web. Search engines can exploit these vocabularies and improve search by expanding terms at query or document indexing time. We propose a SKOS-based term expansion and scoring technique that leverages labels and semantic relationships of SKOS concept definitions. We also implemented this technique for Apache Lucene and Solr. Experiments with the Medical Subject Headings vocabulary and an early evaluation with Library of Congress Subject Headings indicated gains in precision when using SKOS-based expansion compared to pseudo relevance feedback and no expansion. Our findings are important for publishers and consumer of Web vocabularies who want to use them for improving search over Web documents.

    References

    [1]
    J. Bai, D. Song, P. Bruza, J.-Y. Nie, and G. Cao. Query expansion using term relationships in language models for information retrieval. In SIGIR'05, pages 688--69 ACM, 2005.
    [2]
    J. Bhogal, A. Macfarlane, and P. Smith. A review of ontology based query expansion. Information Processing & Management, 43(4):866--886, 2007.
    [3]
    C. Carpineto and G. Romano. A survey of automatic query expansion in information retrieval. ACM Computing Surveys (CSUR), 44(1):1, 2012.
    [4]
    T. Heath and C. Bizer. Linked Data: Evolving the Web into a Global Data Space. Morgan & Claypool, 2011.
    [5]
    W. Hersh, C. Buckley, T. J. Leone, and D. Hickam. OHSUMED: an interactive retrieval evaluation and new large test collection for research. In SIGIR '94, pages 192--201, Aug. 1994.
    [6]
    W. Hersh, S. Price, and L. Donohoe. Assessing thesaurus-based query expansion using the UMLS Metathesaurus. In Proceedings of the AMIA Symposium, page 344, Jan. 2000.
    [7]
    J. Lin and D. Demner-Fushman. The role of knowledge in conceptual retrieval. In SIGIR '06, pages 99--106, Aug. 2006.
    [8]
    N. A. A. Manaf, S. Bechhofer, and R. Stevens. The current state of skos vocabularies on the web. In ESWC, pages 270--284, 2012.
    [9]
    A. Miles and S. Bechhofer. SKOS Simple Knowledge Organization System Reference. Recommendation, W3C, 2009.
    [10]
    A. Miles, B. Matthews, M. Wilson, and D. Brickley. SKOS Core: Simple knowledge organisation for the Web, Dec. 2005.
    [11]
    E. Summers, A. Isaac, C. Redding, and D. Krech. Lcsh, skos and linked data. arXiv preprint arXiv:0805.2855, 2008.
    [12]
    M. Theobald, R. Schenkel, and G. Weikum. Efficient and self-tuning incremental query expansion for top-k query processing. In SIGIR '05, pages 242--249, Aug. 2005.
    [13]
    M. Van Assem, V. Malaisé, A. Miles, and G. Schreiber. A Method to Convert Thesauri to SKOS. The Semantic Web: Research and Applications, pages 95--109, 2006.
    [14]
    E. M. Voorhees. Query expansion using lexical-semantic relations. In SIGIR '94, pages 61--69, Aug. 1994.
    [15]
    W. Zhou, C. Yu, N. Smalheiser, V. Torvik, and J. Hong. Knowledge-intensive conceptual retrieval and passage extraction of biomedical literature. In SIGIR '07, pages 655--662, July 2007.

    Cited By

    View all
    • (2021)Validating Ontology-based Annotations of Biomedical Resources using Zero-shot LearningThe 12th International Conference on Computational Systems-Biology and Bioinformatics10.1145/3486713.3486730(37-43)Online publication date: 14-Oct-2021
    • (2021)Documenting Context-Based Quality Assessment of Controlled VocabulariesIEEE Transactions on Emerging Topics in Computing10.1109/TETC.2018.28650949:1(144-160)Online publication date: 1-Jan-2021
    • (2020)A User-Aware and Semantic Approach for Enterprise SearchNatural Language Processing10.4018/978-1-7998-0951-7.ch016(302-321)Online publication date: 2020
    • Show More Cited By

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    WWW '13 Companion: Proceedings of the 22nd International Conference on World Wide Web
    May 2013
    1636 pages
    ISBN:9781450320382
    DOI:10.1145/2487788

    Sponsors

    • NICBR: Nucleo de Informatcao e Coordenacao do Ponto BR
    • CGIBR: Comite Gestor da Internet no Brazil

    In-Cooperation

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 13 May 2013

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. linked data
    2. query expansion
    3. search
    4. skos
    5. thesauri

    Qualifiers

    • Research-article

    Conference

    WWW '13
    Sponsor:
    • NICBR
    • CGIBR
    WWW '13: 22nd International World Wide Web Conference
    May 13 - 17, 2013
    Rio de Janeiro, Brazil

    Acceptance Rates

    WWW '13 Companion Paper Acceptance Rate 831 of 1,250 submissions, 66%;
    Overall Acceptance Rate 1,899 of 8,196 submissions, 23%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)9
    • Downloads (Last 6 weeks)0

    Other Metrics

    Citations

    Cited By

    View all
    • (2021)Validating Ontology-based Annotations of Biomedical Resources using Zero-shot LearningThe 12th International Conference on Computational Systems-Biology and Bioinformatics10.1145/3486713.3486730(37-43)Online publication date: 14-Oct-2021
    • (2021)Documenting Context-Based Quality Assessment of Controlled VocabulariesIEEE Transactions on Emerging Topics in Computing10.1109/TETC.2018.28650949:1(144-160)Online publication date: 1-Jan-2021
    • (2020)A User-Aware and Semantic Approach for Enterprise SearchNatural Language Processing10.4018/978-1-7998-0951-7.ch016(302-321)Online publication date: 2020
    • (2018)A User-Aware and Semantic Approach for Enterprise SearchInternational Journal on Semantic Web & Information Systems10.4018/IJSWIS.201810010714:4(129-146)Online publication date: 1-Oct-2018
    • (2018)Estimating Similarity Among Entities Aided by the Web when Only the Entity Name is AvailableProceedings of the 24th Brazilian Symposium on Multimedia and the Web10.1145/3243082.3243118(253-260)Online publication date: 16-Oct-2018
    • (2018)A Rule-Based Transducer for Querying Incompletely Aligned DatasetsACM Transactions on the Web10.1145/322832812:4(1-40)Online publication date: 27-Sep-2018
    • (2018)Developing a scientific knowledge graph through conceptual linking of academic classifications2018 13th International Workshop on Semantic and Social Media Adaptation and Personalization (SMAP)10.1109/SMAP.2018.8501869(113-118)Online publication date: Sep-2018
    • (2017)Overall quality assessment of SKOS thesauriJournal of Information Science10.1177/016555151667107943:6(816-834)Online publication date: 1-Dec-2017
    • (2017)Federated semantic search using terminological thesauri for learning object discoveryJournal of Enterprise Information Management10.1108/JEIM-06-2016-011630:5(795-808)Online publication date: 11-Sep-2017
    • (2016)Exploiting semantics for searching agricultural bibliographic dataJournal of Information Science10.1177/016555151560657942:6(748-762)Online publication date: 1-Dec-2016
    • Show More Cited By

    View Options

    Get Access

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media