Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/2396761.2398551acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
short-paper

Entity centric query expansion for enterprise search

Published: 29 October 2012 Publication History

Abstract

Enterprise search is important, and the search quality has a direct impact on the productivity of an enterprise. Many information needs of enterprise search center around entities. Intuitively, information related to the entities mentioned in the query, such as related entities, would be useful to reformulate the query and improve the retrieval performance. However, most existing studies on query expansion are term-centric. In this paper, we propose a novel entity-centric query expansion framework for enterprise search. Specifically, given a query containing entities, we first utilize both unstructured and structured information to find entities that are related to the ones in the query. We then discuss how to adapt existing feedback methods to use the related entities to improve search quality. Experiment results show that the proposed entity-centric query expansion strategy is more effective to improve the search performance than the state-of-the-art pseudo feedback methods on longer, natural language-like queries with entities.

References

[1]
K. Balog, L. Azzopardi, and M. de Rijke. Formal Models for Expert Finding in Enterprise Corpora. In SIGIR, pages 43--50, 2006.
[2]
K. Balog, A. P. de Vries, P. Serdyukov, P. Thomas, and T. Westerveld. Overview of the TREC 2009 Entity Track. In Proceedings of TREC, 2010.
[3]
K. Balog, P. Serdyukov, and A. P. de Vries. Overview of the TREC 2010 Entity Track. In Proceedings of TREC, 2011.
[4]
K. Balog, I. Soboroff, P. Thomas, P. Bailey, N. Craswell, and A. P. de Vries. Overview of the TREC 2008 Enterprise Track. In Proceedings of TREC '08, 2008.
[5]
A. Carlson, J. Betteridge, B. Kisiel, B. Settles, E. R. H. Jr., and T. M. Mitchell. Toward an Architecture for Never-Ending Language Learning. In AAAI, 2010.
[6]
W. W. Cohen, P. Ravikumar, and S. E. Fienberg. A Comparison of String Distance Metrics for Name-Matching Tasks. In IJCAI, pages 73--78, 2003.
[7]
N. Craswell, A. P. de Vries, and I. Soboroff. Overview of the TREC 2005 Enterprise Track. In Proceedings of TREC '05, 2005.
[8]
G. Demartini, T. Iofciu, and A. de Vries. Overview of the INEX 2009 Entity Ranking Track. In Focused Retrieval and Evaluation, 2010.
[9]
A. Doan, L. G. R. Ramakrishnan, and S. Vaithyanathan. Introduction to the Special Issue on Managing Information Extraction. SIGMOD Record, 37(4), 2009.
[10]
S. Feldman and C. Sherman. The High Cost of Not Finding Information. In Technical Report No. 29127, IDC, 2003.
[11]
D. Hawking. Challenges in Enterprise Search. In Proceedings of ADC '04, pages 15--24, 2004.
[12]
M. Kolla and O. Vechtomova. Retrieval of Discussions from Enterprise Mailing Lists. In SIGIR, pages 881--882, 2007.
[13]
J. D. Lafferty, A. McCallum, and F. C. N. Pereira. Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data. In ICML, pages 282--289, 2001.
[14]
V. Lavrenko and W. B. Croft. Relevance-Based Language Models. In SIGIR, pages 120--127, 2001.
[15]
X. Liu, H. Fang, C.-L. Yao, and M. Wang. Finding Relevant Information of Certain Types from Enterprise Data. In CIKM, pages 47--56, 2011.
[16]
C. Macdonald and I. Ounis. Combining Fields in Known-Item Email Search. In SIGIR, pages 675--676, 2006.
[17]
J. M. Ponte and W. B. Croft. A Language Modeling Approach to Information Retrieval. In SIGIR, pages 275--281, 1998.
[18]
S. Sarawagi. Information Extraction. Foundations and Trends in Databases, 1(3):261--377, 2008.
[19]
P. Serdyukov, H. Rode, and D. Hiemstra. Modeling Multi-step Relevance Propagation for Expert Finding. In CIKM, pages 1133--1142, 2008.
[20]
D. Zelenko, C. Aone, and A. Richardella. Kernel methods for relation extraction. J. Mach. Learn. Res., 3:1083--1106, Mar. 2003.
[21]
C. Zhai and J. Lafferty. A Study of Smoothing Methods for Language Models Applied to Ad Hoc Information Retrieval. In SIGIR, pages 334--342, 2001.
[22]
C. Zhai and J. Lafferty. Model-Based Feedback in the Language Modeling Approach to Information Retrieval. In CIKM, 2001.

Cited By

View all
  • (2014)EntEXPOProceedings of the 36th European Conference on IR Research on Advances in Information Retrieval - Volume 841610.5555/2964060.2964139(784-788)Online publication date: 13-Apr-2014
  • (2014)EntEXPO: An Interactive Search System for Entity-Bearing QueriesAdvances in Information Retrieval10.1007/978-3-319-06028-6_96(784-788)Online publication date: 2014

Index Terms

  1. Entity centric query expansion for enterprise search

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    CIKM '12: Proceedings of the 21st ACM international conference on Information and knowledge management
    October 2012
    2840 pages
    ISBN:9781450311564
    DOI:10.1145/2396761
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 29 October 2012

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. combining structured and unstructured data
    2. enterprise search
    3. entity centric
    4. query expansion
    5. retrieval

    Qualifiers

    • Short-paper

    Conference

    CIKM'12
    Sponsor:

    Acceptance Rates

    Overall Acceptance Rate 1,861 of 8,427 submissions, 22%

    Upcoming Conference

    CIKM '25

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)4
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 13 Jan 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2014)EntEXPOProceedings of the 36th European Conference on IR Research on Advances in Information Retrieval - Volume 841610.5555/2964060.2964139(784-788)Online publication date: 13-Apr-2014
    • (2014)EntEXPO: An Interactive Search System for Entity-Bearing QueriesAdvances in Information Retrieval10.1007/978-3-319-06028-6_96(784-788)Online publication date: 2014

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media