Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/2663712.2666187acmconferencesArticle/Chapter ViewAbstractPublication PageswsdmConference Proceedingsconference-collections
research-article

Documents Search Using Semantics Criteria

Published: 07 November 2014 Publication History

Abstract

Current Information Retrieval systems generally search documents using a keywords model, which is often not expressive enough for the user. In this paper we describe some directions for improving an Information Retrieval system by letting the user specify different semantics constraints in her query, using a language based on a simplified version of first-order logic. The user can write queries that express the association between objects and attributes, temporal constraints and negation of attributes, and also perform synonyms expansion of queries. In order to evaluate the relevance of a candidate document with respect to the query, the dependency parse tree of the document is used, as well as other linguistic resources. The system was evaluated using a set of queries and a corpus extracted from the British newspaper The Times. The results are compared against the newspaper's own search engine and they look promising, showing an important improvement in precision in the first documents returned by the query.

References

[1]
T. Strzalkowski, G. Stein, G. Bowden Wise, P. Tapanainen, T. Jarvinen, A. Voutilainen, J. Karlgren, "Language Information Retrieval: TREC-7 Report", 1998.
[2]
C. Liu, H. Wang, S. McClean, E. Kapetianos, D. Carroll, "Weighting Common Syntactic Structures for Natural Language Based Information Retrieval", Proceedings of the 19th ACM International Conference on Information and Knowledge Management, 2010.
[3]
J. Liu, P. Pasupat, Y. Wang, S. Cyphers, J. Glass, "Query Undersanting Enhanced by Hierarchical Parsing Structures", 2013 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), 2013.
[4]
J. H. Park, W. B. Croft, "Query Term Ranking based on Dependency Parsing of Verbose Queries", Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval Pages 829--830, 2010.
[5]
K. T. Maxwell, J. Oberlander, W. B. Croft, "Feature-Based Selection of Dependency Paths in Ad Hoc Information Retrieval", Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013, pp 507--516.
[6]
C. D. Manning, P. Raghavan, H. Schütze, "Introduction to Information Retrieval", Cambridge University Press, 2008, s. 1.1, p. 4.
[7]
"CQL, Contextual Query Language", Library of Congress, http://www.loc.gov/standards/sru/cql/, last accessed 29 March 2014.
[8]
TIMEX3 Specification, "TimeML 1.2.1 A Formal Specification Language for Events and Temporal Expressions", http://www.timeml.org/site/publications/timeMLdocs/timeml_1.2.1.html#timex3, last accessed 23 April 2014.
[9]
G. A. Miller, "WordNet: A Lexical Database for English", Communications of the ACM Vol. 38, 1995, No. 11: 39--41.
[10]
C. Fellbaum, "WordNet: An Electronic Lexical Database", Cambridge, MA: MIT Press, 1998.
[11]
"Apache Lucene -- Apache Solr", The Apache Software Foundation, http://lucene.apache.org/solr/, last accessed 20 April 2014.
[12]
C. D. Manning, P. Raghavan, H. Schütze, "Introduction to Information Retrieval", Cambridge University Press, 2008, s. 1.2, pp. 6--9.
[13]
M. C. de Marneffe, B. MacCartney, C. D. Manning, 2006, "Generating Typed Dependency Parses from Phrase Structure Parses", LREC 2006.
[14]
A. X. Chang, C. D. Manning, 2012, "SUTIME: A Library for Recognizing and Normalizing Time Expressions", 8th International Conference on Language Resources and Evaluation, LREC 2012.
[15]
"The Times", Times Newspaper Limited 2014, http://www.thetimes.co.uk, last accessed 20 April 2014.

Cited By

View all
  • (2015)Report on the Seventh Workshop on Exploiting Semantic Annotations in Information Retrieval (ESAIR'14)ACM SIGIR Forum10.1145/2795403.279541249:1(27-34)Online publication date: 23-Jun-2015

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
ESAIR '14: Proceedings of the 7th International Workshop on Exploiting Semantic Annotations in Information Retrieval
November 2014
52 pages
ISBN:9781450313650
DOI:10.1145/2663712
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 07 November 2014

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. dependency parsing
  2. information retrieval
  3. natural language processing
  4. query language
  5. semantics

Qualifiers

  • Research-article

Conference

CIKM '14
Sponsor:

Acceptance Rates

ESAIR '14 Paper Acceptance Rate 11 of 15 submissions, 73%;
Overall Acceptance Rate 35 of 55 submissions, 64%

Upcoming Conference

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 09 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2015)Report on the Seventh Workshop on Exploiting Semantic Annotations in Information Retrieval (ESAIR'14)ACM SIGIR Forum10.1145/2795403.279541249:1(27-34)Online publication date: 23-Jun-2015

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media