Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/2169095.2169102acmotherconferencesArticle/Chapter ViewAbstractPublication PagestempwebConference Proceedingsconference-collections
research-article

Identification of top relevant temporal expressions in documents

Published: 17 April 2012 Publication History

Abstract

Temporal information is very common in textual documents, and thus, identifying, normalizing, and organizing temporal expressions is an important task in IR. Although there are some tools for temporal tagging, there is a lack in research focusing on the relevance of temporal expressions. Besides counting their frequency and verifying whether they satisfy a temporal search query, temporal expressions are often considered in isolation only. There are no methods to calculate the relevance of temporal expressions, neither in general nor with respect to a query.
In this paper, we present an approach to identify top relevant temporal expressions in documents using expression-, document-, corpus-, and query-based features. We present two relevance functions: one to calculate relevance scores for temporal expressions in general, and one with respect to a search query, which consists of a textual part, a temporal part, or both. Using two evaluation scenarios, we demonstrate the effectiveness of our approach.

References

[1]
O. Alonso, M. Gertz, and R. A. Baeza-Yates. Temporal Analysis of Document Collections: Framework and Applications. In SPIRE'10, pages 290--296, 2010.
[2]
O. Alonso, M. Gertz, and R. A. Baeza-Yates. Enhancing Document Snippets Using Temporal Information. In SPIRE'11, pages 26--31, 2011.
[3]
O. Alonso, J. Strötgen, R. Baeza-Yates, and M. Gertz. Temporal Information Retrieval: Challenges and Opportunities. In TWAW'11, pages 1--8, 2011.
[4]
I. Arikan, S. J. Bedathur, and K. Berberich. Time Will Tell: Leveraging Temporal Expressions in IR. In WSDM'09, 2009.
[5]
K. Berberich, S. J. Bedathur, O. Alonso, and G. Weikum. A Language Modeling Approach for Temporal Information Needs. In ECIR'10, pages 13--25, 2010.
[6]
Y. Bestgen and W. Vonk. Temporal Adverbials as Segmentation Markers in Discourse Comprehension. Journal of Memory and Language, 42(1):74--87, 2000.
[7]
H. Llorens, E. Saquete, and B. Navarro. TIPSem (English and Spanish): Evaluating CRFs and Semantic Roles in TempEval-2. In SemEval'10, pages 284--291, 2010.
[8]
Lucene. http://lucene.apache.org/.
[9]
J. Makkonen, H. Ahonen-myka, and M. Salmenkivi. Topic Detection and Tracking with Spatio-Temporal Evidence. In ECIR'03, pages 251--265, 2003.
[10]
C. man Au Yeung and A. Jatowt. Studying How the Past is Remembered: Towards Computational History through Large Scale Text Mining. In CIKM, pages 1231--1240, 2011.
[11]
I. Mani, J. Pustejovsky, and R. Gaizauskas, editors. The Language of Time. Oxford University Press, 2005.
[12]
C. D. Manning, P. Raghavan, and H. Schütze. Introduction to Information Retrieval. Cambridge University Press, New York, NY, USA, 2008.
[13]
C. D. Manning and H. Schuetze. Foundations of Statistical Natural Language Processing. The MIT Press, 1 edition, June 1999.
[14]
D. Metzler, R. Jones, F. Peng, and R. Zhang. Improving Search Relevance for Implicitly Temporal Queries. In SIGIR '09, pages 700--701, 2009.
[15]
S. Nunes, C. Ribeiro, and G. David. Use of Temporal Expressions in Web Search. In ECIR'08, pages 580--584, 2008.
[16]
OpenNLP. http://opennlp.sourceforge.net/.
[17]
S.-T. Park, D. M. Pennock, C. L. Giles, and R. Krovetz. Analysis of Lexical Signatures for Improving Information Persistence on the World Wide Web. ACM Trans. Inf. Syst., 22(4):540--572, 2004.
[18]
F. Schilder and C. Habel. From Temporal Expressions to Temporal Information: Semantic Tagging of News Messages. In Proceedings of the Workshop on Temporal and Spatial Information Processing, pages 65--72, 2001.
[19]
M. Shokouhi. Detecting seasonal queries by time-series analysis. In SIGIR, pages 1171--1172, 2011.
[20]
J. Strötgen and M. Gertz. HeidelTime: High Quality Rule-based Extraction and Normalization of Temporal Expressions. In SemEval'10, 2010.
[21]
J. Strötgen and M. Gertz. TimeTrails: A System for Exploring Spatio-Temporal Information in Documents. PVLDB, 3(2):1569--1572, 2010.
[22]
J. Strötgen and M. Gertz. Multilingual and Cross-domain Temporal Tagging. Language Resources and Evaluation, accepted for journal publication, 2012.
[23]
J. Strötgen, M. Gertz, and C. Junghans. An Event-centric Model for Multilingual Document Similarity. In SIGIR'11, pages 953--962, 2011.
[24]
TimeML. http://www.timeml.org/.
[25]
UIMA. http://uima.apache.org/.
[26]
M. Verhagen, R. Gaizauskas, F. Schilder, M. Hepple, G. Katz, and J. Pustejovsky. SemEval-2007 Task 15: TempEval Temporal Relation Identification. In SemEval'07, pages 75--80, 2007.
[27]
M. Verhagen and J. Pustejovsky. Temporal Processing with the TARSQI Toolkit. In Coling 2008: Companion volume: Demonstrations, pages 189--192, 2008.
[28]
M. Verhagen, R. Sauri, T. Caselli, and J. Pustejovsky. SemEval-2010 Task 13: TempEval-2. In SemEval'10, pages 57--62, 2010.
[29]
Wikipedia Featured Articles. http://en.wikipedia.org/wiki/Wikipedia: Featured_articles.

Cited By

View all
  • (2023)A survey on narrative extraction from textual dataArtificial Intelligence Review10.1007/s10462-022-10338-756:8(8393-8435)Online publication date: 6-Jan-2023
  • (2022)Semantic Modelling of Document Focus-Time for Temporal Information RetrievalCompanion Proceedings of the Web Conference 202210.1145/3487553.3524668(896-902)Online publication date: 25-Apr-2022
  • (2021)Time-Matters: Temporal Unfolding of TextsAdvances in Information Retrieval10.1007/978-3-030-72240-1_53(492-497)Online publication date: 30-Mar-2021
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
TempWeb '12: Proceedings of the 2nd Temporal Web Analytics Workshop
April 2012
55 pages
ISBN:9781450311885
DOI:10.1145/2169095
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 17 April 2012

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. ranking
  2. relevance scoring
  3. snippets
  4. temporal expressions
  5. temporal information

Qualifiers

  • Research-article

Conference

TempWeb '12

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)8
  • Downloads (Last 6 weeks)0
Reflects downloads up to 25 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2023)A survey on narrative extraction from textual dataArtificial Intelligence Review10.1007/s10462-022-10338-756:8(8393-8435)Online publication date: 6-Jan-2023
  • (2022)Semantic Modelling of Document Focus-Time for Temporal Information RetrievalCompanion Proceedings of the Web Conference 202210.1145/3487553.3524668(896-902)Online publication date: 25-Apr-2022
  • (2021)Time-Matters: Temporal Unfolding of TextsAdvances in Information Retrieval10.1007/978-3-030-72240-1_53(492-497)Online publication date: 30-Mar-2021
  • (2019)Semantic association computation: a comprehensive surveyArtificial Intelligence Review10.1007/s10462-019-09781-wOnline publication date: 20-Nov-2019
  • (2018)Leveraging linked entities to estimate focus time of short textsProceedings of the 22nd International Database Engineering & Applications Symposium10.1145/3216122.3216158(282-286)Online publication date: 18-Jun-2018
  • (2017)Towards Exploiting Social Networks for Detecting Epidemic OutbreaksGlobal Journal of Flexible Systems Management10.1007/s40171-016-0148-y18:1(61-71)Online publication date: 11-Jan-2017
  • (2017)Identifying top relevant dates for implicit time sensitive queriesInformation Retrieval Journal10.1007/s10791-017-9302-120:4(363-398)Online publication date: 5-May-2017
  • (2017)Estimating Time to Event of Future Events Based on Linguistic Cues on TwitterIntelligent Natural Language Processing: Trends and Applications10.1007/978-3-319-67056-0_5(67-97)Online publication date: 18-Nov-2017
  • (2017)Temporal Feature Space for Text ClassificationText, Speech, and Dialogue10.1007/978-3-319-64206-2_41(362-370)Online publication date: 29-Jul-2017
  • (2016)SOGOU-2012-CRAWLProceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval10.1145/2911451.2914668(709-712)Online publication date: 7-Jul-2016
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media