Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
article

Temporal document retrieval model for business news archives

Published: 01 May 2005 Publication History

Abstract

Temporal expressions occurring in business news, such as "last week" or "at the end of this month," carry important information about the time context of the news document and were proved to be useful for document retrieval. We found that about 10% of these expressions are difficult to project onto the calendar due to the uncertainty about their bounds. This paper introduces a novel approach to representing temporal expressions. A user study is conducted to measure the degree of uncertainty for selected temporal expressions and a method for representing uncertainty based on fuzzy numbers is proposed. The classical Vector Space Model is extended to the Temporal Document Retrieval Model (TDRM) that incorporates the proposed fuzzy representations of temporal expressions.

References

[1]
Abramowicz, W., Chmiel, D., Kalczynski, P. J., & Wecel, K. A. (2001). Time consistency among structured and unstructured contents in the data warehouse. In M. Khosrow-pour (Ed.), Proceedings of the 2001 Information Resource Management Association international conference (pp. 815-818). Toronto, Canada: Idea Group Publishing.]]
[2]
Abramowicz, W., Kalczynski, P. J., & Wecel, K. A. (2002). Filtering the Web to feed data warehouses. London: Springer-Verlag.]]
[3]
Allan, J., Papka, A., & Lavrenko, V. (1998). On-line new event tracking. In J. Zobel (Ed.), Proceedings of the 21st annual international ACM SIGIR conference on research and development in information retrieval (pp. 37-45). New York: ACM Press.]]
[4]
Aramburu, M., & Berlanga, R. (1997). An approach to a digital library of newspapers. Information Processing & Management, 33(5), 645-661.]]
[5]
Aramburu, M., & Berlanga, R. (1998). A retrieval language for historical documents. In T. J. M. Bench-Capon (Ed.). Proceedings of the database and expert systems applications, 9th international conference (DEXA'98) (pp. 216-225). Berlin: Springer-Verlag.]]
[6]
Baeza-Yates, R., & Ribeiro-Neto, B. (1999). Modern information retrieval. New York: ACM Press.]]
[7]
Berlanga, R., Perez, J., Aramburu, M., & Llido, D. (2001). Techniques and tools for temporal analysis of the retrieved information. In P. Vogel (Ed.), Proceedings of the database and expert systems applications, 12th international conference (DEXA-2001) (pp. 72-81). Berlin: Springer-Verlag.]]
[8]
Bettini, C., Jajodia, S., & Wang, S. X. (2000). Time granularities in databases, data mining, and temporal reasoning. Berlin, Heidelberg: Springer-Verlag.]]
[9]
Combi, C., & Chittaro, L. (2001). Representation of temporal intervals and relations: information visualization aspects and their evaluation. In Proceedings of the 8th international symposium on temporal representation and reasoning (TIME-01). New York: IEEE Computer Society Press.]]
[10]
Cousins, S. B., & Kahn, G. (1991). The visual display of temporal information. Artificial Intelligence in Medicine, 3, 341-357.]]
[11]
Dorr, B., & Gaasterland, T. J. (2002). Constraints on the generation of tense, aspect and connecting words from temporal expressions. Journal of Artificial Intelligence Research, 1, 1-47.]]
[12]
Filatova, E., & Hovy, E. (2001). Assigning TimeStamps to EventClauses. In Proceedings of the 39th meeting of the Association of Computational Linguistics (ACL 2001), Toulouse, France (pp. 88-95).]]
[13]
Goralwalla, I. A., Leontiev, Y., Ozsu, T. M., & Szafron, D. (1998). Temporal granularity for unanchored temporal data. In Proceedings of the 7th international conference on information and knowledge management (CIKM'98) (pp. 24-31). New York: ACM Press.]]
[14]
Goralwalla, I. A., Leontiev, Y., Ozsu, T. M., & Szafron, D. (2001). Temporal granularity: completing the puzzle. Journal of Intelligent Information Systems, 16, 41-46.]]
[15]
Kalczynski, P. J. (2002). Software agents to filter business information from the Web to the data warehouse. Unpublished doctoral dissertation, The Poznan University of Economics, Poznan.]]
[16]
Kalczynski, P. J., Abramowicz, W., Wecel, K. A., & Kaczmarek, T. (2003). Time indexer: a tool for extracting temporal references from business news. In M. Khosrow-Pour (Ed.), Proceedings of the 2003 Information Resource Management Association international conference (pp. 832-835). Philadelphia, PA: Idea Group Inc.]]
[17]
Kaufmann, A., & Gupta, M. M. (1985). Introduction to fuzzy arithmetics. Theory and applications. New York: Van Nostrand Reinhold.]]
[18]
Koen, D. B., & Bender, W. (2000). Time frames: temporal augmentation of the news. IBM Systems, 39(3&4), 597-616.]]
[19]
Llido, D., Berlanga, R., & Aramburu, M. J. (2001). Extracting temporal references to assign document-event time periods. In P. Vogel (Ed.), Proceedings of the database and expert systems applications, 12th international conference (DEXA-2001) (pp. 62-71). Berlin: Springer Verlag.]]
[20]
Salton, G., & Buckley, C. (1988). Term weighting approaches in automatic retrieval. Information Processing & amp;& amp; Management, 24(5), 513-523.]]
[21]
Salton, G., & Lesk, M. E. (1968). Computer evaluation of indexing and text processing. Journal of the ACM, 15(January), 8-36.]]
[22]
Schilder, F., & Habel, C. (2001). From temporal expressions to temporal information: semantic tagging of news messages. In Proceedings of the 39th meeting of the Association of Computational Linguistics (ACL 2001) (pp. 65-72). Toulouse, France: Morgan Kaufmann Publishers.]]
[23]
Setzer, A., & Gaizauskas, R. (2002). On the importance of annotating event-event temporal relations in text. In Proceedings of the third international conference on language resources and evaluation (LREC 2002). Paris: European Language Resources Distribution Agency (ELDA).]]
[24]
Swan, R., & Allan, J. (2000). Automatic generation of overview timelines. In P. Ingwersen (Ed.), Proceedings of the 23rd annual international ACM SIGIR conference on research and development in information retrieval (pp. 49-56). New York: ACM Press.]]
[25]
Tansel, A. U. (1993). Temporal databases: theory, design, and implementation. Redwood City, Calif: Benjamin/Cummings Pub. Co.]]
[26]
Wilson, G., & Mani, I. (2000). Robust temporal processing of news. In Proceedings of the 38th meeting of the Association of Computational Linguistics (ACL 2000), Hong Kong (pp. 69-76).]]
[27]
Zadeh, L. A. (1965). Fuzzy sets. Information and Control, 8, 338-353.]]

Cited By

View all

Recommendations

Comments

Information & Contributors

Information

Published In

cover image Information Processing and Management: an International Journal
Information Processing and Management: an International Journal  Volume 41, Issue 3
Special issue: Cross-language information retrieval
May 2005
303 pages

Publisher

Pergamon Press, Inc.

United States

Publication History

Published: 01 May 2005

Author Tags

  1. document retrieval
  2. fuzzy numbers
  3. temporal expressions
  4. temporal retrieval
  5. vector space model

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 09 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2015)TemporalClassifierInternational Journal of Information Technology and Web Engineering10.4018/IJITWE.201510010310:4(44-66)Online publication date: 1-Oct-2015
  • (2015)Temporal Information RetrievalFoundations and Trends in Information Retrieval10.1561/15000000439:2(91-208)Online publication date: 1-Jul-2015
  • (2014)Time-aware topic-based contextualizationProceedings of the 23rd International Conference on World Wide Web10.1145/2567948.2567957(15-20)Online publication date: 7-Apr-2014
  • (2012)Learning to rank search results for time-sensitive queriesProceedings of the 21st ACM international conference on Information and knowledge management10.1145/2396761.2398667(2463-2466)Online publication date: 29-Oct-2012
  • (2011)A comparison of time-aware ranking methodsProceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval10.1145/2009916.2010147(1257-1258)Online publication date: 24-Jul-2011
  • (2011)Ranking related news predictionsProceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval10.1145/2009916.2010018(755-764)Online publication date: 24-Jul-2011
  • (2010)On the evaluation of Geographic Information Retrieval systemsInternational Journal on Digital Libraries10.5555/3269943.326995711:2(91-109)Online publication date: 1-Jun-2010
  • (2010)A language modeling approach for temporal information needsProceedings of the 32nd European conference on Advances in Information Retrieval10.1007/978-3-642-12275-0_5(13-25)Online publication date: 28-Mar-2010
  • (2009)Clustering and exploring search results using timeline constructionsProceedings of the 18th ACM conference on Information and knowledge management10.1145/1645953.1645968(97-106)Online publication date: 2-Nov-2009
  • (2009)A document classification and retrieval system for R&D in semiconductor industry - A hybrid approachExpert Systems with Applications: An International Journal10.1016/j.eswa.2008.06.02436:3(4753-4764)Online publication date: 1-Apr-2009
  • Show More Cited By

View Options

View options

Get Access

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media