Model-portability experiments for textual temporal analysis

Authors:

Oleksandr Kolomiyets,

Steven Bethard,

Marie-Francine Moens

HLT '11: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies: short papers - Volume 2

Pages 271 - 276

Published: 19 June 2011

Abstract

We explore a semi-supervised approach for improving the portability of time expression recognition to non-newswire domains: we generate additional training examples by substituting temporal expression words with potential synonyms. We explore using synonyms both from WordNet and from the Latent Words Language Model (LWLM), which predicts synonyms in context using an unsupervised approach. We evaluate a state-of-the-art time expression recognition system trained both with and without the additional training examples using data from TempEval 2010, Reuters and Wikipedia. We find that the LWLM provides substantial improvements on the Reuters corpus, and smaller improvements on the Wikipedia corpus. We find that WordNet alone never improves performance, though intersecting the examples from the LWLM and WordNet provides more stable results for Wikipedia.
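The substitution step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the BIO token labels, the function names `augment` and `intersect`, and both synonym tables are assumptions invented for illustration. In particular, the toy tables stand in for real WordNet lookups and for LWLM predictions, which in the paper depend on context.

```python
from typing import Dict, List, Set, Tuple

# A token is (word, label); the label "O" marks words outside any time expression.
Token = Tuple[str, str]
SynonymTable = Dict[str, Set[str]]

# Hypothetical candidate tables, hand-written for this sketch only.
TOY_WORDNET: SynonymTable = {
    "monday": {"mon"},
    "week": {"hebdomad", "workweek"},
}
TOY_LWLM: SynonymTable = {
    "monday": {"tuesday", "friday", "mon"},
    "week": {"month", "year", "workweek"},
}


def intersect(a: SynonymTable, b: SynonymTable) -> SynonymTable:
    """Keep only the candidates proposed by both sources."""
    shared = {}
    for word in a.keys() & b.keys():
        common = a[word] & b[word]
        if common:
            shared[word] = common
    return shared


def augment(sentence: List[Token], synonyms: SynonymTable) -> List[List[Token]]:
    """Create one new labeled sentence per (temporal token, candidate) pair."""
    new_examples = []
    for i, (word, label) in enumerate(sentence):
        if label == "O":
            continue  # only words inside a time expression are substituted
        for candidate in sorted(synonyms.get(word.lower(), ())):
            copy = list(sentence)
            copy[i] = (candidate, label)  # the annotation is carried over unchanged
            new_examples.append(copy)
    return new_examples


sentence = [("See", "O"), ("you", "O"), ("next", "B-TIMEX"), ("week", "I-TIMEX")]
lwlm_examples = augment(sentence, TOY_LWLM)
both_examples = augment(sentence, intersect(TOY_LWLM, TOY_WORDNET))
```

In this toy setup the LWLM-style table yields three extra sentences, while the intersected table yields one. That mirrors the trade-off the abstract reports: intersecting the two sources gives fewer additional examples but more stable results.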

Cited By

Wu L, Zhang F, Cheng C, Song S. (2024). Supervised Contrast Learning Text Classification Model Based on Data Quality Augmentation. ACM Transactions on Asian and Low-Resource Language Information Processing, 23:5 (1-12). DOI: 10.1145/3653300. Online publication date: 10-May-2024.
https://dl.acm.org/doi/10.1145/3653300
Bayer M, Kaufhold M, Reuter C. (2022). A Survey on Data Augmentation for Text Classification. ACM Computing Surveys, 55:7 (1-39). DOI: 10.1145/3544558. Online publication date: 17-Jun-2022.
https://dl.acm.org/doi/10.1145/3544558
Llorens H, Saquete E, Navarro-Colorado B. (2019). Applying semantic knowledge to the automatic processing of temporal expressions and events in natural language. Information Processing and Management: an International Journal, 49:1 (179-197). DOI: 10.1016/j.ipm.2012.05.005. Online publication date: 22-Nov-2019.
https://dl.acm.org/doi/10.1016/j.ipm.2012.05.005

Published In

HLT '11: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies: short papers - Volume 2

June 2011

765 pages

ISBN: 9781932432886

General Chair: Dekang Lin (Google)

Publisher

Association for Computational Linguistics

United States
