Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.5555/2002736.2002793dlproceedingsArticle/Chapter ViewAbstractPublication PageshltConference Proceedingsconference-collections
research-article
Free access

Model-portability experiments for textual temporal analysis

Published: 19 June 2011 Publication History

Abstract

We explore a semi-supervised approach for improving the portability of time expression recognition to non-newswire domains: we generate additional training examples by substituting temporal expression words with potential synonyms. We explore using synonyms both from WordNet and from the Latent Words Language Model (LWLM), which predicts synonyms in context using an unsupervised approach. We evaluate a state-of-the-art time expression recognition system trained both with and without the additional training examples using data from TempEval 2010, Reuters and Wikipedia. We find that the LWLM provides substantial improvements on the Reuters corpus, and smaller improvements on the Wikipedia corpus. We find that WordNet alone never improves performance, though intersecting the examples from the LWLM and WordNet provides more stable results for Wikipedia.

References

[1]
David Ahn, Joris van Rantwijk, and Maarten de Rijke. 2007. A Cascaded Machine Learning Approach to Interpreting Temporal Expressions. In Proceedings of the Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL-HLT 2007).
[2]
Michael Collins and Yoram Singer. 1999. Unsupervised Models for Named Entity Classification. In Proceedings of the Joint SIGDAT Conference on Empirical Methods in Natural Language Processing and Very Large Corpora, pp. 100--110, College Park, MD. ACL.
[3]
Koen Deschacht and Marie-Francine Moens. 2009. Using the Latent Words Language Model for Semi-Supervised Semantic Role Labeling. In Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing.
[4]
Ralph Grishman and Beth Sundheim. 1996. Message Understanding Conference-6: A Brief History. In Proceedings of the 16th Conference on Computational Linguistics, pp. 466--471.
[5]
Kadri Hacioglu, Ying Chen, and Benjamin Douglas 2005. Automatic Time Expression Labeling for English and Chinese Text. In Gelbukh, A. (ed.) CICLing 2005. LNCS, vol. 3406, pp. 548--559. Springer, Heidelberg.
[6]
Oleksandr Kolomiyets, Marie-Francine Moens. 2010. KUL: Recognition and Normalization of Temporal Expressions. In Proceedings of SemEval-2 5th Workshop on Semantic Evaluation. pp. 325--328. Uppsala, Sweden. ACL.
[7]
David D. Lewis, Yiming Yang, Tony G. Rose, and Fan Li. 2004. RCV1: A New Benchmark Collection for Text Categorization Research. Machine Learning Research. 5: 361--397
[8]
Inderjeet Mani, and George Wilson. 2000. Robust Temporal Processing of News. In Proceedings of the 38th Annual Meeting on Association for Computational Linguistics, pp. 69--76, Morristown, NJ. ACL.
[9]
George A. Miller. 1995. WordNet: A Lexical Database for English. Communications of the ACM, 38(11): 39--41.
[10]
Matteo Negri, and Luca Marseglia. 2004. Recognition and Normalization of Time Expressions: ITC-irst at TERN 2004. Technical Report, ITC-irst, Trento.
[11]
Hector Llorens, Estela Saquete, and Borja Navarro. 2010. TIPSem (English and Spanish): Evaluating CRFs and Semantic Roles in TempEval 2. In Proceedings of the 5th International Workshop on Semantic Evaluation, pp. 284--291, Uppsala, Sweden. ACL.
[12]
Jordi Poveda, Mihai Surdeanu, and Jordi Turmo. 2007. A Comparison of Statistical and Rule-Induction Learners for Automatic Tagging of Time Expressions in English. In Proceedings of the International Symposium on Temporal Representation and Reasoning, pp. 141--149.
[13]
Jordi Poveda, Mihai Surdeanu, and Jordi Turmo. 2009. An Analysis of Bootstrapping for the Recognition of Temporal Expressions. In Proceedings of the NAACL HLT 2009 Workshop on Semi-Supervised Learning for Natural Language Processing, pp. 49--57, Stroudsburg, PA, USA. ACL.
[14]
Jannik Strötgen and Michael Gertz. 2010. HeidelTime: High Quality Rule-Based Extraction and Normalization of Temporal Expressions. In Proceedings of the 5th International Workshop on Semantic Evaluation, pp. 321--324, Uppsala, Sweden. ACL.
[15]
Mihai Surdeanu, Jordi Turmo, and Alicia Ageno. 2006. A Hybrid Approach for the Acquisition of Information Extraction Patterns. In Proceedings of the EACL 2006 Workshop on Adaptive Text Extraction and Mining (ATEM 2006). ACL.
[16]
Marc Verhagen, Roser Sauri, Tommaso Caselli, and James Pustejovsky. 2010. SemEval-2010 Task 13: TempEval 2. In Proceedings of the 5th International Workshop on Semantic Evaluation, pp. 57--62, Uppsala, Sweden. ACL.
[17]
David Yarowsky. 1995. Unsupervised word sense disambiguation rivaling supervised methods. In Proceedings of the 33rd Annual Meeting of the Association for Computational Linguistics, pp. 189--196, Cambridge, MA. ACL.

Cited By

View all
  • (2024)Supervised Contrast Learning Text Classification Model Based on Data Quality AugmentationACM Transactions on Asian and Low-Resource Language Information Processing10.1145/365330023:5(1-12)Online publication date: 10-May-2024
  • (2022)A Survey on Data Augmentation for Text ClassificationACM Computing Surveys10.1145/354455855:7(1-39)Online publication date: 17-Jun-2022
  • (2019)Applying semantic knowledge to the automatic processing of temporal expressions and events in natural languageInformation Processing and Management: an International Journal10.1016/j.ipm.2012.05.00549:1(179-197)Online publication date: 22-Nov-2019

Recommendations

Comments

Information & Contributors

Information

Published In

cover image DL Hosted proceedings
HLT '11: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies: short papers - Volume 2
June 2011
765 pages
ISBN:9781932432886

Publisher

Association for Computational Linguistics

United States

Publication History

Published: 19 June 2011

Qualifiers

  • Research-article

Acceptance Rates

Overall Acceptance Rate 240 of 768 submissions, 31%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)52
  • Downloads (Last 6 weeks)7
Reflects downloads up to 07 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2024)Supervised Contrast Learning Text Classification Model Based on Data Quality AugmentationACM Transactions on Asian and Low-Resource Language Information Processing10.1145/365330023:5(1-12)Online publication date: 10-May-2024
  • (2022)A Survey on Data Augmentation for Text ClassificationACM Computing Surveys10.1145/354455855:7(1-39)Online publication date: 17-Jun-2022
  • (2019)Applying semantic knowledge to the automatic processing of temporal expressions and events in natural languageInformation Processing and Management: an International Journal10.1016/j.ipm.2012.05.00549:1(179-197)Online publication date: 22-Nov-2019

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media