Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.5555/1613715.1613756dlproceedingsArticle/Chapter ViewAbstractPublication PagesemnlpConference Proceedingsconference-collections
research-article
Free access

Understanding the value of features for coreference resolution

Published: 25 October 2008 Publication History

Abstract

In recent years there has been substantial work on the important problem of coreference resolution, most of which has concentrated on the development of new models and algorithmic techniques. These works often show that complex models improve over a weak pairwise baseline. However, less attention has been given to the importance of selecting strong features to support learning a coreference model.
This paper describes a rather simple pairwise classification model for coreference resolution, developed with a well-designed set of features. We show that this produces a state-of-the-art system that outperforms systems built with complex models. We suggest that our system can be used as a baseline for the development of more complex models -- which may have less impact when a more robust set of features is used. The paper also presents an ablation study and discusses the relative contributions of various features.

References

[1]
A. Bagga and B. Baldwin. 1998. Algorithms for scoring coreference chains. In MUC7.
[2]
A. Culotta, M. Wick, R. Hall, and A. McCallum. 2007. First-order probabilistic models for coreference resolution. In HLT/NAACL, pages 81--88.
[3]
P. Denis and J. Baldridge. 2007. Joint determination of anaphoricity and coreference resolution using integer programming. In HLT/NAACL, pages 236--243, Rochester, New York.
[4]
C. Fellbaum. 1998. WordNet: An Electronic Lexical Database. MIT Press.
[5]
Y. Freund and R. E. Schapire. 1998. Large margin classification using the Perceptron algorithm. In COLT, pages 209--217.
[6]
H. Ji, D. Westbrook, and R. Grishman. 2005. Using semantic relations to refine coreference decisions. In EMNLP/HLT, pages 17--24, Vancouver, British Columbia, Canada.
[7]
X. Luo and I. Zitouni. 2005. Multi-lingual coreference resolution with syntactic features. In HLT/EMNLP, pages 660--667, Vancouver, British Columbia, Canada.
[8]
X. Luo, A. Ittycheriah, H. Jing, N. Kambhatla, and S. Roukos. 2004. A mention-synchronous coreference resolution algorithm based on the bell tree. In ACL, page 135, Morristown, NJ, USA.
[9]
V. Ng and C. Cardie. 2002a. Identifying anaphoric and non-anaphoric noun phrases to improve coreference resolution. In COLING-2002.
[10]
V. Ng and C. Cardie. 2002b. Improving machine learning approaches to coreference resolution. In ACL. NIST. 2004. The ace evaluation plan. www.nist.gov/speech/tests/ace/index.htm.
[11]
V. Punyakanok and D. Roth. 2001. The use of classifiers in sequential inference. In The Conference on Advances in Neural Information Processing Systems (NIPS), pages 995--1001. MIT Press.
[12]
N. Rizzolo and D. Roth. 2007. Modeling Discriminative Global Inference. In Proceedings of the First International Conference on Semantic Computing (ICSC), pages 597--604, Irvine, California.
[13]
W. M. Soon, H. T. Ng, and C. Y. Lim. 2001. A machine learning approach to coreference resolution of noun phrases. Computational Linguistics, 27(4):521--544.
[14]
M. Vilain, J. Burger, J. Aberdeen, D. Connolly, and L. Hirschman. 1995. A model-theoretic coreference scoring scheme. In MUC6, pages 45--52.

Cited By

View all
  • (2023)Chinese Event Discourse Deixis Resolution: Design of the Dataset and ModelACM Transactions on Asian and Low-Resource Language Information Processing10.1145/361810922:11(1-26)Online publication date: 6-Sep-2023
  • (2020)Understanding global feature contributions with additive importance measuresProceedings of the 34th International Conference on Neural Information Processing Systems10.5555/3495724.3497168(17212-17223)Online publication date: 6-Dec-2020
  • (2020)LINSPECTORComputational Linguistics10.1162/coli_a_0037646:2(335-385)Online publication date: 1-Jun-2020
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image DL Hosted proceedings
EMNLP '08: Proceedings of the Conference on Empirical Methods in Natural Language Processing
October 2008
1129 pages

Publisher

Association for Computational Linguistics

United States

Publication History

Published: 25 October 2008

Qualifiers

  • Research-article

Acceptance Rates

Overall Acceptance Rate 73 of 234 submissions, 31%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)47
  • Downloads (Last 6 weeks)5
Reflects downloads up to 09 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2023)Chinese Event Discourse Deixis Resolution: Design of the Dataset and ModelACM Transactions on Asian and Low-Resource Language Information Processing10.1145/361810922:11(1-26)Online publication date: 6-Sep-2023
  • (2020)Understanding global feature contributions with additive importance measuresProceedings of the 34th International Conference on Neural Information Processing Systems10.5555/3495724.3497168(17212-17223)Online publication date: 6-Dec-2020
  • (2020)LINSPECTORComputational Linguistics10.1162/coli_a_0037646:2(335-385)Online publication date: 1-Jun-2020
  • (2017)Ontology-Based Entity Coreference Resolution For Sentiment AnalysisProceedings of the 8th International Symposium on Information and Communication Technology10.1145/3155133.3155168(50-56)Online publication date: 7-Dec-2017
  • (2017)A systematic review and comparative analysis of cross-document coreference resolution methods and toolsComputing10.1007/s00607-016-0490-099:4(313-349)Online publication date: 1-Apr-2017
  • (2015)An active learning approach to coreference resolutionProceedings of the 24th International Conference on Artificial Intelligence10.5555/2832415.2832432(1312-1318)Online publication date: 25-Jul-2015
  • (2015)Use of named entity recognition and co-reference resolution tools for segmenting english textsProceedings of the 19th Panhellenic Conference on Informatics10.1145/2801948.2802004(331-336)Online publication date: 1-Oct-2015
  • (2015)Diversionary Comments under Blog PostsACM Transactions on the Web10.1145/27892119:4(1-34)Online publication date: 24-Sep-2015
  • (2015)Effective Techniques for Static Race Detection in Java Parallel LoopsACM Transactions on Software Engineering and Methodology10.1145/272997524:4(1-30)Online publication date: 2-Sep-2015
  • (2015)Improving opinion retrieval in social media by combining features-based coreferencing and memory-based learningInformation Sciences: an International Journal10.1016/j.ins.2014.12.021299:C(20-31)Online publication date: 1-Apr-2015
  • Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Get Access

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media