Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
article

Two uses of anaphora resolution in summarization

Published: 01 November 2007 Publication History

Abstract

We propose a new method for using anaphoric information in Latent Semantic Analysis (LSA), and discuss its application to develop an LSA-based summarizer which achieves a significantly better performance than a system not using anaphoric information, and a better performance by the rouge measure than all but one of the single-document summarizers participating in DUC-2002. Anaphoric information is automatically extracted using a new release of our own anaphora resolution system, GUITAR, which incorporates proper noun resolution. Our summarizer also includes a new approach for automatically identifying the dimensionality reduction of a document on the basis of the desired summarization percentage. Anaphoric information is also used to check the coherence of the summary produced by our summarizer, by a reference checker module which identifies anaphoric resolution errors caused by sentence extraction.

References

[1]
Baldwin, B., & Morton, T. S. (1998). Dynamic coreference-based summarization. In Proceedings of EMNLP, Granada, Spain.
[2]
Barzilay, R., & Elhadad, M. (1997). Using lexical chains for text summarization. In Proceedings of the ACL/EACL workshop on intelligent scalable text summarization, Madrid, Spain.
[3]
Bergler, S., Witte, R., Khalife, M., Li, Z., & Rudzicz, F. (2003). Using knowledge-poor coreference resolution for text summarization. In Proceedings of DUC, Edmonton, Canada.
[4]
Using linear algebra for intelligent IR. SIAM Review. v37 i4.
[5]
Salience-based content characterization of text documents. In: Mani, I., Maybury, M.T. (Eds.), Advances in automatic text summarization, MIT Press, Cambridge, US.
[6]
Bontcheva, K., Dimitrov, M., Maynard, D., Tablan, V., & Cunningham, H. (2002). Shallow methods for named entity coreference resolution. In Chaínes de références et résolveurs d'anaphores, workshop TALN 2002, Nancy, France.
[7]
Charniak, E. (2000). A maximum-entropy-inspired parser. In Proceedings of NAACL, Philadelphia, US.
[8]
Choi, F. Y. Y., Wiemer-Hastings, P., & Moore, J. D. (2001) Latent semantic analysis for text segmentation. In Proceedings of EMNLP, Pittsburgh, US.
[9]
A probabilistic model for latent semantic indexing. Journal of the American Society for Information Science and Technology. v56 i6. 597-608.
[10]
Gong, Y., & Liu, X. (2002). Generic text summarization using relevance measure and latent semantic analysis. In Proceedings of ACM SIGIR, New Orleans, US.
[11]
Building better corpora for summarization. In: Proceedings of corpus linguistics, Lancaster, United Kingdom.
[12]
Hovy, E., & Lin, C. (1997). Automated text summarization in summarist. In ACL/EACL workshop on intelligent scalable text summarization, Madrid, Spain.
[13]
Kabadjov, M. A. (2007). Anaphora resolution and applications. PhD Dissertation, University of Essex, UK.
[14]
Kabadjov, M. A., Poesio, M. & Steinberger, J. (2005). Task-based evaluation of anaphora resolution: the case of summarization. In RANLP workshop "crossing barriers in text summarization research", Borovets, Bulgaria.
[15]
Beyond elaboration: the interaction of relations and focus in coherent text. In: Sanders, T., Schilperoord, J., Spooren, W. (Eds.), Text representation: linguistic and psycholinguistic aspects, John Benjamins.
[16]
A solution to Plato's problem: the latent semantic analysis theory of the acquisition, induction, and representation of knowledge. Psychological Review. v104. 211-240.
[17]
Lin, Ch. (2004). ROUGE: A package for automatic evaluation of summaries. In Proceedings of the workshop on text summarization branches out, Barcelona, Spain.
[18]
Lin, Ch., & Hovy, E. (2003). Automatic evaluation of summaries using n-gram co-occurrence statistics. In Proceedings of HLT-NAACL, Edmonton, Canada.
[19]
Mitkov, R. (1998). Robust pronoun resolution with limited knowledge. In Proceedings of COLING, Montreal, Canada.
[20]
Mueller, C., & Strube, M. (2003). MMAX: a tool for the annotation of multi-modal corpora. In Proceedings of the 2nd IJCAI workshop on knowledge and reasoning in practical dialogue systems, Seattle, US.
[21]
Orasan, C., Mitkov, R., & Hasler, L. (2003). CAST: a computer-aided summarization tool. In Proceedings of EACL, Budapest, Hungary.
[22]
Poesio, M., & Kabadjov, M. A. (2004). A general-purpose, off-the-shelf anaphora resolution module: implementation and preliminary evaluation. In Proceedings of LREC, Lisbon, Portugal.
[23]
Centering: a parametric theory and its instantiations. Computational Linguistics. v30 i3.
[24]
Radev, D. R., Jing, H., & Budzikowska, M. (2000). Centroid-based summarization of multiple documents. In ANLP/NAACL workshop on automatic summarization, Seattle, US.
[25]
Steinberger, J., & Jezek, K. (2004). Text summarization and singular value decomposition. In Proceedings of ADVIS, Izmir, Turkey.
[26]
Steinberger, J., Kabadjov, M. A., & Poesio, M. (2005). Improving LSA-based summarization with anaphora resolution. In Proceedings of HLT/EMNLP, The Association for Computational Linguistics, Vancouver, Canada (pp. 1-8).
[27]
Stuckardt, R. (2003). Coreference-based summarization and question answering: a case for high precision anaphor resolution. In International symposium on reference resolution, Venice, Italy.
[28]
An empirically-based system for processing definite descriptions. Computational Linguistics. v26 i4.

Cited By

View all
  • (2023)Chinese Event Discourse Deixis Resolution: Design of the Dataset and ModelACM Transactions on Asian and Low-Resource Language Information Processing10.1145/361810922:11(1-26)Online publication date: 6-Sep-2023
  • (2023)DialogRE: An Extension of DialogRE to Investigate How Much Coreference Helps Relation Extraction in DialogsNatural Language Processing and Chinese Computing10.1007/978-3-031-44693-1_18(222-234)Online publication date: 12-Oct-2023
  • (2022)Generating extractive sentiment summaries for natural language user queries on productsACM SIGAPP Applied Computing Review10.1145/3558053.355805422:2(5-20)Online publication date: 17-Aug-2022
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

Publisher

Pergamon Press, Inc.

United States

Publication History

Published: 01 November 2007

Author Tags

  1. Anaphora resolution
  2. Latent semantic analysis
  3. Singular value decomposition
  4. Summarization

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 04 Oct 2024

Other Metrics

Citations

Cited By

View all
  • (2023)Chinese Event Discourse Deixis Resolution: Design of the Dataset and ModelACM Transactions on Asian and Low-Resource Language Information Processing10.1145/361810922:11(1-26)Online publication date: 6-Sep-2023
  • (2023)DialogRE: An Extension of DialogRE to Investigate How Much Coreference Helps Relation Extraction in DialogsNatural Language Processing and Chinese Computing10.1007/978-3-031-44693-1_18(222-234)Online publication date: 12-Oct-2023
  • (2022)Generating extractive sentiment summaries for natural language user queries on productsACM SIGAPP Applied Computing Review10.1145/3558053.355805422:2(5-20)Online publication date: 17-Aug-2022
  • (2021)A comprehensive review on feature set used for anaphora resolutionArtificial Intelligence Review10.1007/s10462-020-09917-354:4(2917-3006)Online publication date: 1-Apr-2021
  • (2021)PE-MSC: partial entailment-based minimum set cover for text summarizationKnowledge and Information Systems10.1007/s10115-020-01537-163:5(1045-1068)Online publication date: 1-May-2021
  • (2019)CQASUMMProceedings of the ACM India Joint International Conference on Data Science and Management of Data10.1145/3297001.3297004(18-26)Online publication date: 3-Jan-2019
  • (2019)A Survey of Discourse Representations for Chinese Discourse AnnotationACM Transactions on Asian and Low-Resource Language Information Processing10.1145/329344218:3(1-25)Online publication date: 25-Jan-2019
  • (2018)A New LSA and Entropy-Based Approach for Automatic Text Document SummarizationInternational Journal on Semantic Web & Information Systems10.4018/IJSWIS.201810010114:4(1-32)Online publication date: 1-Oct-2018
  • (2018)Automatic cohesive summarization with pronominal anaphora resolutionComputer Speech and Language10.1016/j.csl.2018.05.00452:C(141-164)Online publication date: 1-Nov-2018
  • (2017)Extracting Product Features for Opinion Mining Using Public Conversations in TwitterProcedia Computer Science10.1016/j.procs.2017.08.122112:C(927-935)Online publication date: 1-Sep-2017
  • Show More Cited By

View Options

View options

Get Access

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media