Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/2766462.2767713acmconferencesArticle/Chapter ViewAbstractPublication PagesirConference Proceedingsconference-collections
research-article

Summarizing Contrastive Themes via Hierarchical Non-Parametric Processes

Published: 09 August 2015 Publication History

Abstract

Given a topic of interest, a contrastive theme is a group of opposing pairs of viewpoints. We address the task of summarizing contrastive themes: given a set of opinionated documents, select meaningful sentences to represent contrastive themes present in those documents. Several factors make this a challenging problem: unknown numbers of topics, unknown relationships among topics, and the extraction of comparative sentences. Our approach has three core ingredients: contrastive theme modeling, diverse theme extraction, and contrastive theme summarization. Specifically, we present a hierarchical non-parametric model to describe hierarchical relations among topics; this model is used to infer threads of topics as themes from the nested Chinese restaurant process. We enhance the diversity of themes by using structured determinantal point processes for selecting a set of diverse themes with high quality. Finally, we pair contrastive themes and employ an iterative optimization algorithm to select sentences, explicitly considering contrast, relevance, and diversity. Experiments on three datasets demonstrate the effectiveness of our method.

References

[1]
A. Ahmed, L. Hong, and A. Smola. Nested chinese restaurant franchise process: Applications to user tracking and document modeling. In ICML, 2013.
[2]
J. Allan, C. Wade, and A. Bolivar. Retrieval and novelty detection at the sentence level. In SIGIR, 2003.
[3]
D. M. Blei, A. Y. Ng, and M. I. Jordan. Latent dirichlet allocation. J. Machine Learning research, 3: 993--1022, 2003.
[4]
D. M. Blei, T. L. Griffiths, and M. I. Jordan. The nested chinese restaurant process and bayesian nonparametric inference of topic hierarchies. J. ACM, 57 (2), 2010.
[5]
A. Borodin. Determinantal point processes. In The Oxford Handbook of Random Matrix Theory. Oxford University Press, 2009.
[6]
A. Celikyilmaz and D. Hakkani-Tur. A hybrid hierarchical model for multi-document summarization. In ACL, 2010.
[7]
D. Chakrabarti and K. Punera. Event summarization using tweets. In ICWSM, 2011.
[8]
S. Dori-Hacohen and J. Allan. Detecting controversy on the web. In CIKM, 2013.
[9]
Y. Duan, F. Wei, C. Zhumin, Z. Ming, and Y. Shum. Twitter topic summarization by ranking tweets using social influence and content quality. In Coling, 2012.
[10]
G. Erkan and D. Radev. Lexrank: Graph-based lexical centrality as salience in text summarization. J. Artificial Intelligence Research, 22: 457--479, 2004.
[11]
K. Filippova. Multi-sentence compression: Finding shortest paths in word graphs. In Coling, 2010.
[12]
K. Ganesan, C. Zhai, and J. Han. Opinosis: a graph-based approach to abstractive summarization of highly redundant opinions. In Coling, 2010.
[13]
K. Ganesan, C. Zhai, and E. Viegas. Micropinion generation: an unsupervised approach to generating ultra-concise summaries of opinions. In WWW, 2012.
[14]
J. Gillenwater, A. Kulesza, and B. Taskar. Discovering diverse and salient threads in document collections. In EMNLP-CoNLL, 2012.
[15]
M. Hu and B. Liu. Mining opinion features in customer reviews. In AAAI, 2004.
[16]
M. Hu and B. Liu. Opinion extraction and summarization on the web. In AAAI, 2006.
[17]
X. Huang, X. Wan, and J. Xiao. Comparative news summarization using concept-based optimization. Knowledge and Information Systems, 38 (3): 691--716, 2013.
[18]
H. D. Kim and C. Zhai. Generating comparative summaries of contradictory opinions in text. In CIKM, 2009.
[19]
H. D. Kim, K. Ganesan, P. Sondhi, and C. Zhai. Comprehensive review of opinion summarization. Tech. Rep, 2011.
[20]
H. D. Kim, M. G. Castellanos, M. Hsu, C. Zhai, U. Dayal, and R. Ghosh. Ranking explanatory sentences for opinion summarization. In SIGIR, 2013.
[21]
A. Kulesza and B. Taskar. Structured determinantal point processes. In NIPS, 2010.
[22]
A. Kulesza and B. Taskar. Determinantal point processes for machine learning. Found. & Tr. Machine Learning, 5 (2--3): 123--286, 2012.
[23]
K. Lerman and R. McDonald. Contrastive summarization: an experiment with consumer reviews. In NAACL, 2009.
[24]
F. Li, C. Han, M. Huang, X. Zhu, Y.-J. Xia, S. Zhang, and H. Yu. Structure-aware review mining and summarization. In Coling, 2010.
[25]
L. Li, K. Zhou, G. Xue, H. Zha, and Y. Yu. Enhancing diversity, coverage and balance for summarization through structure learning. In WWW, 2009.
[26]
S. Liang, Z. Ren, and M. de Rijke. Fusion helps diversification. In SIGIR, 2014.
[27]
C.-Y. Lin. Rouge: A package for automatic evaluation of summaries. In ACL, 2004.
[28]
C.-Y. Lin and E. Hovy. From single to multi-document summarization: A prototype system and its evaluation. In ACL, 2002.
[29]
W.-H. Lin, T. Wilson, J. Wiebe, and A. Hauptmann. Which side are you on? Identifying perspectives at the document and sentence levels. In CoNLL, 2006.
[30]
Y. Lu, C. Zhai, and N. Sundaresan. Rated aspect summarization of short comments. In WWW, 2009.
[31]
R. McCreadie, C. Macdonald, and I. Ounis. Incremental update summarization: Adaptive sentence selection based on prevalence and novelty. In CIKM, 2014.
[32]
Q. Mei, X. Ling, M. Wondra, H. Su, and C. Zhai. Topic sentiment mixture: modeling facets and opinions in weblogs. In WWW, 2007.
[33]
X. Meng, F. Wei, X. Liu, M. Zhou, S. Li, and H. Wang. Entity-centric topic-oriented opinion summarization in twitter. In KDD, 2012.
[34]
A. Nenkova and K. McKeown. Automatic summarization. Found. & Tr. Information Retrieval, 5 (2--3): 103--233, 2012.
[35]
J. Nichols, J. Mahmud, and C. Drews. Summarizing sporting events using twitter. In IUI, 2012.
[36]
M. Paul and R. Girju. A two-dimensional topic-aspect model for discovering multi-faceted topics. AAAI, 2010.
[37]
M. J. Paul, C. Zhai, and R. Girju. Summarizing contrastive viewpoints in opinionated text. In EMNLP, 2010.
[38]
D. Radev et al. MEAD--A platform for multidocument multilingual text summarization. In LREC, 2004.
[39]
L. Ren, D. B. Dunson, and L. Carin. The dynamic hierarchical dirichlet process. In ICML, 2008.
[40]
Z. Ren, J. Ma, S. Wang, and Y. Liu. Summarizing web forum threads based on a latent topic propagation process. In CIKM, 2011.
[41]
Z. Ren, S. Liang, E. Meij, and M. de Rijke. Personalized time-aware tweets summarization. In SIGIR, 2013.
[42]
Z. Ren, M.-H. Peetz, S. Liang, W. van Dolen, and M. de Rijke. Hierarchical multi-label classification of social text streams. In SIGIR, 2014.
[43]
C. Shen and T. Li. Learning to rank for query-focused multi-document summarization. In ICDM, 2011.
[44]
D. Shen, J. Sun, H. Li, Q. Yang, and Z. Chen. Document summarization using conditional random fields. In IJCAI, 2007.
[45]
L. Shou, Z. Wang, K. Chen, and G. Chen. Sumblr: continuous summarization of evolving tweet streams. In SIGIR, 2013.
[46]
R. Socher et al. Recursive deep models for semantic compositionality over a sentiment treebank. In EMNLP, 2013.
[47]
K. Toutanova, C. Brockett, M. Gamon, J. Jagarlamudi, H. Suzuki, and L. Vanderwende. The pythy summarization system: Microsoft research at DUC 2007. In DUC, 2007.
[48]
X. Wan and J. Yang. Multi-document summarization using cluster-based link analysis. In SIGIR, 2008.
[49]
F. Wei, W. Li, Q. Lu, and Y. He. Query-sensitive mutual reinforcement chain and its application in query-oriented multi-document summarization. In SIGIR, 2008.
[50]
Y. Zhu, Y. Lan, J. Guo, P. Du, and X. Cheng. A novel relational learning-to-rank approach for topic-focused multi-document summarization. In ICDM, 2013.

Cited By

View all

Index Terms

  1. Summarizing Contrastive Themes via Hierarchical Non-Parametric Processes

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      SIGIR '15: Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval
      August 2015
      1198 pages
      ISBN:9781450336215
      DOI:10.1145/2766462
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 09 August 2015

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. contrastive theme summarization
      2. hierarchical sentiment-LDA
      3. structured determinantal point processes
      4. topic modeling

      Qualifiers

      • Research-article

      Funding Sources

      • the European Community's Seventh Framework Programme (FP7/2007-2013)
      • the Yahoo! Faculty Research and Engagement Program
      • Amsterdam Data Science
      • the Microsoft Research PhD program
      • the Netherlands eScience Center
      • the Netherlands Organisation for Scientific Research (NWO)

      Conference

      SIGIR '15
      Sponsor:

      Acceptance Rates

      SIGIR '15 Paper Acceptance Rate 70 of 351 submissions, 20%;
      Overall Acceptance Rate 792 of 3,983 submissions, 20%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)9
      • Downloads (Last 6 weeks)0
      Reflects downloads up to 30 Aug 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2023)Contrastive text summarization: a surveyInternational Journal of Data Science and Analytics10.1007/s41060-023-00434-4Online publication date: 9-Aug-2023
      • (2020)Distribted learning dynamics of Multi-Armed Bandits for edge intelligenceJournal of Systems Architecture10.1016/j.sysarc.2020.101919(101919)Online publication date: Oct-2020
      • (2019)ContraVis: Contrastive and Visual Topic Modeling for Comparing Document CollectionsThe World Wide Web Conference10.1145/3308558.3313617(928-938)Online publication date: 13-May-2019
      • (2019)Across-Time Comparative Summarization of News ArticlesProceedings of the Twelfth ACM International Conference on Web Search and Data Mining10.1145/3289600.3291008(735-743)Online publication date: 30-Jan-2019
      • (2019)Discovering Latent Threads in Entity HistoriesData Science and Engineering10.1007/s41019-019-00108-xOnline publication date: 15-Nov-2019
      • (2018)Computational linguistics literature and citations oriented citation linkage, classification and summarizationInternational Journal on Digital Libraries10.1007/s00799-017-0219-519:2-3(173-190)Online publication date: 1-Sep-2018
      • (2017)Sentiment diversification for short review summarizationProceedings of the International Conference on Web Intelligence10.1145/3106426.3106525(723-729)Online publication date: 23-Aug-2017
      • (2017)Inferring Dynamic User Interests in Streams of Short Texts for User ClusteringACM Transactions on Information Systems10.1145/307260636:1(1-37)Online publication date: 17-Jul-2017
      • (2017)Summarizing Answers in Non-Factoid Community Question-AnsweringProceedings of the Tenth ACM International Conference on Web Search and Data Mining10.1145/3018661.3018704(405-414)Online publication date: 2-Feb-2017
      • (2017)Social Collaborative Viewpoint Regression with Explainable RecommendationsProceedings of the Tenth ACM International Conference on Web Search and Data Mining10.1145/3018661.3018686(485-494)Online publication date: 2-Feb-2017
      • Show More Cited By

      View Options

      Get Access

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media