Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/2009916.2009954acmconferencesArticle/Chapter ViewAbstractPublication PagesirConference Proceedingsconference-collections
research-article

Social context summarization

Published: 24 July 2011 Publication History

Abstract

We study a novel problem of social context summarization for Web documents. Traditional summarization research has focused on extracting informative sentences from standard documents. With the rapid growth of online social networks, abundant user generated content (e.g., comments) associated with the standard documents is available. Which parts in a document are social users really caring about? How can we generate summaries for standard documents by considering both the informativeness of sentences and interests of social users? This paper explores such an approach by modeling Web documents and social contexts into a unified framework. We propose a dual wing factor graph (DWFG) model, which utilizes the mutual reinforcement between Web documents and their associated social contexts to generate summaries. An efficient algorithm is designed to learn the proposed factor graph model.Experimental results on a Twitter data set validate the effectiveness of the proposed model. By leveraging the social context information, our approach obtains significant improvement (averagely +5.0%-17.3%) over several alternative methods (CRF, SVM, LR, PR, and DocLead) on the performance of summarization.

References

[1]
E. Amitay. Automatically summarising web sites - is there a way around it? In CIKM'00, pages 173--179, 2000.
[2]
R. Barzilay and M. Elhadad. Using lexical chains for text summarization. In ACL Workshop on Intelligent Scalable Text Summarization, pages 10--17, 1997.
[3]
D. boyd, S. Golder, and G. Lotan. Tweet, tweet, retweet: Conversational aspects of retweeting on twitter. In HICSS'10, pages 1--10, 2010.
[4]
J. Carbonell and J. Goldstein. The use of mmr, diversity-based reranking for reordering documents and producing summaries. In SIGIR'98, pages 335--336, 1998.
[5]
M. Cha, H. Haddadi, F. Benevenuto, and K. P. Gummadi. Measuring user influence in twitter: The million follower fallacy. In ICWSM'10, pages 10--17, 2010.
[6]
S. F. Chen and R. Rosenfeld. A gaussian prior for smoothing maximum entropy models. Technical Report Carnegie Mellon University-CS-99--108, Carnegie Mellon University, 1999.
[7]
M. Cheong and V. Lee. Integrating web-based intelligence retrieval and decision-making from the twitter trends knowledge base. In SWSM'09, pages 1--8, 2009.
[8]
J. M. Conroy and D. P. O'Leary. Text summarization via hidden markov models. In SIGIR'01, pages 406--407, 2001.
[9]
J.-Y. Delort, B. Bouchon-Meunier, and M. Rifqi. Enhanced web document summarization using hyperlinks. In Hypertext'03, pages 208--215, 2003.
[10]
A. Díaz and P. Gervás. User-model based personalized summarization. Information Processing & Management, 43(6):1715--1734, 2007.
[11]
K. Ganesan, C. Zhai, and J. Han. Opinosis: a graph-based approach to abstractive summarization of highly redundant opinions. In COLING'10, pages 340--348, 2010.
[12]
Y. Gong and X. Liu. Generic text summarization using relevance measure and latent semantic analysis. In SIGIR'01, pages 19--25, 2001.
[13]
M. Hu, A. Sun, and E.-P. Lim. Comments-oriented document summarization: understanding documents with readers' feedback. In SIGIR'08, pages 291--298, 2008.
[14]
A. Java, X. Song, T. Finin, and B. Tseng. Why we twitter: understanding microblogging usage and communities. In WebKDD/SNA-KDD'07, pages 56--65, 2007.
[15]
H. D. Kim and C. Zhai. Generating comparative summaries of contradictory opinions in text. In CIKM'09, pages 385--394, 2009.
[16]
J. Kupiec, J. Pedersen, and F. Chen. A trainable document summarizer. In SIGIR'95, pages 68--73, 1995.
[17]
H. Kwak, C. Lee, H. Park, and S. Moon. What is twitter, a social network or a news media? In WWW'10, pages 591--600, 2010.
[18]
C.-Y. Lin and E. Hovy. Automatic evaluation of summaries using n-gram co-occurrence statistics. In NAACL'03, pages 71--78, 2003.
[19]
Y. Lu, C. Zhai, and N. Sundaresan. Rated aspect summarization of short comments. In WWW'09, pages 131--140, 2009.
[20]
H. P. Luhn. The automatic creation of literature abstracts. IBM Journal of Research and Development, 2(2):159--165, 1958.
[21]
I. Mani and E. Bloedorn. Machine learning of generic and user-focused summarization. In AAAI'98/IAAI'98, pages 820--826, 1998.
[22]
D. Marcu. From discourse structures to text summaries. In ACL Workshop on Intelligent Scalable Text Summarization, pages 82--88, 1997.
[23]
Q. Mei and C. Zhai. Generating impact-cased summaries for scientific literature. In ACL'08, pages 816--824, 2008.
[24]
R. Mihalcea. Language independent extractive summarization. In ACL'05, pages 49--52, 2005.
[25]
M. Osborne. Using maximum entropy for sentence extraction. In ACL Workshop on Automatic Summarization, pages 1--8, 2002.
[26]
M. J. Paul, C. Zhai, and R. Girju. Summarizing contrastive viewpoints in opinionated text. In EMNLP'10, pages 66--76, 2010.
[27]
F. Sha and F. Pereira. Shallow parsing with conditional random fields. In NAACL'03, pages 134--141, 2003.
[28]
D. Shen, J. tao Sun, H. Li, Q. Yang, and Z. Chen. Document summarization using conditional random fields. In IJCAI'07, pages 2862--2867, 2007.
[29]
J.-T. Sun, D. Shen, H.-J. Zeng, Q. Yang, Y. Lu, and Z. Chen. Web-page summarization using clickthrough data. In SIGIR'05, pages 194--201, 2005.
[30]
J. Tang, J. Sun, C. Wang, and Z. Yang. Social influence analysis in large-scale networks. In SIGKDD'09, pages 807--816, 2009.
[31]
J. Tang, L. Yao, and D. Chen. Multi-topic based query-oriented summarization. In SDM'09, pages 1147--1158, 2009.
[32]
C. Teng, N. Xiong, Y. He, L. T. Yang, and D. Liu. A behavioural mode research on user-focus summarization. Mathematical and Computer Modelling, 51(7--8):985--994, 2010.
[33]
X. Wan and J. Yang. Multi-document summarization using cluster-based link analysis. In SIGIR'08, pages 299--306, 2008.
[34]
J. Weng, E.-P. Lim, J. Jiang, and Q. He. Twitterrank: Finding topic-sensitive influential twitterers. In WSDM'10, pages 261--270, 2010.
[35]
Z. Yang, J. Guo, K. Cai, J. Tang, J. Li, L. Zhang, and Z. Su. Understanding retweeting behaviors in social networks. In CIKM'10, pages 1633--1636, 2010.
[36]
J.-Y. Yeh, H.-R. Ke, W.-P. Yang, and I.-H. Meng. Text summarization using a trainable summarizer and latent semantic analysis. Inf. Process. Manage., 41(1):75--95, 2005.

Cited By

View all
  • (2023)Diving into a Sea of Opinions: Multi-modal Abstractive Summarization with Comment SensitivityProceedings of the 32nd ACM International Conference on Information and Knowledge Management10.1145/3583780.3614849(1117-1126)Online publication date: 21-Oct-2023
  • (2023)Towards Social Context Summarization with Convolutional Neural NetworksComputational Linguistics and Intelligent Text Processing10.1007/978-3-031-23804-8_27(341-353)Online publication date: 26-Feb-2023
  • (2022)Exploiting comments information to improve legal public opinion news abstractive summarizationFrontiers of Computer Science10.1007/s11704-021-0561-z16:6Online publication date: 22-Jan-2022
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
SIGIR '11: Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval
July 2011
1374 pages
ISBN:9781450307574
DOI:10.1145/2009916
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 24 July 2011

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. document summarization
  2. factor graph
  3. social context
  4. twitter

Qualifiers

  • Research-article

Conference

SIGIR '11
Sponsor:

Acceptance Rates

Overall Acceptance Rate 792 of 3,983 submissions, 20%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)17
  • Downloads (Last 6 weeks)1
Reflects downloads up to 10 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2023)Diving into a Sea of Opinions: Multi-modal Abstractive Summarization with Comment SensitivityProceedings of the 32nd ACM International Conference on Information and Knowledge Management10.1145/3583780.3614849(1117-1126)Online publication date: 21-Oct-2023
  • (2023)Towards Social Context Summarization with Convolutional Neural NetworksComputational Linguistics and Intelligent Text Processing10.1007/978-3-031-23804-8_27(341-353)Online publication date: 26-Feb-2023
  • (2022)Exploiting comments information to improve legal public opinion news abstractive summarizationFrontiers of Computer Science10.1007/s11704-021-0561-z16:6Online publication date: 22-Jan-2022
  • (2021)Extractive Multi-Document Summarization: A Review of Progress in the Last DecadeIEEE Access10.1109/ACCESS.2021.31124969(130928-130946)Online publication date: 2021
  • (2020)The combination of term relations analysis and weighted frequent itemset model for multidocument summarizationComputational Intelligence10.1111/coin.1227036:2(783-812)Online publication date: 29-Jan-2020
  • (2020)Transformer-based Summarization by Exploiting Social Information2020 12th International Conference on Knowledge and Systems Engineering (KSE)10.1109/KSE50997.2020.9287388(25-30)Online publication date: 12-Nov-2020
  • (2019)Exploiting User Comments for Document Summarization with Matrix FactorizationProceedings of the 10th International Symposium on Information and Communication Technology10.1145/3368926.3369699(118-124)Online publication date: 4-Dec-2019
  • (2019)Heterogeneous-Length Text Topic Modeling for Reader-Aware Multi-Document SummarizationACM Transactions on Knowledge Discovery from Data10.1145/333303013:4(1-21)Online publication date: 8-Aug-2019
  • (2019)ELSAACM Transactions on Information Systems10.1145/329898737:2(1-33)Online publication date: 16-Jan-2019
  • (2019)Document Specific Supervised Keyphrase Extraction With Strong Semantic RelationsIEEE Access10.1109/ACCESS.2019.29488917(167507-167520)Online publication date: 2019
  • Show More Cited By

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media