Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.5555/3304415.3304587guideproceedingsArticle/Chapter ViewAbstractPublication PagesConference Proceedingsacm-pubtype
Article

A comparative study of transactional and semantic approaches for predicting cascades on Twitter

Published: 13 July 2018 Publication History

Abstract

The availability of massive social media data has enabled the prediction of people's future behavioral trends at an unprecedented large scale. Information cascades study on Twitter has been an integral part of behavior analysis. A number of methods based on the transactional features (such as keyword frequency) and the semantic features (such as sentiment) have been proposed to predict the future cascading trends. However, an in-depth understanding of the pros and cons of semantic and transactional models is lacking. This paper conducts a comparative study of both approaches in predicting information diffusion with three mechanisms: retweet cascade, url cascade, and hashtag cascade. Experiments on Twitter data show that the semantic model outperforms the transactional model, if the exterior pattern is less directly observable (i.e. hashtag cascade). When it becomes more directly observable (i.e. retweet and url cascades), the semantic method yet delivers approximate accuracy (i.e. url cascade) or even worse accuracy (i.e. retweet cascade). Further, we demonstrate that the transactional and semantic models are not independent, and the performance gets greatly enhanced when combining both.

References

[1]
Mohammad Akbari, Xia Hu, Nie Liqiang, and Tat-Seng Chua. From tweets to wellness: Wellness event detection from twitter streams. In AAAI'16, pages 87-93, 2016.
[2]
Xiaoran An, Auroop R. Ganguly, Yi Fang, Steven B. Scyphers, Ann M. Hunter, and Jennifer G. Dy. Tracking climate change opinions from twitter data. In KDD'14, pages 1-9, New York, 2014.
[3]
Xiang Bai, Song Bai, and Xinggang Wang. Beyond diffusion process: Neighbor set similarity for fast re-ranking. Information Sciences, 325:342 - 354, 2015.
[4]
James Benhardus and Jugal Kalita. Streaming trend detection in twitter. International Journal of Web Based Communities, 9:122-139, 2013.
[5]
David M. Blei and John D. Lafferty. Dynamic topic models. In ICML'06, Pittsburgh, Pennsylvania, USA, 2006.
[6]
David M. Blei, Andrew Y. Ng, and Michael I. Jordan. Latent dirichlet allocation. Journal of Machine Learning Research, 3:993-1022, 2003.
[7]
Johan Bollen, Huina Mao, and Xiaojun Zeng. Twitter mood predicts the stock market. Journal of Computational Science, 2:1-8, 2011.
[8]
Margaret M. Bradley, Peter J. Lang, Margaret M. Bradley, and Peter J. Lang. Affective norms for english words (anew): Instruction manual and affective ratings. Center for Research in Psychophysiology, University of Florida, Gainesville, Technical Report C-1, 1999.
[9]
Justin Cheng, Lada A. Adamic, P. Alex Dow, Jon Kleinberg, and Jure Leskovec. Can cascades be predicted? WWW'14, pages 925 - 936, 2014.
[10]
Cynthia Chew and Gunther Eysenbach. Pandemics in the age of twitter: Content analysis of tweets during the 2009 h1n1 outbreak. PLoS ONE, 5:e14118, 2010.
[11]
Marco Guerini, Carlo Strapparava, and Gozde Ozbal. Exploring text virality in social networks. In ICWSM'11, pages 506-509, Barcelona, Spain, 2011.
[12]
Thi Bich Ngoc Hoang and Josiane Mothe. Predicting information diffusion on twitter - analysis of predictive features. Journal of Computational Science, pages Available online 28 October 2017, 2017.
[13]
Liangjie Hong, Ovidiu Dan, and Brian D. Davison. Predicting popular messages in twitter. In WWW'11, Hyderabad, India, 2011.
[14]
Ryota Kobayashi and Renaud Lambiotte. Tideh: Time-dependent hawkes process for predicting retweet dynamics. In ICWSM2016, pages 191-200, 2016.
[15]
Adolfo Paolo Masucci, Alkiviadis Kalampokis, Victor Martínez Eguíluz, and Emilio Hernández-García. Extracting directed information flow networks: An application to genetics and semantics. Physical Review E, 83:026103, 2011.
[16]
Adolfo Paolo Masucci, Alkiviadis Kalampokis, Victor Martínez Eguíluz, and Emilio Hernández-García. Wikipedia information flow analysis reveals the scale-free architecture of the semantic space. PLoS ONE, 6:e17333, 2011.
[17]
Mor Naaman, Hila Becker, and Luis Gravano. Hip and trendy: Characterizing emerging trends on twitter. Journal of the American Society for Information Science and Technology, pages 902-918, 2011.
[18]
Timm O. Sprenger, Andranik Tumasjan, Philipp G. Sandner, and Isabell M. Welpe. Tweets and trades: The information content of stock microblogs. European Financial Management, 20:926-957, 2013.
[19]
Toshimitsu Takahashi, Ryota Tomioka, and Kenji Yamanishi. Discovering emerging topics in social streams via link anomaly detection. In ICDM'11, pages 1230-1235, 2011.
[20]
Io Taxidou and Peter M. Fischer. Online analysis of information diffusion in twitter. In WWW'14, pages 1313-1318, Seoul, Korea, 2014.
[21]
Giulio Tononi and Olaf Sporns. Measuring information integration. BMC Neuroscience, 4:31, 2003.
[22]
Xiaojun Wang and Jianwu Wang. Learning information diffusion process on the web. In WWW'07, 2007.
[23]
Can Wang, Longbing Cao, Eric Gaussier, and Dan Luo. Coupled behavior representation, modeling, analysis, and reasoning. IEEE Intelligent Systems, 29(4):62-80, 2014.
[24]
Sen Wang, Xiaojun Chang, Xue Li, Quan Z. Sheng, and Weitong Chen. Multi-task support vector machines for feature selection with shared knowledge discovery. Signal Processing, pages 746-753, 2014.
[25]
Sen Wang, Feiping Nie, Xiaojun Chang, Lina Yao, Xue Li, and Quan Z. Sheng. Unsupervised feature analysis with class margin optimization. ECML-PKDD, pages 383-398, 2015.
[26]
Qingyuan Zhao, Murat A. Erdogdu, Hera Y. He, Anand Rajaraman, and Jure Leskovec. Seismic: A self-exciting point process model for predicting tweet popularity. SIGKDD'15, pages 1513 - 1522, 2015.
[27]
Zhou Zhao, Hanqing Lu, Deng Cai, Xiaofei He, and Yueting Zhuang. Microblog sentiment classication via recurrent random walk network learning. In IJCAI'17, pages 3532-3538, 2017.

Cited By

View all
  • (2021)A Survey of Information Cascade AnalysisACM Computing Surveys10.1145/343300054:2(1-36)Online publication date: 5-Mar-2021

Recommendations

Comments

Information & Contributors

Information

Published In

cover image Guide Proceedings
IJCAI'18: Proceedings of the 27th International Joint Conference on Artificial Intelligence
July 2018
5885 pages
ISBN:9780999241127

Sponsors

  • Adobe
  • IBMR: IBM Research
  • ERICSSON
  • Microsoft: Microsoft
  • AI Journal: AI Journal

Publisher

AAAI Press

Publication History

Published: 13 July 2018

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 11 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2021)A Survey of Information Cascade AnalysisACM Computing Surveys10.1145/343300054:2(1-36)Online publication date: 5-Mar-2021

View Options

View options

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media