Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

Lifecycle Modeling for Buzz Temporal Pattern Discovery

Published: 09 December 2016 Publication History

Abstract

In social media analysis, one critical task is detecting a burst of topics or buzz, which is reflected by extremely frequent mentions of certain keywords in a short-time interval. Detecting buzz not only provides useful insights into the information propagation mechanism, but also plays an essential role in preventing malicious rumors. However, buzz modeling is a challenging task because a buzz time-series often exhibits sudden spikes and heavy tails, wherein most existing time-series models fail. In this article, we propose novel buzz modeling approaches that capture the rise and fade temporal patterns via Product Lifecycle (PLC) model, a classical concept in economics. More specifically, we propose to model multiple peaks in buzz time-series with PLC mixture or PLC group mixture and develop a probabilistic graphical model (K-Mixture of Product Lifecycle (K-MPLC) to automatically discover inherent lifecycle patterns within a collection of buzzes. Furthermore, we effectively utilize the model parameters of PLC mixture or PLC group mixture for burst prediction. Our experimental results show that our proposed methods significantly outperform existing leading approaches on buzz clustering and buzz-type prediction.

References

[1]
Gustavo Batista, Xiaoyue Wang, and Eamonn J. Keogh. 2011. A complexity-invariant distance measure for time series. In Proceedings of SDM.
[2]
Christian Bauckhage, Kristian Kersting, and Fabian Hadiji. 2013. Mathematical models of fads explain the temporal dynamics of internet memes. In Proceedings of EMNLP.
[3]
Christopher M. Bishop and Nasser M. Nasrabadi. 2006. Pattern Recognition and Machine Learning. Springer, New York, NY.
[4]
W. R. Blischke and D. N. P. Murthy. 2000. Reliability Modeling, Prediction and Optimization. John Wiley and Sons Inc.
[5]
Yi Chang, Lei Tang, Yoshiyuki Inagaki, and Yan Liu. 2014a. What is Tumblr: A statistical overview and comparison. SIGKDD Explorations 16, 1 (2014), 21--29.
[6]
Yi Chang, Makoto Yamada, Antonio Ortega, and Yan Liu. 2014b. Ups and downs in buzzes: Life cycle modeling for temporal pattern discovery. In Proceedings of ICDM.
[7]
Hyunyoung Choi and Hal Varian. 2009. Predicting the Present with Google Trends. In Technical Report, Google.
[8]
Michael Conover, Jacob Ratkiewicz, Matthew Francisco, Bruno Goncalves, Filippo Menczer, and Alessandro Flammini. 2011. Political polarization on twitter. In Proceedings of ICWSM.
[9]
Riley Crane and Didier Sornette. 2008. Robust dynamic classes revealed by measuring the response function of a social system. Proceedings of the National Academy of Sciences 105, 41 (2008), 15649--15653.
[10]
Peng Cui, Huan Liu, Charu Aggarwal, and Fei Wang. 2016. Uncovering and predicting human behaviors. IEEE Intelligent Systems 31, 2 (2016), 77--88.
[11]
Daniel Gruhl, Ramanathan V. Guha, Ravi Kumar, Jasmine Novak, and Andrew Tomkins. 2005. The predictive power of online chatter. In Proceedings of KDD.
[12]
Marco Guerini, Carlo Strapparava, and Gozde Ozbal. 2011. Exploring text virality in social networks. In Proceedings of ICWSM.
[13]
Peter Haider, Luca Chiarandini, and Ulf Brefeld. 2012. Discriminative clustering for market segmentation. In Proceedings of KDD.
[14]
A. G. Hawkes and D. Oakes. 1974. A cluster representation of a self-exciting process. Journal of Applied Probability 11 (1974), 493--503.
[15]
L. Hong, B. Dom, S. Gurumurthy, and K. Tsioutsiouliklis. 2011. A time-dependent topic model for multiple text streams. In Proceedings of KDD.
[16]
Al. Isaic-Maniu and V. Gh. Voda. 2008. On a model regarding the product life-cycle. Management and Marketing 3, 3 (2008), 87--96.
[17]
Shan Jiang, Joseph Ferreira, and Marta C. Gonzalez. 2012. Clustering daily patterns of human activities in the city. Data Mining and Knowledge Discovery 25, 1 (2012), 478--510.
[18]
Xiao-Jun Zeng, Johan Bollen, and Huina Mao. 2011. Twitter mood predicts the stock market. Journal of Computational Science. 1--8.
[19]
Leonard Kaufman and Peter J. Rousseeuw. 2005. Finding Groups in Data: An Introduction to Cluster Analysis. Wiley-Interscience.
[20]
Jon Kleinberg. 2003. Bursty and hierarchical structure in streams. Data Mining and Knowledge Discovery 7, 4 (2003), 373--397.
[21]
Mahesh Kumar, Nitin R. Patel, and Jonathan Woo. 2002. Clustering seasonality patterns in the presence of errors. In Proceedings of KDD.
[22]
Ravi Kumar, Mohammad Mahdian, and Mary McGlohon. 2010. Dynamics of conversations. In Proceedings of KDD.
[23]
Ravi Kumar, Jasmine Novak, Prabhakar Raghavan, and Andrew Tomkins. 2003. On the bursty evolution of blogspace. In Proceedings of WWW.
[24]
Haewoon Kwak, Changhyun Lee, Hosung Park, and Sue B. Moon. 2010. What is twitter, a social network or a news media. In Proceedings of WWW.
[25]
G. Latouche and V. Ramaswami. 1999. Quasi-Birth-and-Death Processes (1st ed.). ASA SIAM.
[26]
Kristina Lerman and Rumi Ghosh. 2010. Information contagion: An empirical study of the spread of news on digg and twitter social networks. In Proceedings of ICWSM.
[27]
Theodore Levitt. 1965. Exploit the product life cycle. Harvard Business Review 43 (Nov. 1965), 81--94.
[28]
Lei Li and B. Aditya Prakash. 2011. Time series clustering: Complex is simpler. In Proceedings of ICML.
[29]
Jessica Lin, Michail Vlachos, Eamonn J. Keogh, and Dimitrios Gunopulos. 2004. Iterative incremental clustering of time series. In Proceedings of EDBT.
[30]
Yasuko Matsubara, Yasushi Sakurai, B. Aditya Prakash, Lei Li, and Christos Faloutsos. 2012. Rise and fall patterns of information diffusion: Model and implications. In Proceedings of KDD.
[31]
Qiaozhu Mei, Chao Liu, Hang Su, and ChengXiang Zhai. 2006. A probabilistic approach to spatiotemporal theme pattern mining on weblogs. In Proceedings of WWW.
[32]
Thomas Minka. 2000. Estimating a Dirichlet Distribution. Technical Report, MIT.
[33]
Brendan O’Connor, Ramnath Balasubramanyan, Bryan R. Routledge, and Noah A. Smith. 2010. From tweets to polls: Linking text sentiment to public opinion time series. In Proceedings of ICWSM.
[34]
Michael Patriksson. 1999. Nonlinear Programming and Variational Inequality Problems: A Unified Approach. Vol. 23. Springer.
[35]
Emil PETRESCU. 2009. A statistical distribution useful in product life-cycle modeling. Management and Marketing 4, 2 (2009), 165--170.
[36]
Thanawin Rakthanmanon, Eamonn J. Keogh, Stefano Lonardi, and Scott Evans. 2011. Time series epenthesis: Clustering time series streams requires ignoring some data. In Proceedings of ICDM.
[37]
Jimeng Sun and Jie Tang. 2011. A survey of models and algorithms for social influence analysis. Social Network Data Analysis. Springer, 177--214.
[38]
Rikiya Takahashi, Hideyuki Mizuta, Naoki Abe, Ruby L. Kennedy, Vincent J. Jeffs, Ravi Shah, and Robert H. Crites. 2013. Collective response spike prediction for mutually interacting consumers. In Proceedings of ICDM.
[39]
Robert Tibshirani. 1996. Regression shrinkage and selection via the lasso. Journal of the Royal Statistical Society. Series B 58 (1996), 267--288.
[40]
R. Tomioka, T. Suzuki, and M. Sugiyama. 2011. Super-linear convergence of dual augmented Lagrangian algorithm for sparsity regularized estimation. Journal of Machine Learning Research 12 (2011), 1537--1586.
[41]
Jaewon Yang and Jure Leskovec. 2011. Patterns of temporal variation in online media. In Proceedings of WSDM.
[42]
Linyun Yu, Peng Cui, Fei Wang, Chaoming Song, and Shiqiang Yang. 2015. From micro to macro: Uncovering and predicting information cascading process with behavioral dynamics. In Proceedings of ICDM.
[43]
Ming Yuan and Yi Lin. 2006. Model selection and estimation in regression with grouped variables. Journal of the Royal Statistical Society, Series B 68 (2006), 49--67.
[44]
Tianyang Zhang, Peng Cui, Chaoming Song, Wenwu Zhu, and Shiqiang Yang. 2016. A multiscale survival process for modeling human activity patterns. PLOS ONE 11, 3 (2016).
[45]
Qiang Zhu, Gustavo Batista, Thanawin Rakthanmanon, and Eamonn Keogh. 2012. A novel approximation to dynamic time warping allows anytime clustering of massive time series datasets. In Proceedings of SDM.

Cited By

View all
  • (2023)Social media analytics for innovation management research: A systematic literature review and future research agendaTechnovation10.1016/j.technovation.2023.102712123(102712)Online publication date: May-2023
  • (2021)A System for Fake News Detection by using Supervised Learning Model for Social Media Contents2021 5th International Conference on Intelligent Computing and Control Systems (ICICCS)10.1109/ICICCS51141.2021.9432096(1042-1047)Online publication date: 6-May-2021
  • (2020)Two-Path Deep Semisupervised Learning for Timely Fake News DetectionIEEE Transactions on Computational Social Systems10.1109/TCSS.2020.30276397:6(1386-1398)Online publication date: Dec-2020
  • Show More Cited By

Index Terms

  1. Lifecycle Modeling for Buzz Temporal Pattern Discovery

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Transactions on Knowledge Discovery from Data
    ACM Transactions on Knowledge Discovery from Data  Volume 11, Issue 2
    May 2017
    419 pages
    ISSN:1556-4681
    EISSN:1556-472X
    DOI:10.1145/3017677
    Issue’s Table of Contents
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 09 December 2016
    Accepted: 01 September 2016
    Revised: 01 July 2016
    Received: 01 October 2015
    Published in TKDD Volume 11, Issue 2

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Time-series modeling
    2. buzz clustering
    3. buzz type prediction

    Qualifiers

    • Research-article
    • Research
    • Refereed

    Funding Sources

    • MEXT KAKENHI
    • NSF research
    • U.S. Defense Advanced Research Projects Agency (DARPA) under the Social Media in Strategic Communication (SMISC) program

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)7
    • Downloads (Last 6 weeks)1
    Reflects downloads up to 26 Jan 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2023)Social media analytics for innovation management research: A systematic literature review and future research agendaTechnovation10.1016/j.technovation.2023.102712123(102712)Online publication date: May-2023
    • (2021)A System for Fake News Detection by using Supervised Learning Model for Social Media Contents2021 5th International Conference on Intelligent Computing and Control Systems (ICICCS)10.1109/ICICCS51141.2021.9432096(1042-1047)Online publication date: 6-May-2021
    • (2020)Two-Path Deep Semisupervised Learning for Timely Fake News DetectionIEEE Transactions on Computational Social Systems10.1109/TCSS.2020.30276397:6(1386-1398)Online publication date: Dec-2020
    • (2019)Adaptive Multi-view Subspace Clustering for High-dimensional DataPattern Recognition Letters10.1016/j.patrec.2019.01.016Online publication date: Jan-2019
    • (2019)A sensitive network jitter measurement for covert timing channels over interactive trafficMultimedia Tools and Applications10.1007/s11042-018-6281-178:3(3493-3509)Online publication date: 1-Feb-2019
    • (2018)Mining Event-Oriented Topics in Microblog Stream with Unsupervised Multi-View Hierarchical EmbeddingACM Transactions on Knowledge Discovery from Data10.1145/317304412:3(1-26)Online publication date: 11-Apr-2018
    • (2018)Multiple Time-Series Data Analysis for Rumor Detection on Social Media2018 IEEE International Conference on Big Data (Big Data)10.1109/BigData.2018.8622631(4413-4419)Online publication date: Dec-2018
    • (2018)Social image tag enrichment based on textual similarity modelingMultimedia Tools and Applications10.1007/s11042-017-5184-x77:3(3659-3676)Online publication date: 1-Feb-2018
    • (2018)Discriminative unsupervised 2D dimensionality reduction with graph embeddingMultimedia Tools and Applications10.1007/s11042-017-5019-977:3(3189-3207)Online publication date: 1-Feb-2018
    • (2018)A loss combination based deep model for person re-identificationMultimedia Tools and Applications10.1007/s11042-017-5009-y77:3(3049-3069)Online publication date: 1-Feb-2018
    • Show More Cited By

    View Options

    Login options

    Full Access

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media