Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/2939672.2939852acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
research-article

Burstiness Scale: A Parsimonious Model for Characterizing Random Series of Events

Published: 13 August 2016 Publication History

Abstract

The problem to accurately and parsimoniously characterize random series of events (RSEs) seen in the Web, such as Yelp reviews or Twitter hashtags, is not trivial. Reports found in the literature reveal two apparent conflicting visions of how RSEs should be modeled. From one side, the Poissonian processes, of which consecutive events follow each other at a relatively regular time and should not be correlated. On the other side, the self-exciting processes, which are able to generate bursts of correlated events. The existence of many and sometimes conflicting approaches to model RSEs is a consequence of the unpredictability of the aggregated dynamics of our individual and routine activities, which sometimes show simple patterns, but sometimes results in irregular rising and falling trends. In this paper we propose a parsimonious way to characterize general RSEs, namely the Burstiness Scale (BuSca) model. BuSca views each RSE as a mix of two independent process: a Poissonian and a self-exciting one. Here we describe a fast method to extract the two parameters of BuSca that, together, gives the burstiness scale ψ, which represents how much of the RSE is due to bursty and viral effects. We validated our method in eight diverse and large datasets containing real random series of events seen in Twitter, Yelp, e-mail conversations, Digg, and online forums. Results showed that, even using only two parameters, BuSca is able to accurately describe RSEs seen in these diverse systems, what can leverage many applications.

References

[1]
A.-L. Barabási. The origin of bursts and heavy tails in human dynamics. Nature, 435(7039):207--211, may 2005.
[2]
J. Cao, W. S. Cleveland, D. Lin, and D. X. Sun. Internet Traffic Tends Toward Poisson and Independent as the Load Increases. pages 83--109. 2003.
[3]
R. Crane and D. Sornette. Robust dynamic classes revealed by measuring the response function of a social system. Proceedings of the National Academy of Sciences, 105(41):15649--15653, oct 2008.
[4]
A. Ferraz Costa, Y. Yamaguchi, A. Juci Machado Traina, C. Traina, and C. Faloutsos. Rsc: Mining and modeling temporal activity in social media. In Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining - KDD '15, pages 269--278, New York, New York, USA, 2015. ACM Press.
[5]
F. Figueiredo, J. M. Almeida, Y. Matsubara, B. Ribeiro, and C. Faloutsos. Revisit Behavior in Social Media: The Phoenix-R Model and Discoveries. pages 386--401. 2014.
[6]
F. A. Haight. Handbook of the Poisson distribution {by} Frank A. Haight. Wiley New York, 1967.
[7]
H. Jiang and C. Dovrolis. Why is the Internet Traffic Bursty in Short Time Scales? In Proceedings of the 2005 ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Systems (SIGMETRICS'05), pages 241--252, 2005.
[8]
T. Karagiannis, M. Molle, M. Faloutsos, and A. Broido. A nonstationary poisson view of internet traffic. In IEEE INFOCOM 2004, volume 3, pages 1558--1569. IEEE.
[9]
J. Kleinberg. Bursty and hierarchical structure in streams. In Proceedings of the eighth ACM SIGKDD, KDD '02, pages 91--101, New York, NY, USA, 2002. ACM.
[10]
J. Kleinberg and E. Tardos. Algorithm design. Pearson Education, 2006.
[11]
J. Lehmann, B. Gonçalves, J. J. Ramasco, and C. Cattuto. Dynamical classes of collective attention in twitter. In Proceedings of the 21st international conference on World Wide Web - WWW '12, page 251, New York, New York, USA, 2012. ACM Press.
[12]
R. D. Malmgren, J. M. Hofman, L. A. Amaral, and D. J. Watts. Characterizing individual communication patterns. In Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining - KDD '09, page 607, New York, New York, USA, 2009. ACM Press.
[13]
Y. Matsubara, Y. Sakurai, B. A. Prakash, L. Li, and C. Faloutsos. Rise and fall patterns of information diffusion. In Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining - KDD '12, page 6, New York, New York, USA, 2012. ACM Press.
[14]
J. G. Oliveira and A.-L. Barabasi. Human dynamics: Darwin and Einstein correspondence patterns. Nature, 437(7063):1251, 2005.
[15]
R. Peng. Multi-dimensional point process models in r. Journal of Statistical Software, 8(1):1--27, 2003.
[16]
J. C. L. Pinto, T. Chahed, and E. Altman. Trend detection in social networks using Hawkes processes. In Proceedings of the 2015 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining 2015 - ASONAM '15, pages 1441--1448, New York, New York, USA, 2015. ACM Press.
[17]
D. L. Snyder and M. I. Miller. Random Point Processes in Time and Space. Springer Texts in Electrical Engineering. Springer New York, New York, NY, 1991.
[18]
D. Vallet, S. Berkovsky, S. Ardon, A. Mahanti, and M. A. Kafaar. Characterizing and Predicting Viral-and-Popular Video Content. In Proceedings of the 24th ACM International on Conference on Information and Knowledge Management, CIKM '15, pages 1591--1600, New York, NY, USA, 2015. ACM.
[19]
P. O. S. Vaz de Melo, C. Faloutsos, R. Assunção, R. Alves, and A. A. Loureiro. Universal and Distinct Properties of Communication Dynamics: How to Generate Realistic Inter-event Times. ACM Transactions on Knowledge Discovery in Data, 2015.
[20]
P. O. S. Vaz de Melo, C. Faloutsos, R. Assuncao, and A. A. F. Loureiro. The Self-Feeding Process: A Unifying Model for Communication Dynamics in the Web. In WWW '13: 22nd International World Wide Web Conference, 2013.
[21]
A. Vazquez, J. G. Oliveira, Z. Dezso, K.-I. Goh, I. Kondor, and A.-L. Barabasi. Modeling bursts and heavy tails in human dynamics. Phys Rev E Stat Nonlin Soft Matter Phys, 73:36127, 2006.
[22]
C. Wang, M. Ye, and B. A. Huberman. From user comments to on-line conversations. In Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining - KDD '12, page 244, New York, New York, USA, 2012. ACM Press.
[23]
S. Wang, Z. Yan, X. Hu, P. S. Yu, and Z. Li. Burst Time Prediction in Cascades, 2015.
[24]
J. Yang and J. Leskovec. Patterns of temporal variation in online media. In Proceedings of the fourth ACM international conference on Web search and data mining - WSDM '11, page 177, New York, New York, USA, 2011. ACM Press.
[25]
S.-h. Yang and H. Zha. Mixture of Mutually Exciting Processes for Viral Diffusion. In S. Dasgupta and D. Mcallester, editors, Proceedings of the 30th International Conference on Machine Learning (ICML-13), volume 28, pages 1--9. JMLR Workshop and Conference Proceedings, 2013.
[26]
H. Yu, L. Xie, and S. Sanner. The Lifecyle of a Youtube Video: Phases, Content and Popularity, 2015.
[27]
Q. Zhao, M. A. Erdogdu, H. Y. He, A. Rajaraman, and J. Leskovec. SEISMIC: A Self-Exciting Point Process Model for Predicting Tweet Popularity. In Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining - KDD '15, pages 1513--1522, New York, New York, USA, 2015. ACM Press.

Cited By

View all
  • (2024)Unraveling the Dynamics of Stable and Curious Audiences in Web SystemsProceedings of the ACM Web Conference 202410.1145/3589334.3645473(2464-2475)Online publication date: 13-May-2024
  • (2021)Burst-induced Multi-Armed Bandit for Learning RecommendationProceedings of the 15th ACM Conference on Recommender Systems10.1145/3460231.3474250(292-301)Online publication date: 13-Sep-2021
  • (2020)SCPPACM Transactions on Spatial Algorithms and Systems10.1145/34234057:1(1-30)Online publication date: 29-Oct-2020
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
KDD '16: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining
August 2016
2176 pages
ISBN:9781450342322
DOI:10.1145/2939672
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 13 August 2016

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. communication dynamics
  2. self-exciting point process
  3. social media

Qualifiers

  • Research-article

Funding Sources

  • FAPEMIG
  • CNPq

Conference

KDD '16
Sponsor:

Acceptance Rates

KDD '16 Paper Acceptance Rate 66 of 1,115 submissions, 6%;
Overall Acceptance Rate 1,133 of 8,635 submissions, 13%

Upcoming Conference

KDD '25

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)6
  • Downloads (Last 6 weeks)0
Reflects downloads up to 02 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2024)Unraveling the Dynamics of Stable and Curious Audiences in Web SystemsProceedings of the ACM Web Conference 202410.1145/3589334.3645473(2464-2475)Online publication date: 13-May-2024
  • (2021)Burst-induced Multi-Armed Bandit for Learning RecommendationProceedings of the 15th ACM Conference on Recommender Systems10.1145/3460231.3474250(292-301)Online publication date: 13-Sep-2021
  • (2020)SCPPACM Transactions on Spatial Algorithms and Systems10.1145/34234057:1(1-30)Online publication date: 29-Oct-2020
  • (2019)Effect of self-excitement and behavioral factors on epidemics on activity driven networks2019 18th European Control Conference (ECC)10.23919/ECC.2019.8795748(1512-1517)Online publication date: Jun-2019
  • (2018)Fast estimation of causal interactions using wold processesProceedings of the 32nd International Conference on Neural Information Processing Systems10.5555/3327144.3327220(2975-2986)Online publication date: 3-Dec-2018
  • (2018)Adversarial Training Model Unifying Feature Driven and Point Process Perspectives for Event Popularity PredictionProceedings of the 27th ACM International Conference on Information and Knowledge Management10.1145/3269206.3271714(517-526)Online publication date: 17-Oct-2018
  • (2018)Social Network Monitoring for Bursty Cascade DetectionACM Transactions on Knowledge Discovery from Data10.1145/317804812:4(1-24)Online publication date: 16-Apr-2018

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media