Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/2433396.2433400acmconferencesArticle/Chapter ViewAbstractPublication PageswsdmConference Proceedingsconference-collections
research-article

Information-theoretic measures of influence based on content dynamics

Published: 04 February 2013 Publication History

Abstract

The fundamental building block of social influence is for one person to elicit a response in another. Researchers measuring a "response" in social media typically depend either on detailed models of human behavior or on platform-specific cues such as re-tweets, hash tags, URLs, or mentions. Most content on social networks is difficult to model because the modes and motivation of human expression are diverse and incompletely understood. We introduce content transfer, an information-theoretic measure with a predictive interpretation that directly quantifies the strength of the effect of one user's content on another's in a model-free way. Estimating this measure is made possible by combining recent advances in non-parametric entropy estimation with increasingly sophisticated tools for content representation. We demonstrate on Twitter data collected for thousands of users that content transfer is able to capture non-trivial, predictive relationships even for pairs of users not linked in the follower or mention graph. We suggest that this measure makes large quantities of previously under-utilized social media content accessible to rigorous statistical causal analysis.

Supplementary Material

ZIP File (wsdm25.zip)
Transfer entropy estimation Greg Ver Steeg and Aram Galstyan {gregv,galstyan}@isi.edu August 22,2012 This code is released under GPLv3. The associated paper "Information-Theoretic Measures of Influence Based on Content Dynamics" is available on arxiv: http://arxiv.org/abs/1208.4475 The paper uses this code along with the package "gensim" (available free online) To estimate "content transfer" among users on Twitter Twitter does not allow making data freely available, but to request the data used in the paper, contact Sofus Macskassy, [email protected]

References

[1]
Computation for the work described in this paper was supported by the University of Southern California Center for High-Performance Computing and Communications (www.usc.edu/hpcc).
[2]
E. Bakshy, J. M. Hofman, W. A. Mason, and D. J. Watts. Everyone's an influencer: quantifying influence on twitter. In Proc. fourth ACM international conference on Web search and data mining, WSDM'11, pages 65--74, New York, NY, USA, 2011. ACM.
[3]
P. Bramsen, M. Escobar-Molano, A. Patel, and R. Alonso. Extracting social power relationships from natural language. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1, pages 773--782, Stroudsburg, PA, USA, 2011.
[4]
M. Cha, H. Haddadi, F. Benevenuto, and K. P. Gummadi. Measuring user influence in twitter: The million follower fallacy. In ICWSM-10: Proceedings of international AAAI Conference on Weblogs and Social, 2010.
[5]
C. Danescu-Niculescu-Mizil, M. Gamon, and S. Dumais. Mark my words!: linguistic style accommodation in social media. In Proceedings of the 20th international conference on World wide web, WWW'11, pages 745--754, New York, NY, USA, 2011. ACM.
[6]
C. Danescu-Niculescu-Mizil, L. Lee, B. Pang, and J. Kleinberg. Echoes of power: language effects and power differences in social interaction. In Proceedings of the 21st international conference on World Wide Web, WWW'12, pages 699--708, New York, NY, USA, 2012. ACM.
[7]
C. P. Diehl, G. Namata, and L. Getoor. Relationship identification for social network discovery. In Proceedings of the 22nd national conference on Artificial intelligence - Volume 1, AAAI'07, pages 546--552. AAAI Press, 2007.
[8]
R. Ghosh and K. Lerman. Predicting influential users in online social networks. In Proceedings of KDD workshop on Social Network Analysis (SNA-KDD), July 2010.
[9]
E. Gilbert and K. Karahalios. Predicting tie strength with social media. In Proceedings of the 27th international conference on Human factors in computing systems, CHI'09, pages 211--220, New York, NY, USA, 2009. ACM.
[10]
C. W. J. Granger. Testing for causality : A personal viewpoint. Journal of Economic Dynamics and Control, 2(1):329--352, May 1980.
[11]
J. Hanley. The meaning and use of the area under a receiver operating characteristic (roc) curve. Radiology, 743:29--36, 1982.
[12]
K. Hlavackovaschindler, M. Palus, M. Vejmelka, and J. Bhattacharya. Causality detection based on information-theoretic approaches in time series analysis. Physics Reports, 441(1):1--46, Mar. 2007.
[13]
N. Hodas and K. Lerman. How visibility and divided attention constrain social contagion. Arxiv preprint arXiv:1205.2736, 2012.
[14]
L. Hong and B. Davison. Empirical study of topic modeling in twitter. In Proceedings of the First Workshop on Social Media Analytics. ACM, 2010.
[15]
V. P. J. K. Jakob Runge, Jobst Heitzig. Escaping the curse of dimensionality in estimating multivariate transfer entropy. Phys. Rev. Lett., 108(258701), Jun 2012.
[16]
G. Jeh and J. Widom. Scaling personalized web search. In Proceedings of the 12th international conference on World Wide Web, WWW'03, pages 271--279, New York, NY, USA, 2003. ACM.
[17]
M. Kaminski, M. Ding, W. A. Truccolo, and S. L. Bressler. Evaluating causal relations in neural systems: granger causality, directed transfer function and statistical assessment of significance. Biological cybernetics, 85(2):145--157, Aug. 2001.
[18]
L. F. Kozachenko and N. N. Leonenko. Sample estimate of the entropy of a random vector. Probl. Peredachi Inf., 23(2):95--101, Nov. 1987.
[19]
A. Kraskov, H. Stogbauer, and P. Grassberger. Estimating mutual information. Phys. Rev. E, 69:066138, Jun 2004.
[20]
H. Kwak, C. Lee, H. Park, and S. Moon. What is twitter, a social network or a news media? In Proceedings of the 19th international conference on World wide web, WWW'10, pages 591--600, New York, NY, USA, 2010. ACM.
[21]
J. Lizier, M. Prokopenko, and A. Zomaya. Local information transfer as a spatiotemporal filter for complex systems. Phys. Rev. E, 77(2):026110, 2008.
[22]
em A. C. Lozano, N. Abe, Y. Liu, and S. Rosset. Grouped graphical granger modeling for gene expression regulatory networks discovery. Bioinformatics, 25(12):i110--i118, 2009.
[23]
A. C. Lozano, H. Li, A. Niculescu-Mizil, Y. Liu, C. Perlich, J. Hosking, and N. Abe. Spatial-temporal causal modeling for climate change attribution. In Proceedings of the 15th ACM SIGKDD, KDD'09, New York, NY, USA, 2009. ACM.
[24]
M. Muja and D. Lowe. Fast approximate nearest neighbors with automatic algorithm configuration. In International Conference on Computer Vision Theory and Applications (VISSAPP'09), pages 331--340, 2009.
[25]
L. Page, S. Brin, R. Motwani, and T. Winograd. The PageRank Citation Ranking: Bringing Order to the Web. Technical report, Stanford Digital Library Technologies Project, 1998.
[26]
S. Panzeri, R. Senatore, M. A. Montemurro, and R. S. Petersen. Correcting for the sampling bias problem in spike train information measures. Journal of Neurophysiology, 98(3):1064--1072, 2007.
[27]
J. W. Pennebaker and M. E. Francis. Linguistic inquiry and word count: A computerized text analysis program, 2007.
[28]
F. Perez-Cruz. Estimation of information theoretic measures for continuous random variables. In Proceedings of NIPS-08, pages 1257--1264, 2008.
[29]
B. Poczos and J. Schneider. Nonparametric estimation of conditional information and divergences. 2012.
[30]
R. Rehurek and P. Sojka. Software Framework for Topic Modelling with Large Corpora. In Proceedings of the LREC 2010 Workshop on New Challenges for NLP Frameworks, pages 45--50, Valletta, Malta, May 2010. ELRA. http://is.muni.cz/publication/884893/en.
[31]
D. Romero, W. Galuba, S. Asur, and B. Huberman. Influence and passivity in social media. Machine Learning and Knowledge Discovery in Databases, pages 18--33, 2011.
[32]
S.A. Macskassy. On the study of social interactions in twitter. In Sixth International AAAI Conference on Weblogs and Social Media. ICWSM, 2012.
[33]
S.A. Macskassy and M. Michelson. Why do people retweet? Anti-Homophily wins the day! In Proceedings of the Fifth International Conference on Weblogs and Social Media, Menlo Park, CA, USA, 2011. AAAI.
[34]
T. Schreiber. Measuring information transfer. Phys. Rev. Lett., 85(2):461--464, Jul 2000.
[35]
M. Vejmelka and M. Palus. Inferring the directionality of coupling with conditional mutual information. Physical Review E, 77(2):026214, 2008.
[36]
G. Ver Steeg and A. Galstyan. Information transfer in social media. In Proc of. World Wide Web Conference, 2012.
[37]
J. Victor. Binless strategies for estimation of information from neural data. Physical Review E, 66(5):051903, 2002.
[38]
J. D. Victor. Approaches to information-theoretic analysis of neural activity. Biological Theory, 1(3):302--316, 2006.
[39]
Q. Wang, S. R. Kulkarni, and S. Verdu. Divergence estimation for multidimensional densities via k-nearest-neighbor distances. IEEE Trans. Inf. Theor., 55:2392--2405, May 2009.
[40]
J. Weng, E.-P. Lim, J. Jiang, and Q. He. Twitterrank: finding topic-sensitive influential twitterers. In Proceedings of the third ACM international conference on Web search and data mining, WSDM'10, pages 261--270, New York, NY, USA, 2010. ACM.
[41]
W. Zhao, J. Jiang, J. Weng, J. He, E. Lim, H. Yan, and X. Li. Comparing twitter and traditional media using topic models. Advances in Information Retrieval, pages 338--349, 2011.

Cited By

View all
  • (2023)Differential Effects of Astrocyte Manipulations on Learned Motor Behavior and Neuronal Ensembles in the Motor CortexThe Journal of Neuroscience10.1523/JNEUROSCI.1982-22.202343:15(2696-2713)Online publication date: 9-Mar-2023
  • (2023)A novel regularized weighted estimation method for information diffusion prediction in social networksApplied Network Science10.1007/s41109-023-00605-z8:1Online publication date: 30-Nov-2023
  • (2022)Tight mutual information estimation with contrastive fenchel-legendre optimizationProceedings of the 36th International Conference on Neural Information Processing Systems10.5555/3600270.3602323(28319-28334)Online publication date: 28-Nov-2022
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
WSDM '13: Proceedings of the sixth ACM international conference on Web search and data mining
February 2013
816 pages
ISBN:9781450318693
DOI:10.1145/2433396
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 04 February 2013

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. causality
  2. entropy
  3. influence
  4. social networks

Qualifiers

  • Research-article

Conference

WSDM 2013

Acceptance Rates

Overall Acceptance Rate 498 of 2,863 submissions, 17%

Upcoming Conference

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)34
  • Downloads (Last 6 weeks)3
Reflects downloads up to 09 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2023)Differential Effects of Astrocyte Manipulations on Learned Motor Behavior and Neuronal Ensembles in the Motor CortexThe Journal of Neuroscience10.1523/JNEUROSCI.1982-22.202343:15(2696-2713)Online publication date: 9-Mar-2023
  • (2023)A novel regularized weighted estimation method for information diffusion prediction in social networksApplied Network Science10.1007/s41109-023-00605-z8:1Online publication date: 30-Nov-2023
  • (2022)Tight mutual information estimation with contrastive fenchel-legendre optimizationProceedings of the 36th International Conference on Neural Information Processing Systems10.5555/3600270.3602323(28319-28334)Online publication date: 28-Nov-2022
  • (2022)Social Influence Network Simulation Design Affects Behavior of Aggregated EntropyIEEE Transactions on Computational Social Systems10.1109/TCSS.2021.30926949:2(594-604)Online publication date: Apr-2022
  • (2022)Entropy-Based Characterization of Influence Pathways in Traditional and Social Media2022 IEEE 8th International Conference on Collaboration and Internet Computing (CIC)10.1109/CIC56439.2022.00016(38-44)Online publication date: Dec-2022
  • (2022)Information flow estimation: A study of news on TwitterOnline Social Networks and Media10.1016/j.osnem.2022.10023131(100231)Online publication date: Sep-2022
  • (2022)Classifying Regions of High Model Error Within a Data-Driven RANS Closure: Application to Wind Turbine WakesFlow, Turbulence and Combustion10.1007/s10494-022-00346-6109:3(545-570)Online publication date: 9-Aug-2022
  • (2022)Self-distilled Pruning of Deep Neural NetworksMachine Learning and Knowledge Discovery in Databases10.1007/978-3-031-26390-3_38(655-670)Online publication date: 19-Sep-2022
  • (2021)Influence Cascades: Entropy-Based Characterization of Behavioral Influence Patterns in Social MediaEntropy10.3390/e2302016023:2(160)Online publication date: 28-Jan-2021
  • (2021)Credible Influence Analysis in Mass Media Using Causal Inference2021 IEEE International Conference on Intelligence and Security Informatics (ISI)10.1109/ISI53945.2021.9624679(1-5)Online publication date: 2-Nov-2021
  • Show More Cited By

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media