Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article
Open access

A Distance Measure for the Analysis of Polar Opinion Dynamics in Social Networks

Published: 08 August 2019 Publication History

Abstract

Analysis of opinion dynamics in social networks plays an important role in today’s life. For predicting users’ political preference, it is particularly important to be able to analyze the dynamics of competing polar opinions, such as pro-Democrat vs. pro-Republican. While observing the evolution of polar opinions in a social network over time, can we tell when the network evolved abnormally? Furthermore, can we predict how the opinions of the users will change in the future? To answer such questions, it is insufficient to study individual user behavior, since opinions can spread beyond users’ ego-networks. Instead, we need to consider the opinion dynamics of all users simultaneously and capture the connection between the individuals’ behavior and the global evolution pattern of the social network.
In this work, we introduce the Social Network Distance (SND)—a distance measure that quantifies the likelihood of evolution of one snapshot of a social network into another snapshot under a chosen model of polar opinion dynamics. SND has a rich semantics of a transportation problem, yet, is computable in time linear in the number of users and, as such, is applicable to large-scale online social networks. In our experiments with synthetic and Twitter data, we demonstrate the utility of our distance measure for anomalous event detection. It achieves a true positive rate of 0.83, twice as high as that of alternatives. The same predictions presented in precision-recall space show that SND retains perfect precision for recall up to 0.2. Its precision then decreases while maintaining more than 2-fold improvement over alternatives for recall up to 0.95. When used for opinion prediction in Twitter data, SND’s accuracy is 75.6%, which is 7.5% higher than that of the next best method.

References

[1]
Daron Acemoglu and Asuman Ozdaglar. 2011. Opinion dynamics and learning in social networks. Dynamic Games and Applications 1, 1 (2011), 3--49.
[2]
Ravindra K. Ahuja, Kurt Mehlhorn, James Orlin, and Robert E. Tarjan. 1990. Faster algorithms for the shortest path problem. Journal of the ACM 37, 2 (1990), 213--223.
[3]
Ravindra K. Ahuja, James B. Orlin, Clifford Stein, and Robert E. Tarjan. 1994. Improved algorithms for bipartite network flow. SIAM Journal on Computing 23, 5 (1994), 906--933.
[4]
Leman Akoglu, Mary McGlohon, and Christos Faloutsos. 2010. Oddball: Spotting anomalies in weighted graphs. In Proceedings of Pacific-Asia Conference on Knowledge Discovery and Data Mining. Springer, 410--421.
[5]
Leman Akoglu, Hanghang Tong, and Danai Koutra. 2015. Graph based anomaly detection and description: A survey. Data Mining and Knowledge Discovery 29, 3 (2015), 626--688.
[6]
Victor Amelkin, Francesco Bullo, and Ambuj K. Singh. 2017. Polar opinion dynamics in social networks. IEEE Transactions on Automatic Control 62, 11 (2017), 5650--5665.
[7]
Michele Berlingerio, Danai Koutra, Tina Eliassi-Rad, and Christos Faloutsos. 2013. Network similarity via multiple social theories. In Proceedings of the IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM’13). 1439--1440.
[8]
Vincent D. Blondel, Anahí Gajardo, Maureen Heymans, Pierre Senellart, and Paul Van Dooren. 2004. A measure of similarity between graph vertices: Applications to synonym extraction and web searching. SIAM Review 46, 4 (2004), 647--666.
[9]
Allan Borodin, Yuval Filmus, and Joel Oren. 2010. Threshold models for competitive influence in social networks. In Proceedings of International Workshop on Internet and Network Economics (WINE’10). Springer, 539--550.
[10]
Horst Bunke and Kim Shearer. 1998. A graph distance metric based on the maximal common subgraph. Pattern Recognition Letters 19, 3 (1998), 255--259.
[11]
Tim Carnes, Chandrashekhar Nagarajan, Stefan M. Wild, and Anke Van Zuylen. 2007. Maximizing influence in a competitive social network: A follower’s perspective. In Proceedings of ACM International Conference on Electronic Commerce (EC’07). 351--360.
[12]
Claudio Castellano, Santo Fortunato, and Vittorio Loreto. 2009. Statistical physics of social dynamics. Reviews of Modern Physics 81, 2 (2009), 591.
[13]
Kenneth L. Clarkson. 2006. Nearest-neighbor searching and metric space dimensions. In Nearest-Neighbor Methods in Learning and Vision: Theory and Practice. Gregory Shakhnarovich, Trevor Darrell, and Piotr Indyk (Eds.), The MIT Press.
[14]
Raviv Cohen and Derek Ruths. 2013. Classifying political orientation on Twitter: It’s not easy! In Proceedings of AAAI Conference on Web and Social Media (ICWSM’13). Retrieved from https://www.aaai.org/ocs/index.php/ICWSM/ICWSM13/paper/view/6128/6347.
[15]
Michael D. Conover, Bruno Gonçalves, Jacob Ratkiewicz, Alessandro Flammini, and Filippo Menczer. 2011. Predicting the political alignment of Twitter users. In Proceedings of IEEE International Conference on Social Computing and Networking (SocialCom’11). 192--199.
[16]
IBM ILOG CPLEX. 2009. V12. 1: User’s manual for CPLEX. International Business Machines Corporation 46, 53 (2009), 157.
[17]
Abir De, Isabel Valera, Niloy Ganguly, Sourangshu Bhattacharya, and Manuel Gomez Rodriguez. 2016. Learning and forecasting opinion dynamics in social networks. In Proceedings of the 30th International Conference on Neural Information Processing Systems. 397--405.
[18]
Xinbo Gao, Bing Xiao, Dacheng Tao, and Xuelong Li. 2010. A survey of Graph Edit Distance. Pattern Analysis and Applications 13, 1 (2010), 113--129.
[19]
Andrew Goldberg. 1997. An efficient implementation of a scaling minimum-cost flow algorithm. Journal of Algorithms 22, 1 (1997), 1--29.
[20]
Andrew Goldberg and Robert Tarjan. 1987. Solving minimum-cost flow problems by successive approximation. In Proceedings of ACM Symposium on Theory of Computing (STOC’87). 7--18.
[21]
Manuel Gomez-Rodriguez, Jure Leskovec, and Andreas Krause. 2010. Inferring networks of diffusion and influence. In Proceedings of ACM Conference on Knowledge Discovery and Data Mining (ACM SIGKDD’10). 1019--1028.
[22]
Amit Goyal, Francesco Bonchi, and Laks V. S. Lakshmanan. 2010. Learning influence probabilities in social networks. In Proceedings of ACM International Conference on Web Search and Data Mining (WSDM’10). 241--250.
[23]
James Hafner, Harpreet S. Sawhney, William Equitz, Myron Flickner, and Wayne Niblack. 1995. Efficient color histogram indexing for quadratic form distance functions. Pattern Analysis and Machine Intelligence 17, 7 (1995), 729--736.
[24]
David K. Hammond, Yaniv Gur, and Chris R. Johnson. 2013. Graph diffusion distance: A difference measure for weighted graphs based on the graph Laplacian exponential kernel. In Proceedings of the IEEE Global Conference on Signal and Information Processing (GlobalSIP’13). 419--422.
[25]
Frederick S. Hillier and Gerald J. Lieberman. 1995. Introduction to Mathematical Programming. New York, N.Y., McGraw-Hill.
[26]
Glen Jeh and Jennifer Widom. 2002. SimRank: A measure of structural-context similarity. In Proceedings of ACM Conference on Knowledge Discovery and Data Mining (ACM SIGKDD’02). 538--543.
[27]
Narendra Karmarkar. 1984. A new polynomial-time algorithm for linear programming. In Proceedings of ACM Symposium on Theory of Computing (STOC’84). 302--311.
[28]
Danai Koutra, Joshua T. Vogelstein, and Christos Faloutsos. 2013. DeltaCon: A principled massive-graph similarity function. In Proceedings of SIAM International Conference on Data Mining (SDM’13). SIAM, 162--170.
[29]
Elizabeth A. Leicht, Petter Holme, and Mark E. J. Newman. 2006. Vertex similarity in networks. Physical Review E 73, 2 (2006), 026120.
[30]
Sanjiva K. Lele. 1992. Compact finite difference schemes with spectral-like resolution. Journal of Computational Physics 103, 1 (1992), 16--42.
[31]
Longjie Li, Min Ma, Peng Lei, Xiaoping Wang, and Xiaoyun Chen. 2014. A linear approximate algorithm for Earth Mover’s Distance with thresholded ground distance. Mathematical Problems in Engineering 2014 (2014), Article 406358, 9 pages.
[32]
Haibin Ling and Kazunori Okada. 2006. Diffusion distance for histogram comparison. In Proceedings of IEEE Computer Vision and Pattern Recognition (CVPR’06), Vol. 1. 246--253.
[33]
Haibin Ling and Kazunori Okada. 2007. An efficient Earth Mover’s Distance algorithm for robust histogram comparison. IEEE Pattern Analysis and Machine Intelligence 29, 5 (2007), 840--853.
[34]
Vebjorn Ljosa, Arnab Bhattacharya, and Ambuj K. Singh. 2006. Indexing spatially sensitive distance measures using multi-resolution lower bounds. In Proceedings of International Conference on Extending Database Technology (EDBT’06). Springer.
[35]
Kathy Macropol, Petko Bogdanov, Ambuj K. Singh, Linda Petzold, and Xifeng Yan. 2013. I act, therefore I judge: Network sentiment dynamics based on user activity change. In Proceedings of IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM’13). 396--402.
[36]
Kathy Macropol, Petko Bogdanov, Ambuj K. Singh, Linda Petzold, and Xifeng Yan. 2013. I act, therefore I judge: Network sentiment dynamics based on user activity change (Supplemental Material). Retrieved May 5, 2018 from http://cs.ucsb.edu/ dbl/papers/sentimentappendix.pdf.
[37]
Andrew McGregor and Daniel Stubbs. 2013. Sketching Earth Mover’s Distance on graph metrics. In Approximation, Randomization, and Combinatorial Optimization. Algorithms and Techniques. M. Goemans, K. Jansen, J. D. P. Rolim, and L. Trevisan (Eds.), Springer, 274--286.
[38]
Sergey Melnik, Hector Garcia-Molina, and Erhard Rahm. 2002. Similarity flooding: A versatile graph matching algorithm and its application to schema matching. In Proceedings of IEEE International Conference on Data Engineering (ICDE’02). 117--128.
[39]
Russell Merris. 1994. Laplacian matrices of graphs: A survey. Linear Algebra and Its Applications 197 (1994), 143--176.
[40]
Anis Najar, Ludovic Denoyer, and Patrick Gallinari. 2012. Predicting information diffusion on social networks with partial knowledge. In Proceedings of the Web Conference Companion (WWW Companion’12). Leslie Car (Ed.), 1197--1204.
[41]
Ofir Pele and Michael Werman. 2008. A linear time histogram metric for improved SIFT matching. In Proceedings of European Conference on Computer Vision (ECCV’08). Springer, 495--508.
[42]
Ofir Pele and Michael Werman. 2009. Fast and robust Earth Mover’s Distances. In Proceedings of the IEEE International Conference on Computer Vision (ICCV’09). 460--467.
[43]
Anton V. Proskurnikov and Roberto Tempo. 2017. A tutorial on modeling and analysis of dynamic social networks. Part I. Annual Reviews in Control 43 (2017), 65--79.
[44]
Anton V. Proskurnikov and Roberto Tempo. 2018. A tutorial on modeling and analysis of dynamic social networks. Part II. Annual Reviews in Control 45 (2018), 166--190.
[45]
Stephen Ranshous, Shitian Shen, Danai Koutra, Steve Harenberg, Christos Faloutsos, and Nagiza F. Samatova. 2015. Anomaly detection in dynamic networks: A survey. Wiley Interdisciplinary Reviews: Computational Statistics 7, 3 (2015), 223--247.
[46]
Yossi Rubner, Carlo Tomasi, and Leonidas J. Guibas. 2000. The Earth Mover’s Distance as a metric for image retrieval. International Journal of Computer Vision 40, 2 (2000), 99--121.
[47]
Kazumi Saito, Masahiro Kimura, Kouzou Ohara, and Hiroshi Motoda. 2011. Detecting changes in opinion value distribution for voter model. In Proceedings of Social Computing, Behavioral-Cultural Modeling and Prediction. 89--96.
[48]
Santiago Segarra, Weiyu Huang, and Alejandro Ribeiro. 2015. Diffusion and superposition distances for signals supported on networks. IEEE Signal and Information Processing over Networks 1, 1 (2015), 20--32.
[49]
Chenhao Tan, Lillian Lee, Jie Tang, Long Jiang, Ming Zhou, and Ping Li. 2011. User-level sentiment analysis incorporating social networks. In Proceedings of ACM Conference on Knowledge Discovery and Data Mining (ACM SIGKDD’11). 1397--1405.
[50]
Yu Tang, U. Leong Hou, Yilun Cai, Nikos Mamoulis, and Reynold Cheng. 2013. Earth mover’s distance based similarity search at scale. The VLDB Journal 7, 4 (2013), 313--324.
[51]
Xiaolong Wang, Furu Wei, Xiaohua Liu, Ming Zhou, and Ming Zhang. 2011. Topic sentiment analysis in Twitter: A graph-based hashtag sentiment classification approach. In Proceedings of ACM Conference on Information and Knowledge Management (CIKM’11). 1031--1040.
[52]
Richard C. Wilson, Edwin R. Hancock, and Bin Luo. 2005. Pattern vectors from algebraic graph theory. IEEE Pattern Analysis and Machine Intelligence 27, 7 (2005), 1112--1124.
[53]
Richard C. Wilson and Ping Zhu. 2008. A study of graph spectra for comparing graphs and trees. Pattern Recognition 41, 9 (2008), 2833--2841.
[54]
Zhi Yang, Christo Wilson, Xiao Wang, Tingting Gao, Ben Y. Zhao, and Yafei Dai. 2014. Uncovering social network sybils in the wild. ACM Transactions on Knowledge Discovery from Data 8, 1 (2014), 2.

Cited By

View all
  • (2023)Application of Big Data-based Text Processing Technology in Public Opinion Monitoring System in Universities2023 International Conference on Intelligent Communication and Networking (ICN)10.1109/ICN60549.2023.10426298(191-195)Online publication date: 10-Nov-2023
  • (2022)Modelling coupled human–environment complexity for the future of the biosphere: strengths, gaps and promising directionsPhilosophical Transactions of the Royal Society B: Biological Sciences10.1098/rstb.2021.0382377:1857Online publication date: 27-Jun-2022
  • (2021)Social network analysis using deep learning: applications and schemesSocial Network Analysis and Mining10.1007/s13278-021-00799-z11:1Online publication date: 25-Oct-2021

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Transactions on Knowledge Discovery from Data
ACM Transactions on Knowledge Discovery from Data  Volume 13, Issue 4
August 2019
235 pages
ISSN:1556-4681
EISSN:1556-472X
DOI:10.1145/3343141
Issue’s Table of Contents
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 08 August 2019
Accepted: 01 May 2019
Revised: 01 May 2019
Received: 01 May 2018
Published in TKDD Volume 13, Issue 4

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Social network
  2. anomaly detection
  3. competing opinions
  4. distance measure
  5. earth mover’s distance
  6. minimum-cost network flow
  7. model-driven analysis
  8. opinion dynamics
  9. opinion prediction
  10. polar opinions
  11. polarization
  12. time-series
  13. transportation problem
  14. wasserstein metric

Qualifiers

  • Research-article
  • Research
  • Refereed

Funding Sources

  • U. S. Army Research Laboratory
  • U. S. Army Research Office

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)189
  • Downloads (Last 6 weeks)28
Reflects downloads up to 09 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2023)Application of Big Data-based Text Processing Technology in Public Opinion Monitoring System in Universities2023 International Conference on Intelligent Communication and Networking (ICN)10.1109/ICN60549.2023.10426298(191-195)Online publication date: 10-Nov-2023
  • (2022)Modelling coupled human–environment complexity for the future of the biosphere: strengths, gaps and promising directionsPhilosophical Transactions of the Royal Society B: Biological Sciences10.1098/rstb.2021.0382377:1857Online publication date: 27-Jun-2022
  • (2021)Social network analysis using deep learning: applications and schemesSocial Network Analysis and Mining10.1007/s13278-021-00799-z11:1Online publication date: 25-Oct-2021

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Get Access

Login options

Full Access

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media