Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

Uncovering Media Bias via Social Network Learning

Published: 22 December 2020 Publication History

Abstract

It is known that media outlets, such as CNN and FOX, have intrinsic political bias that is reflected in their news reports. The computational prediction of such bias has broad application prospects. However, the prediction is difficult via directly analyzing the news content without high-level context. In contrast, social signals (e.g., the network structure of media followers) provide inspiring cues to uncover such bias. In this article, we realize the first attempt of predicting the latent bias of media outlets by analyzing their social network structures. In particular, we address two key challenges: network sparsity and label sparsity. The network sparsity refers to the partial sampling of the entire follower network in practical analysis and computing, whereas the label sparsity refers to the difficulty of annotating sufficient labels to train the prediction model. To cope with the network sparsity, we propose a hybrid sampling strategy to construct a training corpus that contains network information from micro to macro views. Based on this training corpus, a semi-supervised network embedding approach is proposed to learn low-dimensional yet effective network representations. To deal with the label sparsity, we adopt a graph-based label propagation scheme to supplement the missing links and augment label information for model training. The preceding two steps are iteratively optimized to reinforce each other. We further collect a large-scale dataset containing social networks of 10 media outlets together with about 300,000 followers and more than 5 million connections. Over this dataset, we compare our model to a range of state of the art. Superior performance gains demonstrate the merits of the proposed approach. More importantly, the experimental results and analyses confirm the validity of our approach for the computerized prediction of media bias.

References

[1]
Mikhail Belkin and Partha Niyogi. 2001. Laplacian eigenmaps and spectral techniques for embedding and clustering. In Proceedings of Neural Information Processing Systems, Vol. 14. 585--591.
[2]
Yoshua Bengio, Olivier Delalleau, and Nicolas Le Roux. 2006. Label propagation and quadratic criterion. In Semi-supervised Learning, O. Chapelle, B. Scholkopf, and A. Zien (Eds.). MIT Press, 193--216.
[3]
Jingwen Bian, Yang Yang, Hanwang Zhang, and Tatseng Chua. 2015. Multimedia summarization for social events in microblog stream. IEEE Transactions on Multimedia 17, 2 (2015), 216--228.
[4]
Daniel Bradley, Christos Pantzalis, and Xiaojing Yuan. 2016. The influence of political bias in state pension funds. Journal of Financial Economics 119, 1 (2016), 69--91.
[5]
Shaosheng Cao, Wei Lu, and Qiongkai Xu. 2015. GraRep: Learning graph representations with global structural information. In Proceedings of the 24th ACM International on Conference on Information and Knowledge Management. ACM, New York, NY, 891--900.
[6]
Raviv Cohen and Derek Ruths. 2013. Classifying political orientation on Twitter: It’s not easy! In Proceedings of the 7th International AAAI Conference on Weblogs and Social Media (ICWSM’13).
[7]
Michael Conover, Jacob Ratkiewicz, Matthew R. Francisco, Bruno Gonçalves, Filippo Menczer, and Alessandro Flammini. 2011. Political polarization on Twitter. In Proceedings of the 5th International AAAI Conference on Weblogs and Social Media (ICWSM’11). 89--96.
[8]
Stefano DellaVigna and Ethan Kaplan. 2006. The Fox News Effect: Media Bias and Voting. Technical Report. National Bureau of Economic Research.
[9]
James N. Druckman and Michael Parkin. 2005. The impact of media bias: How editorial slant affects voters. Journal of Politics 67, 4 (2005), 1030--1049.
[10]
Andrew Gelman. 2009. Red State, Blue State, Rich State, Poor State: Why Americans Vote the Way They Do. Princeton University Press.
[11]
Jennifer Golbeck and Derek Hansen. 2014. A method for computing political preference among Twitter followers. Social Networks 36 (2014), 177--184.
[12]
Tim Groseclose and Jeffrey Milyo. 2005. A measure of media bias. Quarterly Journal of Economics 120, 4 (2005), 1191--1237.
[13]
Aditya Grover and Jure Leskovec. 2016. Node2vec: Scalable feature learning for networks. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD’16). 855--864.
[14]
Michael Gutmann and Aapo Hyvärinen. 2010. Noise-contrastive estimation: A new estimation principle for unnormalized statistical models. In Proceedings of the 13th International Conference on Artificial Intelligence and Statistics (AISTATS’10), Vol. 1. 6.
[15]
Quoc V. Le and Tomas Mikolov. 2014. Distributed representations of sentences and documents. In Proceedings of the 31st International Conference on Machine Learning (ICML’14), Vol. 14. 1188--1196.
[16]
Wei Liu, Junfeng He, and Shih-Fu Chang. 2010. Large graph construction for scalable semi-supervised learning. In Proceedings of the 27th International Conference on Machine Learning (ICML’10). 679--686.
[17]
Tomas Mikolov, Kai Chen, Greg Corrado, and Jeffrey Dean. 2013. Efficient estimation of word representations in vector space. arXiv:1301.3781
[18]
T. Mikolov and J. Dean. 2013. Distributed representations of words and phrases and their compositionality. In Advances in Neural Information Processing Systems.
[19]
Andriy Mnih and Koray Kavukcuoglu. 2013. Learning word embeddings efficiently with noise-contrastive estimation. In Advances in Neural Information Processing Systems. 2265--2273.
[20]
Bryan Perozzi, Rami Al-Rfou, and Steven Skiena. 2014. DeepWalk: Online learning of social representations. In Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, New York, NY, 701--710.
[21]
Sam T. Roweis and Lawrence K. Saul. 2000. Nonlinear dimensionality reduction by locally linear embedding. Science 290, 5500 (2000), 2323--2326.
[22]
Jian Tang, Meng Qu, Mingzhe Wang, Ming Zhang, Jun Yan, and Qiaozhu Mei. 2015. Line: Large-scale information network embedding. In Proceedings of the 24th International Conference on World Wide Web. ACM, New York, NY, 1067--1077.
[23]
Lei Tang and Huan Liu. 2011. Leveraging social media networks for classification. Data Mining and Knowledge Discovery 23, 3 (2011), 447--478.
[24]
Fei Tian, Bin Gao, Qing Cui, Enhong Chen, and Tie-Yan Liu. 2014. Learning deep representations for graph clustering. In Proceedings of the 28th AAAI Conference on Artificial Intelligence (AAAI’14). 1293--1299.
[25]
Andranik Tumasjan, Timm Oliver Sprenger, Philipp G. Sandner, and Isabell M. Welpe. 2010. Predicting elections with Twitter: What 140 characters reveal about political sentiment. In Proceedings of the 4th International AAAI Conference on Weblogs and Social Media (ICWSM’10) 10 (2010), 178--185.
[26]
Svitlana Volkova, Yoram Bachrach, Michael Armstrong, and Vijay Sharma. 2015. Inferring latent user properties from texts published in social media. In Proceedings of the 27th AAAI Conference on Artificial Intelligence (AAAI’15). 4296--4297.
[27]
Daixin Wang, Peng Cui, and Wenwu Zhu. 2016. Structural deep network embedding. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD’16). 1225--1234.
[28]
Jun Wang, Tony Jebara, and Shih-Fu Chang. 2013. Semi-supervised learning using greedy max-cut. Journal of Machine Learning Research 14 (March 2013), 771--800.
[29]
Meng Wang, Weijie Fu, Shijie Hao, Hengchang Liu, and Xindong Wu. 2017. Learning on big graph: Label inference and regularization with anchor hierarchy. IEEE Transactions on Knowledge and Data Engineering 29, 5 (2017), 1101--1114.
[30]
Meng Wang, Weijie Fu, Shijie Hao, Dacheng Tao, and Xindong Wu. 2016. Scalable semi-supervised learning by efficient anchor graph regularization. IEEE Transactions on Knowledge and Data Engineering 28, 7 (2016), 1864--1877.
[31]
Meng Wang, Hao Li, Dacheng Tao, Ke Lu, and Xindong Wu. 2012. Multimodal graph-based reranking for web image search. IEEE Transactions on Image Processing 21, 11 (2012), 4649--4661.
[32]
Yang Yang, Yaqian Duan, Xinze Wang, Zi Huang, Ning Xie, and Heng Tao Shen. 2018. Hierarchical multi-clue modelling for POI popularity prediction with heterogeneous tourist information. IEEE Transactions on Knowledge and Data Engineering 31, 4 (2018), 757--768.
[33]
Dengyong Zhou, Olivier Bousquet, Thomas Navin Lal, Jason Weston, and Bernhard Schölkopf. 2004. Learning with local and global consistency. In Advances in Neural Information Processing Systems 16. 321--328.
[34]
Xiaojin Zhu, Zoubin Ghahramani, and John Lafferty. 2003. Semi-supervised learning using Gaussian fields and harmonic functions. In Proceedings of the 20th International Conference on Machine Learning (ICML’03), Vol. 3. 912--919.

Cited By

View all
  • (2023)Relation-aware Graph Convolutional Networks for Multi-relational Network AlignmentACM Transactions on Intelligent Systems and Technology10.1145/357982714:2(1-23)Online publication date: 9-Jan-2023
  • (2023)Machine learning in predicting stock indexes: the role of online stock forum sentiment in MIDAS modelAsia-Pacific Journal of Accounting & Economics10.1080/16081625.2023.221523431:4(618-637)Online publication date: 22-May-2023
  • (2022)Real-Time Focused Extraction of Social Media UsersIEEE Access10.1109/ACCESS.2022.316897710(42607-42622)Online publication date: 2022

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Transactions on Intelligent Systems and Technology
ACM Transactions on Intelligent Systems and Technology  Volume 12, Issue 1
Regular Papers
February 2021
280 pages
ISSN:2157-6904
EISSN:2157-6912
DOI:10.1145/3436534
Issue’s Table of Contents
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 22 December 2020
Accepted: 01 July 2020
Revised: 01 May 2019
Received: 01 June 2018
Published in TIST Volume 12, Issue 1

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Social network embedding
  2. media bias prediction

Qualifiers

  • Research-article
  • Research
  • Refereed

Funding Sources

  • National Natural Science Foundation of China

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)48
  • Downloads (Last 6 weeks)2
Reflects downloads up to 10 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2023)Relation-aware Graph Convolutional Networks for Multi-relational Network AlignmentACM Transactions on Intelligent Systems and Technology10.1145/357982714:2(1-23)Online publication date: 9-Jan-2023
  • (2023)Machine learning in predicting stock indexes: the role of online stock forum sentiment in MIDAS modelAsia-Pacific Journal of Accounting & Economics10.1080/16081625.2023.221523431:4(618-637)Online publication date: 22-May-2023
  • (2022)Real-Time Focused Extraction of Social Media UsersIEEE Access10.1109/ACCESS.2022.316897710(42607-42622)Online publication date: 2022

View Options

Get Access

Login options

Full Access

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media