Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

Combating fake news: a data management and mining perspective

Published: 01 August 2019 Publication History

Abstract

Fake news is a major threat to global democracy resulting in diminished trust in government, journalism and civil society. The public popularity of social media and social networks has caused a contagion of fake news where conspiracy theories, disinformation and extreme views flourish. Detection and mitigation of fake news is one of the fundamental problems of our times and has attracted widespread attention. While fact checking websites such as snopes, politifact and major companies such as Google, Facebook, and Twitter have taken preliminary steps towards addressing fake news, much more remains to be done. As an interdisciplinary topic, various facets of fake news have been studied by communities as diverse as machine learning, databases, journalism, political science and many more.
The objective of this tutorial is two-fold. First, we wish to familiarize the database community with the efforts by other communities on combating fake news. We provide a panoramic view of the state-of-the-art of research on various aspects including detection, propagation, mitigation, and intervention of fake news. Next, we provide a concise and intuitive summary of prior research by the database community and discuss how it could be used to counteract fake news. The tutorial covers research from areas such as data integration, truth discovery and fusion, probabilistic databases, knowledge graphs and crowdsourcing from the lens of fake news. Effective tools for addressing fake news could only be built by leveraging the synergistic relationship between database and other research communities. We hope that our tutorial provides an impetus towards such synthesis of ideas and the creation of new ones.

References

[1]
H. Allcott and M. Gentzkow. Social media and fake news in the 2016 election. Journal of economic perspectives, 31(2):211--36, 2017.
[2]
V. Bakir and A. McStay. Fake news and the economy of emotions: Problems, causes, solutions. Digital Journalism, 6(2):154--175, 2018.
[3]
S. Bharathi, D. Kempe, and M. Salek. Competitive influence maximization in social networks. In International workshop on web and internet economics, pages 306--311. Springer, 2007.
[4]
A. Borodin, Y. Filmus, and J. Oren. Threshold models for competitive influence in social networks. In International workshop on internet and network economics, pages 539--550. Springer, 2010.
[5]
C. Castillo, M. Mendoza, and B. Poblete. Information credibility on twitter. In WWW, pages 675--684. ACM, 2011.
[6]
G. L. Ciampaglia, P. Shiralkar, L. M. Rocha, J. Bollen, F. Menczer, and A. Flammini. Computational fact checking from knowledge networks. PloS one, 10(6):e0128193, 2015.
[7]
S. Cohen, C. Li, J. Yang, and C. Yu. Computational journalism: A call to arms to database researchers. 2011.
[8]
X. Dong, E. Gabrilovich, G. Heitz, W. Horn, N. Lao, K. Murphy, T. Strohmann, S. Sun, and W. Zhang. Knowledge vault: A web-scale approach to probabilistic knowledge fusion. In KDD, pages 601--610. ACM, 2014.
[9]
X. L. Dong, L. Berti-Equille, and D. Srivastava. Integrating conflicting data: the role of source dependence. PVLDB, 2(1):550--561, 2009.
[10]
A. Halevy, A. Rajaraman, and J. Ordille. Data integration: the teenage years. In PVLDB, pages 9--16. VLDB Endowment, 2006.
[11]
W. Hamilton, P. Bajaj, M. Zitnik, D. Jurafsky, and J. Leskovec. Embedding logical queries on knowledge graphs. In Advances in Neural Information Processing Systems, pages 2026--2037, 2018.
[12]
N. Hassan, G. Zhang, F. Arslan, J. Caraballo, D. Jimenez, S. Gawsane, S. Hasan, M. Joseph, A. Kulkarni, A. K. Nayak, et al. Claimbuster: The first-ever end-to-end fact-checking system. PVLDB, 10(12):1945--1948, 2017.
[13]
Z. Jin, J. Cao, Y. Zhang, and J. Luo. News verification by exploiting conflicting social viewpoints in microblogs. In AAAI, 2016.
[14]
J. Kim, B. Tabibian, A. Oh, B. Schölkopf, and M. Gomez-Rodriguez. Leveraging the crowd to detect and reduce the spread of fake news and misinformation. In WSDM, pages 324--332. ACM, 2018.
[15]
D. M. Lazer, M. A. Baum, Y. Benkler, A. J. Berinsky, K. M. Greenhill, F. Menczer, M. J. Metzger, B. Nyhan, G. Pennycook, D. Rothschild, et al. The science of fake news. Science, 359(6380):1094--1096, 2018.
[16]
Y. Li, J. Gao, C. Meng, Q. Li, L. Su, B. Zhao, W. Fan, and J. Han. A survey on truth discovery. ACM Sigkdd Explorations Newsletter, 17(2):1--16, 2016.
[17]
Y. Lin, Z. Liu, M. Sun, Y. Liu, and X. Zhu. Learning entity and relation embeddings for knowledge graph completion. In Twenty-ninth AAAI conference on artificial intelligence, 2015.
[18]
N. Mele, D. Lazer, M. Baum, N. Grinberg, L. Friedland, K. Joseph, W. Hobbs, and C. Mattsson. Combating fake news: An agenda for research and action, 2017.
[19]
R. Pastor-Satorras and A. Vespignani. Immunization of complex networks. Physical review E, 65(3):036104, 2002.
[20]
G. Pennycook and D. Rand. Crowdsourcing judgments of news source quality. SSRN. com, 2018.
[21]
K. Popat, S. Mukherjee, J. Strötgen, and G. Weikum. Where the truth lies: Explaining the credibility of emerging claims on the web and social media. In WWW, pages 1003--1012, 2017.
[22]
C. Shao, G. L. Ciampaglia, O. Varol, A. Flammini, and F. Menczer. The spread of fake news by social bots. arXiv preprint arXiv:1707.07592, pages 96--104, 2017.
[23]
B. Shi and T. Weninger. Discriminative predicate path mining for fact checking in knowledge graphs. Knowledge-based systems, 104:123--133, 2016.
[24]
B. Shi and T. Weninger. Proje: Embedding projection for knowledge graph completion. In Thirty-First AAAI Conference on Artificial Intelligence, 2017.
[25]
P. Shiralkar, A. Flammini, F. Menczer, and G. L. Ciampaglia. Finding streams in knowledge graphs to support fact checking. In 2017 IEEE International Conference on Data Mining (ICDM), pages 859--864. IEEE, 2017.
[26]
K. Shu, H. R. Bernard, and H. Liu. Studying fake news via network analysis: detection and mitigation. In Emerging Research Challenges and Opportunities in Computational Social Network Analysis and Mining, pages 43--65. Springer, 2019.
[27]
K. Shu, A. Sliva, S. Wang, J. Tang, and H. Liu. Fake news detection on social media: A data mining perspective. ACM SIGKDD Explorations Newsletter, 19(1):22--36, 2017.
[28]
E. C. Tandoc Jr, Z. W. Lim, and R. Ling. Defining "fake news" a typology of scholarly definitions. Digital Journalism, 6(2):137--153, 2018.
[29]
S. Tschiatschek, A. Singla, M. Gomez Rodriguez, A. Merchant, and A. Krause. Fake news detection in social networks via crowd signals. In WWW, pages 517--524. WWW, 2018.
[30]
S. Vosoughi, D. Roy, and S. Aral. The spread of true and false news online. Science, 359(6380):1146--1151, 2018.
[31]
J. Widom. Trio: A system for integrated management of data, accuracy, and lineage. Technical report, Stanford InfoLab, 2004.
[32]
L. Wu and H. Liu. Tracing fake-news footprints: Characterizing social media messages by how they propagate. In WSDM, pages 637--645. ACM, 2018.
[33]
X. Zhou and R. Zafarani. Fake news: A survey of research, detection methods, and opportunities. arXiv preprint arXiv:1812.00315, 2018.

Cited By

View all
  • (2024)Misinformation, Disinformation, and Generative AI: Implications for Perception and PolicyDigital Government: Research and Practice10.1145/3689372Online publication date: 23-Aug-2024
  • (2024)Comquest: Large Scale User Comment Crawling and IntegrationCompanion of the 2024 International Conference on Management of Data10.1145/3626246.3654736(432-435)Online publication date: 9-Jun-2024
  • (2024)Toward Mitigating Misinformation and Social Media Manipulation in LLM EraCompanion Proceedings of the ACM Web Conference 202410.1145/3589335.3641256(1302-1305)Online publication date: 13-May-2024
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image Proceedings of the VLDB Endowment
Proceedings of the VLDB Endowment  Volume 12, Issue 12
August 2019
547 pages

Publisher

VLDB Endowment

Publication History

Published: 01 August 2019
Published in PVLDB Volume 12, Issue 12

Qualifiers

  • Research-article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)87
  • Downloads (Last 6 weeks)6
Reflects downloads up to 23 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Misinformation, Disinformation, and Generative AI: Implications for Perception and PolicyDigital Government: Research and Practice10.1145/3689372Online publication date: 23-Aug-2024
  • (2024)Comquest: Large Scale User Comment Crawling and IntegrationCompanion of the 2024 International Conference on Management of Data10.1145/3626246.3654736(432-435)Online publication date: 9-Jun-2024
  • (2024)Toward Mitigating Misinformation and Social Media Manipulation in LLM EraCompanion Proceedings of the ACM Web Conference 202410.1145/3589335.3641256(1302-1305)Online publication date: 13-May-2024
  • (2022)From BERT to GPT-3 codexProceedings of the VLDB Endowment10.14778/3554821.355489615:12(3770-3773)Online publication date: 1-Aug-2022
  • (2022)RETRACTED ARTICLE: SCLAVOEM: hyper parameter optimization approach to predictive modelling of COVID-19 infodemic tweets using smote and classifier vote ensembleSoft Computing - A Fusion of Foundations, Methodologies and Applications10.1007/s00500-022-06940-027:6(3531-3550)Online publication date: 15-Mar-2022
  • (2021)To Intervene or Not To Intervene: Cost based Intervention for Combating Fake NewsProceedings of the 2021 International Conference on Management of Data10.1145/3448016.3452778(2300-2309)Online publication date: 9-Jun-2021
  • (2021)Thai Fake News Detection Based on Information Retrieval, Natural Language Processing and Machine LearningSN Computer Science10.1007/s42979-021-00775-62:6Online publication date: 23-Aug-2021
  • (2020)The Kauwa-Kaate Fake News Detection SystemProceedings of the 7th ACM IKDD CoDS and 25th COMAD10.1145/3371158.3371402(302-306)Online publication date: 5-Jan-2020

View Options

Login options

Full Access

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media