Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/2983323.2983911acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
short-paper
Public Access

Probabilistic Approaches to Controversy Detection

Published: 24 October 2016 Publication History

Abstract

Recently, the problem of automated controversy detection has attracted a lot of interest in the information retrieval community. Existing approaches to this problem have set forth a number of detection algorithms, but there has been little effort to model the probability of controversy in a document directly. In this paper, we propose a probabilistic framework to detect controversy on the web, and investigate two models. We first recast a state-of-the-art controversy detection algorithm into a model in our framework. Based on insights from social science research, we also introduce a language modeling approach to this problem.
We evaluate different methods of creating controversy language models based on a diverse set of public datasets including Wikipedia, Web and News corpora. Our automatically derived language models show a significant relative improvement of 18% in AUC over prior work,and 23% over two manually curated lexicons.

References

[1]
R. Awadallah, M. Ramanath, and G. Weikum. Harmony and dissonance: Organizing the people's voices on political controversies. In WSDM'12.
[2]
M.-A. Cartright, E. Aktolga, and J. Dalton. Characterizing the subjectivity of topics. In SIGIR' 09.
[3]
Y. Choi, Y. Jung, and S.-H. Myaeng. Identifying controversial issues and their sub-topics in news articles. In Intelligence and Security Informatics. Springer, 2010.
[4]
D. Corney, D. Albakour, M. Martinez, and S. Moussa. What do a million news articles look like? In Workshop on Recent Trends in News IR, ECIR'16.
[5]
P. A. Cramer. Controversy as news discourse, volume 19. Springer Science & Business Media, 2011.
[6]
W. B. Croft, D. Metzler, and T. Strohman. Search engines:Information retrieval in practice, volume 283. Addison-Wesley Reading, 2010.
[7]
S. Das, A. Lavoie, and M. Magdon-Ismail. Manipulation among the arbiters of collective intelligence: how wikipedia administrators mold public opinion. In CIKM'13.
[8]
S. Dori-Hacohen and J. Allan. Automated controversy detection on the web. In ECIR'15.
[9]
S. Dori-Hacohen, E. Yom-Tov, and J. Allan. Navigating Controversy as a Complex Search Task. In Supporting Complex Search Tasks, at ECIR' 15.
[10]
S. L. Gerhart. Do Web search engines suppress controversy? First Monday, 9, 2004.{1 {11} M. Jang and J. Allan. Improving automated controversy detection on the web. In SIGIR' 16.
[11]
M. Kacimi and J. Gamper. Diversifying search results of controversial queries. In CIKM'11.
[12]
A. Kittur, B. Suh, B. A. Pendleton, and E. H. Chi. He says,she says: con ict and coordination in wikipedia. In SIGCHI'07.
[13]
Y. Mejova, A. X. Zhang, N. Diakopoulos, and C. Castillo. Controversy and sentiment in online news.arXiv preprintarXiv:1409.8152, 2014.
[14]
E. Pariser. The Filter Bubble: What the Internet Is Hiding from You. 2011.
[15]
A.-M. Popescu and M. Pennacchiotti. Detecting controversial events from twitter. In CIKM'10.
[16]
H. S. Rad and D. Barbosa. Identifying controversial articles in wikipedia: A comparative study. In WikiSym 2012.
[17]
H. Roitman, S. Hummel, E. Rabinovich, B. Sznajder, N. Slonim, and E. Aharoni. On the Retrieval of Wikipedia Articles Containing Claims on Controversial Topics. In WWW'16.
[18]
T. Rose, M. Stevenson, and M. Whitehead. The Reuters Corpus Volume 1. In LREC 2002.{20} E. Sandhaus. The New York Times annotated corpus.LDC,6(12):e26752, 2008.
[19]
T. Yasseri, R. Sumi, A. Rung, A. Kornai, and J. Kert--esz. Dynamics of conflicts in wikipedia. PloS one, 7(6):e38869,2012

Cited By

View all
  • (2023)Comprehensive Browser Extension for Analysing YouTube User Engagement, Controversy, User Requirements, and Trending Keywords2023 33rd International Telecommunication Networks and Applications Conference10.1109/ITNAC59571.2023.10368503(134-139)Online publication date: 29-Nov-2023
  • (2023)Understanding conflict origin and dynamics on Twitter: A real-time detection systemExpert Systems with Applications10.1016/j.eswa.2022.118748212(118748)Online publication date: Feb-2023
  • (2022)Wikinformetrics: Construction and description of an open Wikipedia knowledge graph data set for informetric purposesQuantitative Science Studies10.1162/qss_a_002263:4(931-952)Online publication date: 20-Dec-2022
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
CIKM '16: Proceedings of the 25th ACM International on Conference on Information and Knowledge Management
October 2016
2566 pages
ISBN:9781450340731
DOI:10.1145/2983323
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 24 October 2016

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. controversy detection
  2. critical literacy
  3. language modeling

Qualifiers

  • Short-paper

Funding Sources

Conference

CIKM'16
Sponsor:
CIKM'16: ACM Conference on Information and Knowledge Management
October 24 - 28, 2016
Indiana, Indianapolis, USA

Acceptance Rates

CIKM '16 Paper Acceptance Rate 160 of 701 submissions, 23%;
Overall Acceptance Rate 1,861 of 8,427 submissions, 22%

Upcoming Conference

CIKM '25

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)98
  • Downloads (Last 6 weeks)10
Reflects downloads up to 12 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2023)Comprehensive Browser Extension for Analysing YouTube User Engagement, Controversy, User Requirements, and Trending Keywords2023 33rd International Telecommunication Networks and Applications Conference10.1109/ITNAC59571.2023.10368503(134-139)Online publication date: 29-Nov-2023
  • (2023)Understanding conflict origin and dynamics on Twitter: A real-time detection systemExpert Systems with Applications10.1016/j.eswa.2022.118748212(118748)Online publication date: Feb-2023
  • (2022)Wikinformetrics: Construction and description of an open Wikipedia knowledge graph data set for informetric purposesQuantitative Science Studies10.1162/qss_a_002263:4(931-952)Online publication date: 20-Dec-2022
  • (2022)Fighting post-truth using natural language processingExpert Systems with Applications: An International Journal10.1016/j.eswa.2019.112943141:COnline publication date: 21-Apr-2022
  • (2022)A text and GNN based controversy detection method on social mediaWorld Wide Web10.1007/s11280-022-01116-026:2(799-825)Online publication date: 17-Nov-2022
  • (2022)Net activism and whistleblowing on YouTube: a text mining analysisMultimedia Tools and Applications10.1007/s11042-022-13777-082:6(9201-9221)Online publication date: 29-Sep-2022
  • (2022)Controversy Detection: A Text and Graph Neural Network Based ApproachWeb Information Systems Engineering – WISE 202110.1007/978-3-030-90888-1_26(339-354)Online publication date: 1-Jan-2022
  • (2021)Toward a Standard Approach for Echo Chamber Detection: Reddit Case StudyApplied Sciences10.3390/app1112539011:12(5390)Online publication date: 10-Jun-2021
  • (2021)Detecting Environmental, Social and Governance (ESG) Topics Using Domain-Specific Language Models and Data AugmentationFlexible Query Answering Systems10.1007/978-3-030-86967-0_12(157-169)Online publication date: 16-Sep-2021
  • (2020)Stance DetectionACM Computing Surveys10.1145/336902653:1(1-37)Online publication date: 6-Feb-2020
  • Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media