Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/2808797.2808819acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
research-article

Using Arabic Microblogs Features in Determining Credibility

Published: 25 August 2015 Publication History

Abstract

The increased usage of Twitter as a medium for reporting news and sharing information between people has caught the attention of researchers from different disciplines. One of the research directions is the analysis of online information from the perspective of its credibility. This paper aims to assess and analyze the credibility of tweets in Arabic language. In order to achieve the stated goal, first we employ the idea of crowdsourcing where users can explicitly express their opinions about credibility of a set of tweets. This information coupled with the data about tweets' features enable us to investigate which features may indicate the credibility level of a tweet, e.g. tweet with attached image and was authored by a person who posts a lot of tweets will be, with high probability, a credible tweet. We distinguish three main groups of features: authority and topical expertise (of the source), data quality (of the content), and popularity (of the content and the source). We argue that content data quality factor based on content linguistic features in addition to source authority is more important than content popularity in identifying credible messages. In addition to this, we identified three experts who also rated the credibility of tweets and based on that we investigate the level of agreement between experts and the crowd, and we identify which expert represents the crowd in the best way. This can allow us to select the most representative expert when it is needed. This study is a pilot of a large study that aims at predicting credibility of Arabic Twitter messages using machine learning approaches.

References

[1]
H. Kwak, C. Lee, H. Park, and S. Moon, "What is Twitter, a Social Network or a News Media?," in Proceedings of the 19th International Conference on World Wide Web, New York, NY, USA, 2010, pp. 591--600.
[2]
A. A. AlMansour, L. Brankovic, and C. S. Iliopoulos, "Evaluation of Credibility Assessment for Microblogging: Models and Future Directions," in Proceedings of the 14th International Conference on Knowledge Technologies and Data-driven Business, New York, NY, USA, 2014, pp. 32:1--32:4.
[3]
C. Castillo, M. Mendoza, and B. Poblete, "Information Credibility on Twitter," in Proceedings of the 20th International Conference on World Wide Web, New York, NY, USA, 2011, pp. 675--684.
[4]
F. Yang, Y. Liu, X. Yu, and M. Yang, "Automatic Detection of Rumor on Sina Weibo," in Proceedings of the ACM SIGKDD Workshop on Mining Data Semantics, New York, NY, USA, 2012, pp. 13:1--13:7.
[5]
X. Xia, X. Yang, C. Wu, S. Li, and L. Bao, "Information Credibility on Twitter in Emergency Situation," in Proceedings of the 2012 Pacific Asia Conference on Intelligence and Security Informatics, Berlin, Heidelberg, 2012, pp. 45--59.
[6]
B. Kang, J. O'Donovan, and T. Höllerer, "Modeling Topic Specific Credibility on Twitter," in Proceedings of the 2012 ACM International Conference on Intelligent User Interfaces, New York, NY, USA, 2012, pp. 179--188.
[7]
A. Gupta and P. Kumaraguru, "Credibility Ranking of Tweets During High Impact Events," in Proceedings of the 1st Workshop on Privacy and Security in Online Social Media, New York, NY, USA, 2012, pp. 2:2--2:8.
[8]
S. Bhattacharya, H. Tran, P. Srinivasan, and J. Suls, "Belief Surveillance with Twitter," in Proceedings of the 4th Annual ACM Web Science Conference, New York, NY, USA, 2012, pp. 43--46.
[9]
V. Qazvinian, E. Rosengren, D. R. Radev, and Q. Mei, "Rumor Has It: Identifying Misinformation in Microblogs," in Proceedings of the Conference on Empirical Methods in Natural Language Processing, Stroudsburg, PA, USA, 2011, pp. 1589--1599.
[10]
J. Ratkiewicz, M. Conover, M. Meiss, B. Gonçalves, S. Patil, A. Flammini, and F. Menczer, "Truthy: Mapping the Spread of Astroturf in Microblog Streams," in Proceedings of the 20th International Conference Companion on World Wide Web, New York, NY, USA, 2011, pp. 249--252.
[11]
M. Gupta, P. Zhao, and J. Han, "Evaluating Event Credibility on Twitter.," in SDM, 2012, pp. 153--164.
[12]
S. Ravikumar, R. Balakrishnan, and S. Kambhampati, "Ranking tweets considering trust and relevance," in Proceedings of the Ninth International Workshop on Information Integration on the Web, 2012, p. 4.
[13]
M. Mendoza, B. Poblete, and C. Castillo, "Twitter Under Crisis: Can we trust what we RT?," in Proceedings of the first workshop on social media analytics, 2010, pp. 71--79.
[14]
J. O'Donovan, B. Kang, G. Meyer, T. Hollerer, and S. Adalii, "Credibility in context: An analysis of feature distributions in twitter," in Privacy, Security, Risk and Trust (PASSAT), 2012 International Conference on and 2012 International Confernece on Social Computing (SocialCom), 2012, pp. 293--301.
[15]
H. S. Al-Khalifa and R. M. Al-Eidan, "An experimental system for measuring the credibility of news content in Twitter," Int. J. Web Inf. Syst., vol. 7, no. 2, pp. 130--151, 2011.
[16]
K. R. Canini, B. Suh, and P. L. Pirolli, "Finding credible information sources in social networks based on content and social structure," in Privacy, Security, Risk and Trust (PASSAT) and 2011 IEEE Third Inernational Conference on Social Computing (SocialCom), 2011 IEEE Third International Conference on, 2011, pp. 1--8.
[17]
A. Pal and S. Counts, "Identifying topical authorities in microblogs," in Proceedings of the fourth ACM international conference on Web search and data mining, 2011, pp. 45--54.
[18]
A. Pal and S. Counts, "What's in a@ name? How Name Value Biases Judgment of Microblog Authors.," in ICWSM, 2011.
[19]
M. R. Morris, S. Counts, A. Roseway, A. Hoff, and J. Schwarz, "Tweeting is believing?: understanding microblog credibility perceptions," in Proceedings of the ACM 2012 conference on Computer Supported Cooperative Work, 2012, pp. 441--450.
[20]
J. Yang, S. Counts, M. R. Morris, and A. Hoff, "Microblog credibility perceptions: Comparing the usa and china," in Proceedings of the 2013 conference on Computer supported cooperative work, 2013, pp. 575--586.
[21]
D. Westerman, P. R. Spence, and B. Van Der Heide, "A social network as information: The effect of system generated reports of connectedness on credibility on Twitter," Comput. Hum. Behav., vol. 28, no. 1, pp. 199--206, 2012.
[22]
J. R. Landis and G. G. Koch, "The measurement of observer agreement for categorical data," biometrics, pp. 159--174, 1977.
[23]
K. A. Hallgren, "Computing inter-rater reliability for observational data: An overview and tutorial," Tutor. Quant. Methods Psychol., vol. 8, no. 1, p. 23, 2012.
[24]
N. Hayeri, C. K. Chung, and J. W. Pennebaker, "The development of linguistic inquiry and word count (LIWC) for Arabic texts," Austin TX LIWC Net, 2010.

Cited By

View all
  • (2021)Social Media and Microblogs Credibility: Identification, Theory Driven Framework, and RecommendationIEEE Access10.1109/ACCESS.2021.31144179(137744-137781)Online publication date: 2021
  • (2017)Twitter features distributions across similar labelers2017 13th International Computer Engineering Conference (ICENCO)10.1109/ICENCO.2017.8289823(405-410)Online publication date: Dec-2017
  • (2016)Labeling Agreement Level and Classification Accuracy2016 12th International Conference on Signal-Image Technology & Internet-Based Systems (SITIS)10.1109/SITIS.2016.51(271-274)Online publication date: 2016

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
ASONAM '15: Proceedings of the 2015 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining 2015
August 2015
835 pages
ISBN:9781450338547
DOI:10.1145/2808797
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 25 August 2015

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. arabic
  2. credibility
  3. microblogs
  4. social networks
  5. trust
  6. twitter

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

ASONAM '15
Sponsor:

Acceptance Rates

Overall Acceptance Rate 116 of 549 submissions, 21%

Upcoming Conference

KDD '25

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)2
  • Downloads (Last 6 weeks)0
Reflects downloads up to 25 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2021)Social Media and Microblogs Credibility: Identification, Theory Driven Framework, and RecommendationIEEE Access10.1109/ACCESS.2021.31144179(137744-137781)Online publication date: 2021
  • (2017)Twitter features distributions across similar labelers2017 13th International Computer Engineering Conference (ICENCO)10.1109/ICENCO.2017.8289823(405-410)Online publication date: Dec-2017
  • (2016)Labeling Agreement Level and Classification Accuracy2016 12th International Conference on Signal-Image Technology & Internet-Based Systems (SITIS)10.1109/SITIS.2016.51(271-274)Online publication date: 2016

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media