Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3022227.3022283acmconferencesArticle/Chapter ViewAbstractPublication PagesicuimcConference Proceedingsconference-collections
research-article

Predicting political affiliation of posts on Facebook

Published: 05 January 2017 Publication History

Abstract

Recently, social media such as Facebook has been more popular. Receiving information from Facebook and generating or spreading information on Facebook every day has become a general lifestyle. This new information-exchanging platform contains a lot of meaningful messages including users' emotions and preferences. Using messages on Facebook or in general social media to predict the election result and political affiliation has been a trend. In Taiwan, for example, almost every politician tries to have public opinion polls by using social media; almost every politician has his or her own fan page on Facebook, and so do the parties. We make an effort to predict to what party, DPP or KMT, two major parties in Taiwan, a post would be related or affiliated. We design features and models for the prediction, and we evaluate as well as compare them with the data collected from several political fan pages on Facebook. The results show that we can obtain accuracy higher than 90% when the text and interaction features are used with a nearest neighbor classifier.

References

[1]
Shigeo Abe. 2012. Support Vector Machines for Pattern Classification (2nd ed.). Springer Publishing Company, Incorporated.
[2]
David M. Blei, Andrew Y. Ng, Michael I. Jordan. 2003. Latent Dirichlet Allocation. J. Mach. Learn. Res. 3 (January 2003), 993--1022.
[3]
Keh-Jiann Chen and Ming-Hong Bai. 1998. Unknown word detection for Chinese by a corpus-based learning method. International Journal of Computational Linguistics and Chinese Language Processing 3, 1, 27--44.
[4]
Keh-Jiann Chen and Wei-Yun Ma. 2002. Unknown word extraction for Chinese documents. In Proceedings of the 19th international conference on Computational linguistics - Volume 1 (COLING '02), Vol. 1. Association for Computational Linguistics, Stroudsburg, PA, USA, 1--7.
[5]
Michael D. Conover, Bruno Goncalves, Jacob Ratkiewicz, Alessandro Flammini, and Filippo Menczer. 2011. Predicting the political Alignment of Twitter users. In Proceedings of IEEE 3rd International Conference on Privacy, Security, Risk and Trust and IEEE 3rd International Conference on Social Computing (PASSAT-SocialCom '11). IEEE, Boston, MA, USA, 192--199.
[6]
Corinna Cortes and Vladimir Vapnik. 1995. Support-vector networks. Mach. Learn. 20, 3 (September 1995), 273--297. DOI=http://dx.dot.org/
[7]
Thomas M. Cover and Peter E. Hart. 2006. Nearest neighbor pattern classification. IEEE Trans. Inf. Theor. 13, 1 (September 2006), 21--27.
[8]
Nello Cristianini and John Shawe-Taylor. 1999. An Introduction to Support Vector Machines: And Other Kernel-Based Learning Methods. Cambridge University Press, New York, NY, USA.
[9]
Zhendong Dong and Qiang Dong. 2006. Hownet and the Computation of Meaning. World Scientific Publishing Co., Inc., River Edge, NJ, USA.
[10]
Clay Fink, Nathan Bos, Alexander Perrone, Edwina Liu, and Jonathon Kopecky. 2013. Twitter, Public Opinion, and the 2011 Nigerian Presidential Election. In Proceedings of the International Conference on Social Computing (SOCIALCOM '13). IEEE Computer Society, Washington, DC, USA, 311--320.
[11]
Yoav Freund and Robert E. Schapire. 1996. Experiments with a new boosting algorithm. In Proceedings of the 13th International Conference on Machine Learning (ICML '96). Lorenza Saitta (Ed.). Morgan Kaufmann, 148--156.
[12]
David J. Hand and Keming Yu. 2001. Idiot's Bayes---not so stupid after all?. International Statistical Review 69, 3 (December 2001), 385--398.
[13]
Lun-Wei Ku and Hsin-Hsi Chen. 2007. Mining opinions from the Web: Beyond relevance retrieval. J. Am. Soc. Inf. Sci. Technol. 58, 12 (October 2007), 1838--1850.
[14]
Wei-Yun Ma and Keh-Jiann Chen. 2003. A bottom-up merging algorithm for Chinese unknown word extraction. In Proceedings of the second SIGHAN workshop on Chinese language processing - Volume 17 (SIGHAN '03), Vol. 17. Association for Computational Linguistics, Stroudsburg, PA, USA, 31--38.
[15]
Wei-Yun Ma and Keh-Jiann Chen. 2003. Introduction to CKIP Chinese word segmentation system for the first international Chinese Word Segmentation Bakeoff. In Proceedings of the second SIGHAN workshop on Chinese language processing - Volume 17 (SIGHAN '03), Vol. 17. Association for Computational Linguistics, Stroudsburg, PA, USA, 168--171.
[16]
Wei-Yun Ma and Keh-Jiann Chen. 2004. Design of CKIP Chinese word segmentation system. International Journal of Asian Language Processing 14, 3 (May 2004), 235--249.
[17]
Aibek Makazhanov and Davood Rafiei. 2013. Predicting political preference of Twitter users. In Proceedings of the IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM '13). ACM, New York, NY, USA, 298--305.
[18]
Antonio Mucherino, Petraq J. Papajorgji, and Panos M. Pardalos. 2009. k-nearest neighbor classification. Data Mining in Agriculture, Volume 34 of the series Springer Optimization and Its Applications. Springer, New York, NY, USA, 83--106.
[19]
Brendan O'Connor, Ramnath Balasubramanyan, Bryan R. Routledge, and Noah A. Smith. 2010. From tweets to polls: Linking text sentiment to public opinion time series. In Proceedings of the 4th International Conference on Weblogs and Social Media (ICWSM '10). AAAI, 122--129.
[20]
Shawn O'Banion and Larry Birnbaum. 2013. Using explicit linguistic expressions of preference in social media to predict voting behavior. In Proceedings of the IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM '13). ACM, New York, NY, USA, 207--214.
[21]
Marco Pennacchiotti and Ana-Maria Popescu. 2011. Democrats, republicans and starbucks afficionados: user classification in twitter. In Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining (KDD '11). ACM, New York, NY, USA, 430--438.
[22]
J. Ross Quinlan. 1993. C4.5: Programs for Machine Learning. Morgan Kaufmann Publishers, Inc.
[23]
Robert E. Schapire. 1999. A brief introduction to boosting. In Proceedings of the 16th International Ioint Conference on Artificial Intelligence - Volume 2 (IJCAI '99), Vol. 2. Morgan Kaufmann Publishers Inc., San Francisco, CA, USA, 1401--1406.
[24]
Ingo Steinwart and Andreas Christmann. 2008. Support Vector Machines (1st ed.). Springer Publishing Company, Incorporated.
[25]
Andranik Tumasjan, Timm O. Sprenger, Philipp G. Sandner, and Isabell M. Welpe. 2010. Predicting elections with Twitter: What 140 characters reveal about political sentiment. In Proceedings of the 4th International Conference on Weblogs and Social Media (ICWSM '10). AAAI, 178--185.
[26]
Diego Tumitan and Karin Becker. 2014. Sentiment-Based Features for Predicting Election Polls: A Case Study on the Brazilian Scenario. In Proceedings of the IEEE/WIC/ACM International Joint Conferences on Web Intelligence and Intelligent Agent Technologies - Volume 02 (WI-IAT '14), Vol. 2. IEEE Computer Society, Washington DC, USA, 126--133.
[27]
Xindong Wu, Vipin Kumar, J. Ross Quinlan, Joydeep Ghosh, Qiang Yang, Hiroshi Motoda, Geoffrey J. McLachlan, Angus Ng, Bing Liu, Philip S. Yu, Zhi-Hua Zhou, Michael Steinbach, David J. Hand, and Dan Steinberg. 2007. Top 10 algorithms in data mining. Knowl. Inf. Syst. 14, 1 (December 2007), 1--37.
[28]
Harry Zhang. 2004. The optimality of naive Bayes. In Proceedings of the 17th International Florida Artificial Intelligence Research Society Conference (FLAIRS '04). AAAI, 562--567.

Cited By

View all
  • (2024)Deep Learning Models for Predicting Political Tendency in Facebook Posts Incorporating Texts and Emojis2024 6th International Conference on Data-driven Optimization of Complex Systems (DOCS)10.1109/DOCS63458.2024.10704299(294-300)Online publication date: 16-Aug-2024
  • (2021)Toward a Standard Approach for Echo Chamber Detection: Reddit Case StudyApplied Sciences10.3390/app1112539011:12(5390)Online publication date: 10-Jun-2021
  • (2021)Inferring political preference from Twitter tweets2021 6th International Conference on Inventive Computation Technologies (ICICT)10.1109/ICICT50816.2021.9358593(471-475)Online publication date: 20-Jan-2021
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
IMCOM '17: Proceedings of the 11th International Conference on Ubiquitous Information Management and Communication
January 2017
746 pages
ISBN:9781450348881
DOI:10.1145/3022227
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 05 January 2017

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Facebook
  2. classification
  3. political affiliation
  4. text mining

Qualifiers

  • Research-article

Conference

IMCOM '17
Sponsor:

Acceptance Rates

IMCOM '17 Paper Acceptance Rate 113 of 366 submissions, 31%;
Overall Acceptance Rate 213 of 621 submissions, 34%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)18
  • Downloads (Last 6 weeks)4
Reflects downloads up to 25 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Deep Learning Models for Predicting Political Tendency in Facebook Posts Incorporating Texts and Emojis2024 6th International Conference on Data-driven Optimization of Complex Systems (DOCS)10.1109/DOCS63458.2024.10704299(294-300)Online publication date: 16-Aug-2024
  • (2021)Toward a Standard Approach for Echo Chamber Detection: Reddit Case StudyApplied Sciences10.3390/app1112539011:12(5390)Online publication date: 10-Jun-2021
  • (2021)Inferring political preference from Twitter tweets2021 6th International Conference on Inventive Computation Technologies (ICICT)10.1109/ICICT50816.2021.9358593(471-475)Online publication date: 20-Jan-2021
  • (2021)Extending DeGroot Opinion Formation for Signed Graphs and Minimizing PolarizationComplex Networks & Their Applications IX10.1007/978-3-030-65351-4_24(298-309)Online publication date: 5-Jan-2021
  • (2019)Polarization and Fake NewsACM Transactions on the Web10.1145/331680913:2(1-22)Online publication date: 27-Mar-2019
  • (2019)Predicting Neurodegenerative Diseases Using a Novel Blood Biomarkers-based Model by Machine Learning2019 International Conference on Technologies and Applications of Artificial Intelligence (TAAI)10.1109/TAAI48200.2019.8959854(1-6)Online publication date: Nov-2019
  • (2018)Predicting Political Tendency of Posts on FacebookProceedings of the 2018 7th International Conference on Software and Computer Applications10.1145/3185089.3185094(110-114)Online publication date: 8-Feb-2018

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media