Abstract
Cyberbullying on social media platforms has been a severe problem with serious negative consequences. Therefore, a number of researches on automatic detection of cyberbullying using machine learning techniques have been conducted in recent years. While cyberbullying detection has traditionally utilized linguistic features, the cyberbullying on social media does not have only linguistic features. In this paper, a holistic multi-dimensional feature set is developed which takes into account individual-based, social network-based, episode-based and linguistic content-based cyberbullying features. To test performance of the proposed multi-dimensional feature set, we designed and built cyberbullying detection models on the KNIME machine learning platform. Six different machine learning algorithms - Naïve Bayes, Decision Tree, Random Forest, Tree Ensemble, Logistic Regression, and Support Vector Machines - were used in our cyberbullying detection models. Our experimental results demonstrate that applying the proposed multi-dimensional feature set (i.e. the set not limited to the linguistic features) results in an improved cyberbullying detection for all tested machine learning algorithms.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Price, M., Dalgleish, J.: Cyberbullying: experiences, impacts and coping strategies as described by Australian young people. Youth Stud. Aust. 29, 51 (2010)
O’Sullivan, P.B.: Reconceptualizing ‘flaming’ and other problematic messages. New Media Soc. 5(1), 69–94 (2003)
Vandebosch, H., van Cleemput, K.: Cyberbullying among youngsters: profiles of bullies and victims. New Media Soc. 11(8), 1349–1371 (2009)
Willard, N.E.: Cyberbullying and Cyberthreats: Responding to the Challenge of Online Social Aggression, Threats, and Distress. Research Publishers LLC, Champaign (2007)
Cowie, H.: Cyberbullying and its impact on young people’s emotional health and well-being. Psychiatrist 37(5), 167–170 (2013)
Smith, P.K., et al.: Cyberbullying: its nature and impact in secondary school pupils. J. Child Psychol. Psychiatry Allied Discip. 49(4), 376–385 (2008)
Slonje, R., Smith, P.K., Frisén, A.: The nature of cyberbullying, and strategies for prevention. Comput. Hum. Behav. 29(1), 26–32 (2013)
Ghasem, Z., Frommholz, I., Maple, C.: Machine learning solutions for controlling cyberbullying and cyberstalking. J. Inf. Secur. Res. 6(2), 55–64 (2015)
Galán-García, P., et al.: Supervised machine learning for detection of troll profiles in twitter social network: application to real case of cyberbullying. Log. J. IGPL 24(1), 42–53 (2015)
Kasture, A.S., Nand, P., Tegginmath, S.: A predictive model to detect online cyberbullying (2015)
Zhao, R., Zhou, A., Mao, K.: Automatic detection of cyberbullying on social networks based on bullying features. In: 17th International Conference on Computer Networks - ICDCN 2016 (2016)
Engman, L., Janlert, L.E., Bjorklund, H.: Automatic detection of cyberbullying on social media. In: Proceedings of 16th International Multidisciplinary Scientific Conference SGEM 2016, pp. 505–512 (2016)
Chatzakou, D., et al.: Mean birds: detecting aggression and bullying on Twitter (2017)
Haidar, B., Chamoun, M., Serhrouchni, A.: A multilingual system for cyberbullying detection: Arabic content detection using machine learning. ASTES J. 2(6), 275–284 (2017)
Agrawal, S., Awekar, A.: Deep learning for detecting cyberbullying across multiple social media platforms. In: Pasi, G., Piwowarski, B., Azzopardi, L., Hanbury, A. (eds.) ECIR 2018. LNCS, vol. 10772, pp. 141–153. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-76941-7_11
Van Hee, C., et al.: Automatic detection of cyberbullying in social media text. Plos One 1–21 (2018)
Hosseinmardi, H., Mattson, S.A., Ibn Rafiq, R., Han, R., Lv, Q., Mishra, S.: Analyzing labeled cyberbullying incidents on the instagram social network. Social Informatics. LNCS, vol. 9471, pp. 49–66. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-27433-1_4
Hosseinmardi, H., et al.: Towards understanding cyberbullying behavior in a semi-anonymous social network. In: Proceedings of 2014 International Conference on Advances in Social Networks Analysis and Mining, ASONAM 2014, pp. 244–252 (2014)
Hosseinmardi, H., et al.: A comparison of common users across Instagram and Ask.fm to better understand cyberbullying. In: Proceedings of 4th IEEE International Conference on Big Data and Cloud Computing, pp. 355–362 (2014)
Hosseinmardi, H.: Dataset - CU Cyber Safety Research Center, Univ. Colorado at Boulder. https://sites.google.com/site/cucybersafety/home/cyberbullying-detection-project/dataset
Pieschl, S., et al.: Relevant dimensions of cyberbullying - results from two experimental studies. J. Appl. Dev. Psychol. 34(5), 241–252 (2013)
NoSwearing.com: Swear word list, dictionary, filter, and API. https://www.noswearing.com
Hatebase. https://www.hatebase.org/
Berthold, M.R., et al.: KNIME-the Konstanz information miner: ver. 2.0 and beyond. ACM SIGKDD Explor. Newsl. 11(1), 26–31 (2009)
Fawcett, T.: An introduction to ROC analysis. Pattern Rec. Lett. 27(8), 861–874 (2006)
Textfixer.com English stop words list. https://www.textfixer.com/tutorials/common-english-words.txt
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Liu, Y., Zavarsky, P., Malik, Y. (2019). Non-linguistic Features for Cyberbullying Detection on a Social Media Platform Using Machine Learning. In: Vaidya, J., Zhang, X., Li, J. (eds) Cyberspace Safety and Security. CSS 2019. Lecture Notes in Computer Science(), vol 11982. Springer, Cham. https://doi.org/10.1007/978-3-030-37337-5_31
Download citation
DOI: https://doi.org/10.1007/978-3-030-37337-5_31
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-37336-8
Online ISBN: 978-3-030-37337-5
eBook Packages: Computer ScienceComputer Science (R0)