Abstract
This work presents a study of emotions to analyze the polarity of a set of data that was extracted from Twitter, detailing each of the resources in the different forms that a language has, and to be able to observe feelings such as irony, sarcasm, and happiness, among others. This research can help us classify the polarity of each one of them deeply in the corpus that deals with this research work. Experimental results conducted using different machine learning methods are presented: Support Vector Machines, Naïve Bayes, Logistic regression, KNN and Random Forest, with which a classification system based on cross-validation was implemented. All experiments were performed in Python. The results obtained are shown with two different Corpus; where the first set is made up of 10,653 tweets in total divided equally each with 3551 tweets with a positive, negative and neutral label; while the second set was handled with 10% of all the tweets contained in the database mentioned in the article, where the first set shows a polarity precision of 74.9%, having Logistic Regression as the best classifier using the classification scenario known as cross validation, while the second set shows an accuracy of 78.5%, also having Random Forest as the best classifier using Cross Validation as the best classification scenario.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Agarwal, A., Xie, B., Vovsha, I., Rambow, O., Passonneau, R.J.: Sentiment analysis of twitter data. In: Proceedings of the Workshop on Language in Social Media (LSM 2011), pp. 30–38 (2011)
Jackson, J., Gettings, S., Metcalfe, A.J.N.: “The power of Twitter”: Using social media at a conference with nursing students. 68. Elsevier, pp. 188–191 (2018)
Fiorini, P.M., Lipsky, L.R.: Search Marketing Traffic and Performance Models. 34(6), 517–526 (2012)
Fernandez, J., Boldrini, E., Manuel Gomez, J., Martinez-Barco, P.J.P.D.L.N.: Sentiment Analysis and Opinion Mining: The EmotiBlog Corpus. 47, pp. 179–187 (2011)
Reyes, A., Rosso, P., Veale, T.: A Multidimensional Approach for Detecting Irony in Twitter 47(1), 239–268 (2013)
Saberi, B., Saad, S.: Sentiment Analysis or Opinion Mining: A Review. 7(5), 1660–1666 (2017)
Hierons, R.:Machine learning. Tom M. Mitchell. Published by McGraw‐Hill, Maidenhead, UK, International Student Edition, 1997. ISBN: 0‐07‐115467‐1, 414 pages. Price: UK£ 22.99, soft cover, ed: Wiley Online Library (1999)
Chaovalit, P., Zhou, L.: Movie review mining: A comparison between supervised and unsupervised classification approaches. In: Proceedings of the 38th Annual Hawaii International Conference on System Sciences, pp. 112c-112c: IEEE (2005)
Brooke, J., Tofiloski, M., Taboada, M.: Cross-linguistic sentiment analysis: from english to Spanish. In: Proceedings of the International Conference RANLP-2009, pp. 50–54 (2009)
Refaeilzadeh, P., Tang, L., Liu, H.: Cross-validation. 5, p. 532–538 (2009)
Wright, R.E.: Logistic Regression (1995)
Castro, W.M., Cabrera, S.G.: Tuberculosis: Diagnosis by Image Processing. 24(2) (2020)
González, R.H., Morell, C., Blanco, A.: Regresión lineal local con reducción de rango para problemas de predicción con salidas compuestas. Revista Cubana de Ciencias Informáticas 10(4), 184–193 (2016)
Bowers, A.J., Zhou, R.: Receiver operating characteristic (ROC) area under the curve (AUC): a diagnostic measure for evaluating the accuracy of predictors of education outcomes. 24(1), 20–46 (2019)
Makhoul, J., Kubala, F., Schwartz, R., Weischedel, R.: Performance measures for information extraction. In: Proceedings of DARPA Broadcast News Workshop, 249–252 Herndon, VA (1999)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Castro, J.C.M., Cabrera, R.G., Pinales, J.R., Carrillo, L.M.L., Priego, B. (2023). Automatic Polarity Identification on Twitter Using Machine Learning. In: Arai, K. (eds) Proceedings of the Future Technologies Conference (FTC) 2022, Volume 3. FTC 2022 2022. Lecture Notes in Networks and Systems, vol 561. Springer, Cham. https://doi.org/10.1007/978-3-031-18344-7_35
Download citation
DOI: https://doi.org/10.1007/978-3-031-18344-7_35
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-18343-0
Online ISBN: 978-3-031-18344-7
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)