Abstract
This paper conducts a sentiment analysis of Twitter’s posts, between late October 2020 and late April 2021, regarding COVID-19 vaccination campaign in Mexico through several machine learning models such as Logistic Regression, Neuronal Network, Naive Bayes and Support Vector Machine. To prepare data, Natural Language Processing techniques were used such as tokenization, stemming, n-grams and stopwords. The best performance was achieved by Logistic Regression with an accuracy score of 83.42% while classifying tweets according to a positive or negative sense. This work suggests that sentiment analysis with Twitter information allows to witness a relevant part of the public discussion around specific topics. For this study, the tweets analyzed showed a similar behavior to other search and reference electronic tools, such as Google Trends regarding conversation around COVID. In addition, the present analysis allows the classification and tendency of public opinion. Furthermore, this study shows that measuring people’s opinion through machine learning and natural language processing techniques can generate significant benefits for institutions and businesses given that obtaining information on Twitter is less expensive and can be processed and analyzed faster than other opinion analysis techniques such as surveys or focus groups.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Albawi, S., Mohammed, T.A., Al-Zawi, S.: Understanding of a convolutional neural network. In: 2017 International Conference on Engineering and Technology (ICET), pp. 1–6 (2017). https://doi.org/10.1109/ICEngTechnol.2017.8308186
Aljameel, S.S., et al.: A sentiment analysis approach to predict an individual’s awareness of the precautionary procedures to prevent COVID-19 outbreaks in Saudi Arabia. Int. J. Environ. Res. Public Health 18(1), 218 (2020)
GOBMX: Covid-19 preguntas frecuentes. https://bit.ly/3cJEUBy
Google: Google trends. https://bit.ly/3s2e0ex
Lewis, D.D.: Naive (Bayes) at forty: the independence assumption in information retrieval. In: Nédellec, C., Rouveirol, C. (eds.) ECML 1998. LNCS, vol. 1398, pp. 4–15. Springer, Heidelberg (1998). https://doi.org/10.1007/BFb0026666
Murphy, J.: Social media in public opinion research. American Association of Public Opinion Research, May 2014
Velásquez, N., et al.: Hate multiverse spreads malicious COVID-19 content online beyond individual platform control. https://arxiv.org/ftp/arxiv/papers/2004/2004.00673.pdf
Pisner, D.A., Schnyer, D.M.: Support vector machine. In: Mechelli, A., Vieira, S. (eds.) Machine Learning, pp. 101–121. Academic Press (2020). https://doi.org/10.1016/B978-0-12-815739-8.00006-7. https://www.sciencedirect.com/science/article/pii/B9780128157398000067
Provost, F., Fawcett, T.: Data Science for Business, 1st edn. O’Reilly Media, Sebastopol (2013)
Raschka, S.: Python Machine Learning. Packt Publishing Ltd., Birmingham (2015)
Sande, J.: Analisis de sentimientos en Twitter. http://openaccess.uoc.edu/webapps/o2/bitstream/10609/81435/6/jsobrinosTFM0618memoria.pdf
Shadpour, D.: How social media can serve as the new focus group for your brand. https://bit.ly/3r2ur9p
Snscrape. https://pypi.org/project/snscrape/
Wright, R.E.: Logistic regression. In: Grimm, L.G., Yarnold, P.R. (eds.) Reading and Understanding Multivariate Statistics, pp. 217–244. American Psychological Association, Washington, DC (1995)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Switzerland AG
About this paper
Cite this paper
Bernal, C., Bernal, M., Noguera, A., Ponce, H., Avalos-Gauna, E. (2021). Sentiment Analysis on Twitter About COVID-19 Vaccination in Mexico. In: Batyrshin, I., Gelbukh, A., Sidorov, G. (eds) Advances in Soft Computing. MICAI 2021. Lecture Notes in Computer Science(), vol 13068. Springer, Cham. https://doi.org/10.1007/978-3-030-89820-5_8
Download citation
DOI: https://doi.org/10.1007/978-3-030-89820-5_8
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-89819-9
Online ISBN: 978-3-030-89820-5
eBook Packages: Computer ScienceComputer Science (R0)