Abstract
The use of Machine Learning (ML) algorithms in opinion mining, particularly supervised learning algorithms, requires an annotated corpus to train the classification model in order to predict results that are close to reality. Unfortunately, there are still no resources for the automatic processing of textual data expressed in the Senegalese urban language.
The objective of this paper is to build a multilingual corpus for opinion mining (COMFO). The process of building the COMFO corpus is composed of three steps: presentation of the data source, data collection and preparation, and annotation by lexical approach. The particularity of COMFO lies in the integration of foreign languages (French and English) and local languages, particularly urban Wolof, in order to reflect the collective opinion of Senegalese readers.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
Notes
References
Jeyapriya, A., Selvi, C.K.: Extracting aspects and mining opinions in product reviews using supervised learning algorithm. In: Electronics and Communication Systems (ICECS), 2015 2nd International Conference on, pp. 548–552 (2015)
Liu, B., Zhang, L.: A survey of opinion mining and sentiment analysis. In: Mining text data, pp. 415–463. Springer (2012)
Proksch, S.-O., Lowe, W., Wäckerle, J., Soroka, S.: Multilingual sentiment analysis: A new approach to measuring conflict in legislative speeches. Legis. Stud. Q. (2018)
Grljević, O., Bošnjak, Z., Kovačević, A.: Opinion mining in higher education: a corpus-based approach. Enterp. Inf. Syst. 1–26 (2020)
Hardalov, M., Arora, A., Nakov, P., Augenstein, I.: Few-shot cross-lingual stance detection with sentiment-based pre-training. ArXiv Prepr. ArXiv210906050 (2021)
Lo, S.L., Cambria, E., Chiong, R., Cornforth, D.: Multilingual sentiment analysis: from formal to informal and scarce resource languages. Artif. Intell. Rev. 48(4), 499–527 (2017)
Faty, L., Ndiaye, M., Diop, I., Drame, K.: The complexity of comments from Senegalese online presses face with opinion mining methods. In: 2019 14th Iberian Conference on Information Systems and Technologies (CISTI), pp. 1–6 (2019)
Faty, L., Ndiaye, M., Sarr, E.N., Sall, O.: OpinionScraper: a news comments extraction tool for opinion mining. In: 2020 Seventh International Conference on Social Networks Analysis, Management and Security (SNAMS), pp. 1–5 (2020)
Faty, L., et al.: SenOpinion: a new lexicon for opinion tagging in Senegalese news comments., In: 2020 15th Iberian Conference on Information Systems and Technologies (CISTI), pp. 1–6 (2020)
Sun, S., Luo, C., Chen, J.: A review of natural language processing techniques for opinion mining systems. Inf. Fusion 36, 10–25 (2017)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Faty, L. et al. (2023). COMFO: Multilingual Corpus for Opinion Mining. In: Goertzel, B., Iklé, M., Potapov, A., Ponomaryov, D. (eds) Artificial General Intelligence. AGI 2022. Lecture Notes in Computer Science(), vol 13539. Springer, Cham. https://doi.org/10.1007/978-3-031-19907-3_2
Download citation
DOI: https://doi.org/10.1007/978-3-031-19907-3_2
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-19906-6
Online ISBN: 978-3-031-19907-3
eBook Packages: Computer ScienceComputer Science (R0)