Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3038884.3038906acmotherconferencesArticle/Chapter ViewAbstractPublication PagesmedpraiConference Proceedingsconference-collections
research-article

Towards Arabic semantic opinion mining: identifying opinion, polarity and intensity

Published: 22 November 2016 Publication History

Abstract

Arabic opinion mining is a challenging task because Arabic is morphologically and semantically rich language. In this paper, we are interested in analyzing opinions in Arabic news articles. We propose to use a machine learning technique to classify opinions or sentiments at the expression level. Our approach involves determining the semantic category of the expression. It also includes the classification of the opinion expression into positive or negative and the classification of its intensity into high, medium and low. Our method relies on wide range of features which are used in the literature like n-grams, morphological, stylistic features, etc. In addition, we propose new features inspired from contextual, semantic information and others specific for Arabic language. In the same context, we try to have a good contribution in opinion mining in Arabic by proposing to use Conditional Random Fields as a discriminative model. We carry out many experiments by combining at the same time different set of features to find the best combination that yield the best results. We evaluate our method at the expression level using a corpus of Arabic news articles. Our method achieves a good result that reaches 84.93% for contextual polarity classification and 87.54% for semantic opinion expression categorization.

References

[1]
Ahmed Abbasi, Hsinchun Chen, and Arab Salem. 2008. Sentiment analysis in multiple languages: Feature selection for opinion classification in Web forums. ACM Transactions on Information Systems. 26, 3 (June 2008), 1--34.
[2]
Mohamed Abdul-Mageed and Mona Diab. 2012. AWATIF: A multi-genre corpus for modern standard Arabic subjectivity and sentiment analysis. In Proceedings of the International Conference on Language Resources and Evaluation. 3907fi?!3914.
[3]
Mohamed Abdul-Mageed and Mona Diab. 2012. Toward building a large-scale arabic sentiment lexicon. In proceedings of the sixth International Global Word-Net Conference. Matsue, Japan.
[4]
Mohamed Abdul-Mageed and Mona Diab. 2014. SANA: A large scale multi-genre, multi-dialect lexicon for arabic subjectivity and sentiment analysis. In Proceedings of the Ninth International Conference on Language Resources and Evaluation. Matsue, Japan.
[5]
Samah Alhazmi, William Black, and John McNaught. 2013. Arabic SentiWordNet in relation to Sentiwordnet 3.0. International Journal of Computational linguistics 4, 1 (August 2013), 1--11.
[6]
Nicholas Asher, Farah BenAmara, and Yvette Y. Mathieu. 2009. Appraisal of Opinion Expressions in Discourse. Lingvistic Investigationes 32, 2(2009), 279--292.
[7]
Nicholas Asher and Alex Lascarides. 2003. Logics of Conversation. Cambridge University Press.
[8]
Adel Assiri, Ahmed Emam, and Hmood Aldossari. 2015. Arabic Sentiment analysis: A Survey. Journal of Advanced Computer Science and Applications 6, 12 (2015).
[9]
Stefano Baccianella, Andrea Esuli, and Fabrizio Sebastiani. 2010. SentiWordNet 3.0: an enhanced lexical resource for sentiment analysis and opinion mining. In Proceedings of the Seventh International Conference on Language Resources and Evaluation. 2200--2204.
[10]
Marco Baroni and Stefano Vegnaduzzo. 2004. Identifying subjective adjectives through web-based mutual information. In Proceedings of the 7th Konferenz zur Verarbeitung Natrlicher Sprache (German Conference on Natural Language Processing) KONVENS04. 613--619.
[11]
Mohamed Boudchiche, Azzeddine Mazroui, Mohamed Ould Abdallahi Ould Bebah, Abdelhak Lakhouaja, and Abderrahim Boudlal. 2016. Alkhalil Morpho Sys 2: A robust Arabic morpho-syntactic Analyzer. Journal of King Saud University fi Computer and Information Sciences (June 2016).
[12]
Tim Buckwalter. 2004. Buckwalter Arabic Morphological Analyzer Version 2.0 LDC2004L02. Web Download. Philadelphia: Linguistic Data Consortium.
[13]
Erik Cambria, Björn Schuller, Yunqing Xia, and Catherine Havasi. 2013. New avenues in opinion mining and sentiment analysis. IEEE Intelligent Systems 28, 2 (February 2013), 15--21.
[14]
Baptiste Chardon. 2013. Chaine de traitement pour une approche discursive de l'analyse d'opinion. Phd dissertation. UPS, France.
[15]
Mohamed Elarnaoty, Samir AbdelRahman, and Aly Fahmy. 2012. A Machine Learning Approach for Opinion Holder Extraction in Arabic Language. International Journal of Artificial Intelligence & Applications 3, 2 (2012).
[16]
Noura Farra, Elie Challita, Rawad Abou-Assi, and Hazem Hajj. 2010. Sentence-level and document-level sentiment mining for arabic texts. In Proceedings of IEEE International conference on data mining workshops. 1114--1119.
[17]
Ronen Feldman. 2013. Techniques and applications for sentiment analysis. Commun. ACM 56, 4 (April 2013), 82--89.
[18]
Marwa Graja. 2015. Comprhension automatique de la parole en dialecte tunisien dans le cadre des systmes de dialogue. Phd dissertation. Fsegs, Tunisie.
[19]
Nizar Habash. 2010. Introduction to Arabic Natural Language Processing. Synthesis Lectures on Human Language Technologies. G.Hirst Ed., Morgan and Claypool.
[20]
Vasileios Hatzivassiloglou and Kathleen R. McKeown. 1997. Predicting the semantic orientation of adjectives. In Proceedings of annual meeting of the Association for Computational Linguistics.
[21]
Khairullah Khan, Baharum Baharudin, Aurnagzeb Khan, and Ashraf Ullah. 2014. Mining Opinion components from unstructured reviews: A review. Journal of King Saud University-Computer and Information Sciences 26, 3 (September 2014), 258--275.
[22]
John Lafferty, Andrew McCallum, and Fernando C.N. Pereira. 2001. Conditional random fields: Probabilistic models for segmenting and labeling sequence data. In Proceedings of the 18th International Conference on Machine Learning (ICML). 282--289.
[23]
Bing Liu. 2012. Sentiment Analysis and Opinion Mining. Morgan & Claypool, San Rafael, CA.
[24]
Bing Liu. 2015. Sentiment Analysis: Mining Opinions, Sentiments, and Emotions. Cambridge Univ.Press, Cambridge, UK.
[25]
Mohamed Maamouri, Ann Bies, Seth Kulick, Sondos Krouna, Fatma Gaddeche, and Wajdi Zaghouani. 2010. Arabic TreeBank (ATB): Part 3 Version 3.2. Linguistic Data Consortium. Catalog No: LDC2010T08.
[26]
J. R Martin and Peter R. R. White. 2005. Language of Evaluation: Appraisal in English. Palgrave Macmillan.
[27]
Walaa Medhat, Ahmed Hassan, and Hoda Korashy. 2014. Sentiment analysis algorithms and applications: A survey. Ain Shams Engineering Journal 5, 4 (December 2014), 1093--1113.
[28]
Malik Muhammad Saad Missen, Mohand Boughanem, and Guillaume Cabanac. 2013. Opinion mining: reviewed from word to document level. Social Network Analysis and Mining 3, 1 (March 2013), 107--125.
[29]
Andrés Montoyo, Patricio Martínez-Barco, and Alexandra Balahur. 2012. Subjectivity and sentiment analysis: An overview of the current state of the area and envisaged developments. Decis. Support Syst. 53, 4 (November 2012), 675--679.
[30]
Bo Pang and Lillian Lee. 2008. Opinion mining and sentiment analysis. Found. Trends Inf. Retr. 2, 1-2 (January 2008), 1--135.
[31]
Patrick Paroubek, Alexander Pak, and Djamel Mostefa. 2010. Annotations for Opinion Mining Evaluation in the Industrial Context of the DOXA project. In Proceedings of the Seventh International Conference on Language Resources and Evaluation. 1105--1112.
[32]
Arfath Pasha, Mohamed Al-Badrashiny, Mona Diab, Ahmed El Kholy, Ramy Eskander, Nizar Habash, Manoj Pooleery, Owen Rambow, and Ryan Roth. 2014. MADAMIRA: A Fast, Comprehensive Tool for Morphological Analysis and Disambiguation of Arabic. In Proceedings of the International Conference on Language Resources and Evaluation. 1094--1101.
[33]
Mohammed Rushdi-Saleh, M. Teresa Martín-Valdivia, L. Alfonso Ureñ a López, and José M. Perea-Ortega. 2011. OCA: Opinion corpus for Arabic. Journal of the American Society for Information Science and Technology 62, 10 (October 2011), 2045--2054.
[34]
Jonathan Sonntag and Manfred Stede. 2014. Sentiment analysis: What's your opinion?. In Text Mining: From Ontology Learning to Automated Text Processing Applications. Springer, 177--199.
[35]
Carlo Strapparava and Alessandro Valitutti. 2004. WordNet-Affect: an Affective Extension of WordNet. In Proceedings of the 4th International Conference on Language Resources and Evaluation. 1083--1086.
[36]
Maite Taboada. 2016. Sentiment Analysis: Overview from Linguistics. Annual Review of Linguistics 2 (January 2016), 325--347.
[37]
Reut Tsarfaty, Djam Seddah, Yoav Goldberg, Sandra Kubler, Yannick Versley, Marie Candito, Jennifer Foster, Ines Rehbein, and Lamia Tounsi. 2010. Statistical parsing of morphologically rich languages (SPMRL): What, how and whither. In Proceedings of the NAACL Workshop on Statistical Parsing of Morphologically Rich Languages. 1--12.
[38]
Casey Whitelaw, Navendu Garg, and Shlomo Argamon. 2005. Using Appraisal Taxonomies for Sentiment Analysis. In Proceedings of the 2nd Midwest Computational Linguistic Colloquium.
[39]
Janyce Wiebe, Theresa Wilson, and Claire Cardie. 2005. Annotating expressions of opinions and emotions in language. Language Resources and Evaluation 39, 2 (May 2005), 165--210.

Cited By

View all
  • (2020)A novel category detection of social media reviews in the restaurant industryMultimedia Systems10.1007/s00530-020-00704-229:3(1825-1838)Online publication date: 24-Oct-2020
  • (2019)A Survey of Opinion Mining in ArabicACM Transactions on Asian and Low-Resource Language Information Processing10.1145/329566218:3(1-52)Online publication date: 7-May-2019

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
MedPRAI-2016: Proceedings of the Mediterranean Conference on Pattern Recognition and Artificial Intelligence
November 2016
163 pages
ISBN:9781450348768
DOI:10.1145/3038884
  • General Chairs:
  • Chawki Djeddi,
  • Imran Siddiqi,
  • Akram Bennour,
  • Program Chairs:
  • Youcef Chibani,
  • Haikal El Abed
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 22 November 2016

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. CRF
  2. Category
  3. Opinion
  4. Polarity
  5. Semantic annotation

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

MedPRAI-2016

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)3
  • Downloads (Last 6 weeks)0
Reflects downloads up to 01 Sep 2024

Other Metrics

Citations

Cited By

View all
  • (2020)A novel category detection of social media reviews in the restaurant industryMultimedia Systems10.1007/s00530-020-00704-229:3(1825-1838)Online publication date: 24-Oct-2020
  • (2019)A Survey of Opinion Mining in ArabicACM Transactions on Asian and Low-Resource Language Information Processing10.1145/329566218:3(1-52)Online publication date: 7-May-2019

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media