Abstract
Opinion mining is the process of analyzing views, attitude or opinions of a writer or a speaker. Research in this particular area involves the detection of opinions from the text of any language. Vast amount of work has been done for the English language. In spite of lack of resources for Indian languages, work has been done for Telugu, Bengali and Hindi language. In this paper, we proposed a hybrid research approach for the emotion/opinion mining of the Punjabi text. Hybrid technique is the combination of Naïve Bayes and N-grams. As the part of presented research, we have extracted the features of N-grams model which are used to train Naïve Bayes. The trained model is then validated using the testing data. Results obtained are also compared with already existing approaches and the accuracy of the results shows the better efficacy of the proposed method.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Biadsy, F., Mckeown, K., Agarwal, A.: Contextual phrase-level polarity analysis using lexical affect scoring and syntactic n-grams (2009)
Esuli, A., Sebastiani, F., Baccianella, S.: Sentiwordnet 3.0: An enhanced lexical resource for sentiment analysis and opinion mining. In: Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC 2010), Valletta, Malta (2010)
Das, A.: Opinion Extraction and Summarization from Text Documents in Bengali. Doctoral Thesis. UMI Order Number: UMI Order No. GAX95-09398. Jadavpur University (2011)
Wiebe, J., Banea, C., Mihalcea, R.: A bootstrapping method for building subjectivity lexicons for languages with scarce resources. In: Proceedings of the Sixth International Language Resources and Evaluation (LREC 2008), Marrakech, Morocco (2008)
Bandyopadhyay, S., Das, A.: SentiWordNet for Bangla (2010)
Bandyopadhyay, S., Das, A.: SentiWordNet for Indian Languages (2010)
Arora, P.: Sentiment Analysis for Hindi Language. Masters thesis, IIT, Hyderabad (2013)
Sebastiani, F., Esuli, A.: Sentiwordnet: A publicly available lexical resource for opinion mining. In: Proceedings of the 5th Conference on Language Resources and Evaluation (LREC 2006), p. 54 (2006)
McKeown, K.R., Hatzivassiloglou, V.: Predicting the semantic orientation of adjectives. In: Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics and Eighth Conference of the European Chapter of the Association for Computational Linguistics, ACL 1998, Stroudsburg, PA, USA, p. 174 (1997)
Liu, B., Hu, M.: Mining and summarizing customer reviews. In: KDD, p. 168 (2004)
Wilson, T., Intelligent, T.W.: Annotating opinions in the world press. In: SIGdial 2003, p. 13 (2003)
Bhattacharyya, P., Joshi, A., Balamurali, A.R.: A fall-back strategy for sentiment analysis in Hindi: A case study (2010)
Rijke, M.D., Kamps, J., Marx, M., Mokken, R.J.: Using wordnet to measure semantic orientation of adjectives. National Institute for, p. 1115 (2004)
Indian Institute of Technology, Hyderabad, http://www.iith.ac.in/
Kim, S.: Determining the sentiment of opinions. In: Proceedings of COLING, p. 1367 (2004)
Hovy, E., Kim, S.: Identifying and analyzing judgment opinions. In: Proceedings of HLT/NAACL 2006, p. 200 (2006)
Vaithyanathan, S., Pang, B., Lee, L.: Thumbs up? Sentiment classification using machine learning techniques. In: Proceedings of the 2002 Conference on Empirical Methods in Natural Language Processing (EMNLP), p. 79 (2002)
Ravichandran, D., Rao, D.: Semi-supervised polarity lexicon induction. In: Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics, EACL 2009, Stroudsburg, PA, USA, p. 675 (2009)
Dunphy, D.C., Stone, P.J., Ogilvie, D.M., Smith, M.S.: The General Inquirer: A Computer Approach to Content Analysis. MIT Press, Cambridge (1966)
Turney, P.: Thumbs up or thumbs down? Semantic orientation applied to unsupervised classification of reviews (2002)
Wiebe, J.M., O’Hara, T.P., Bruce, R.E.: Development and use of a gold-standard data set for subjectivity classifications. In: Proceedings of the 37th Annual Meeting of the Association for Computational Linguistics on Computational Linguistics, ACL 1999, Stroudsburg, PA, USA, p. 246 (1999)
Wilson, T.: Recognizing contextual polarity in phrase-level sentiment analysis. In: Proceedings of HLT-EMNLP, pp. 347–354 (2005)
Hatzivassiloglou, V., Yu, H.: Towards answering opinion questions: Separating facts from opinions and identifying the polarity of opinion sentences. In: Proceedings of the 2003 Conference on Empirical Methods in Natural Language Processing, EMNLP 2003, Stroudsburg, PA, USA, p. 129 (2003)
Gupta, V.: Automatic Stemming of Words for Punjabi Language. In: Thampi, S.M., Gelbukh, A., Mukhopadhyay, J. (eds.) Advances in Signal Processing and Intelligent Recognition Systems. AISC, vol. 264, pp. 73–84. Springer, Heidelberg (2014)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Kaur, A., Gupta, V. (2014). N-gram Based Approach for Opinion Mining of Punjabi Text. In: Murty, M.N., He, X., Chillarige, R.R., Weng, P. (eds) Multi-disciplinary Trends in Artificial Intelligence. MIWAI 2014. Lecture Notes in Computer Science(), vol 8875. Springer, Cham. https://doi.org/10.1007/978-3-319-13365-2_8
Download citation
DOI: https://doi.org/10.1007/978-3-319-13365-2_8
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-13364-5
Online ISBN: 978-3-319-13365-2
eBook Packages: Computer ScienceComputer Science (R0)