Abstract
The sentiment analysis of short texts is an important research hotspot in natural language processing. Based on the word features, this paper constructs a binary sentiment dictionary for a Chinese short text corpus using statistical methods. Then we calculate the sentiment value of the dictionary by Word2Vec algorithm and seed words. To evaluate the effectiveness of the dictionary, we manually annotated sentiment of the dictionary and compared with the calculation result. We also compared the performance effects of using different emotional dictionaries for the sentiment classification. The results show that the sentiment collocation dictionary is performed well in the emotional classification of Chinese short texts.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Redman, S., Ellis, R.: A way with words. Book 1. Cambridge University Press, Cambridge (1989)
Esuli, A., Sebastoamo, F.: Sentiwordnet: a publicly available lexical resource for opinion mining. In: Proceedings of LREC Genoa-Italy: LREC, pp. 417–422 (2006)
Baccianella, S., Esuli, A., Sebastiani, F.: SentiWordNet3.0: an enhanced lexical resource for sentiment analysis and opinion mining. In: International Conference on Language Resources and Evaluation, pp. 83–90 (2010)
Tang, D.: Nation Taiwan University: simplified Chinese emotional dictionary (2013). http://www.datatang.com/data/11837
Xu, L., Lin, H.F., Pan, Y., et al.: Constructing the affective lexicon ontology. J. China Soc. Sci. Tech. Inf. 27(2), 180–185 (2008)
Yang, A.M., Lin, J.H., Zhou, Y.M.: Method on building Chinese text sentiment lexicon. J. Front. Comput. Sci. Technol. 7(11), 1033–1039 (2013)
Zhou, Y.M., Yang, A.M., Lin, J.H.: Construction method of sentiment lexicon for news reviews. J. Shandong Univ. (Eng. Sci.) 44(3), 36–40 (2014)
Zhou, Y.M., Yang, A.M., Yang, J.N.: A method of building Chinese microblog sentiment lexicon. Comput. Sci. 41(8), 67–70 (2014)
Zhou, J.F., Yang, A.M., et al.: Micro-bolg sentimental feature selection based on bigram collocation. Comput. Eng. 40(6), 162–165 (2014)
Hamdan, H., Bellot, P., Bechet, F.: Sentiment lexicon-based features for sentiment analysis in short text. In: International Conference on Intelligent Text Processing and Computational Linguistics (2015)
Zhang, C., Zeng, D., Li, J., Wang, F.Y., Zuo, W.: Sentiment analysis of Chinese documents. J. Assoc. Inf. Sci. Technol. 60(12), 2474–2487 (2009)
Tomas, M.: Word2Vec project (2014). https://code.google.com/p/Word2Vec/
Zhang, D., Xu, H., Su, Z., Xu, Y.: Chinese comments sentiment classification based on Word2Vec and SVM perf. Expert Syst. Appl. 42(4), 1857–1863 (2015)
Joseph, T., Lev, R., Yoshua, B.: Word representations: a simple and general method for semi-supervised learning. In: Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, pp. 384–394 (2010)
Chen, Y.J.: The Automatic Extraction Method of Collocation of Words in Modern Chinese. East China Normal University (2005)
Wang, S., Yang, A.: A method of collocation orientation identification based on hybrid language information. J. Chin. Motion Process. 24(3), 69–74 (2012)
Mikolov, T., Chen, K., Corrado, G., Dean, J.: Efficient estimation of word representations in vector space. In: Proceedings of Workshop at ICLR (2013)
Su, J.: Incredible Word2Vec.trained models (2017). http://spaces.ac.cn/archives/4304/comment-page-1
Řehůřek, R.: Gensim: Topic modelling for humans (2017). http://radimrehurek.com/genism
Acknowledgments
The work was supported by the University Innovative Talent Project of Guangdong Province (2013).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Zhou, J., Chen, B., Lin, Y. (2017). An Approach to Constructing Sentiment Collocation Dictionary for Chinese Short Text Based on Word2Vec. In: Huang, TC., Lau, R., Huang, YM., Spaniol, M., Yuen, CH. (eds) Emerging Technologies for Education. SETE 2017. Lecture Notes in Computer Science(), vol 10676. Springer, Cham. https://doi.org/10.1007/978-3-319-71084-6_64
Download citation
DOI: https://doi.org/10.1007/978-3-319-71084-6_64
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-71083-9
Online ISBN: 978-3-319-71084-6
eBook Packages: Computer ScienceComputer Science (R0)