Abstract
Nowadays, more and more people are willing to express their opinions and attitudes in the microblog platform. Stance detection refers to the task that judging whether the author of the text is in favor of or against the given target. Most of the existing literature are for the debates or online conversations, which have adequate context for inferring the authors’ stances. However, for detecting the stance in microblogs, we have to figure out the stance of the author only based on the unique and separate microblog, which sets new obstacles for this task. In this paper, we conduct a comprehensive empirical study on microblog stance detection using supervised and semi-supervised machine learning methods. Different unbalanced data processing strategies and classifiers, such as Linear SVM, Naive Bayes and Random Forest, are compared using NLPCC 2016 Stance Detection Evaluation Task dataset. Experiment results show that the method based on ensemble learning and SMOTE2 unbalanced processing with sentiment word features outperforms the best submission result in NLPCC 2016 Evaluation Task.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Thomas, M., Pang, B., Lee, L.: Get out the vote: determining support or opposition from congressional floor-debate transcripts. In: EMNLP, pp. 327–335 (2006)
Walker, M.A., Anand, P., Abbott, R., Grant, R.: Stance classification using dialogic properties of persuasion. In: HLT-NAACL, pp. 592–596 (2012)
Recasens, M., Danescu-Niculescu-Mizil, C., Jurafsky, D.: Linguistic models for analyzing and detecting biased language. In: ACL (1), pp. 1650–1659 (2013)
Somasundaran, S., Wiebe, J.: Recognizing stances in online debates. In: ACL-IJCNLP, pp. 226–234 (2009)
Murakami, A., Raymond, R.: Support or oppose? Classifying positions in online debates from reply activities and opinion expressions. In: COLING (Posters), pp. 869–875 (2010)
Millen, D.R., Fontaine, M.A.: Multi-team facilitation of very large-scale distributed meetings. In: ECSCW, pp. 259–275 (2003)
Abbott, R., Walker, M., Anand, P., Tree, J.E.F., Bowmani, R., King, J.: How can you say such things?!?: recognizing disagreement in informal political argument. In: The Workshop on Languages in Social Media, 8 (2011)
Wang, Y., Rosé, C.P.: Making conversational structure explicit: identification of initiation-response pairs within online discussions. In: HLT-NAACL, pp. 673–676 (2010)
Faulkner, A.: Automated classification of stance in student essays: an approach using stance target information and the wikipedia link-based measure. In: FLAIRS (2014)
Sobhani, P., Inkpen, D., Matwin, S.: From argumentation mining to stance classification. In: The Workshop on Argumentation Mining (2015)
Rajadesingan, A., Liu, H.: Identifying users with opposing opinions in Twitter debates. In: Kennedy, W.G., Agarwal, N., Yang, S.J. (eds.) SBP 2014. LNCS, vol. 8393, pp. 153–160. Springer, Heidelberg (2014). doi:10.1007/978-3-319-05579-4_19
Chawla, N.V., Bowyer, K.W., Hall, L.O., Kegelmeyer, W.P.: SMOTE: synthetic minority over-sampling technique. J. Artif. Intell. Res. 16(1), 321–357 (2002)
Han, H., Wang, W.-Y., Mao, B.-H.: Borderline-SMOTE: a new over-sampling method in imbalanced data sets learning. In: Huang, D.-S., Zhang, X.-P., Huang, G.-B. (eds.) ICIC 2005. LNCS, vol. 3644, pp. 878–887. Springer, Heidelberg (2005). doi:10.1007/11538059_91
Batista, G.E.A.P.A., Bazzan, A.L.C., Monard, M.C.: Balancing training data for automated annotation of keywords: a case study. In: WOB, pp. 10–18 (2003)
Acknowledgements
The work is supported by National Natural Science Foundation of China (61370074, 61402091), the Fundamental Research Funds for the Central Universities of China under Grant N140404012.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing AG
About this paper
Cite this paper
Liu, L., Feng, S., Wang, D., Zhang, Y. (2016). An Empirical Study on Chinese Microblog Stance Detection Using Supervised and Semi-supervised Machine Learning Methods. In: Lin, CY., Xue, N., Zhao, D., Huang, X., Feng, Y. (eds) Natural Language Understanding and Intelligent Applications. ICCPOL NLPCC 2016 2016. Lecture Notes in Computer Science(), vol 10102. Springer, Cham. https://doi.org/10.1007/978-3-319-50496-4_68
Download citation
DOI: https://doi.org/10.1007/978-3-319-50496-4_68
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-50495-7
Online ISBN: 978-3-319-50496-4
eBook Packages: Computer ScienceComputer Science (R0)