Abstract
Community Question Answering (CQA) websites provide a platform to ask questions and share their knowledge. Good questions in CQA websites can improve user experiences and attract more users. To the best of our knowledge, a few researches have been studied on the question quality, especially the quality of newly proposed questions. In this work, we consider that a good question is popular and answerable in CQA websites. The community features of questions are extracted automatically and utilized to acquire massive good questions. The text features and asker features of good questions are utilized to train our weakly supervised model based on Convolutional Neural Network to recognize good newly proposed questions. We conduct extensive experiments on the publicly available dataset from StackExchange and our best result achieves F1-score at 91.5%, outperforming the baselines.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
Notes
- 1.
- 2.
- 3.
- 4.
- 5.
- 6.
- 7.
- 8.
- 9.
- 10.
- 11.
- 12.
- 13.
- 14.
- 15.
- 16.
References
Jeon, J., Croft, W.B., Lee, J.H., Park, S.: A framework to predict the quality of answers with non-textual features. In: Proceedings of the 29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 228–235. ACM (2006)
Shah, C., Pomerantz, J.: Evaluating and predicting answer quality in community QA. In: Proceedings of the 33rd International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 411–418. ACM (2010)
Bian, J., Liu, Y., Zhou, D., Agichtein, E., Zha, H.: Learning to recognize reliable users and content in social media with coupled mutual reinforcement. In: Proceedings of the 18th International Conference on World Wide Web, pp. 51–60. ACM (2009)
Ko, J., Nyberg, E., Si, L.: A probabilistic graphical model for joint answer ranking in question answering. In: Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 343–350. ACM (2007)
Blooma, M.J., Chua, A.Y., Goh, D.H.-L.: A predictive framework for retrieving the best answer. In: Proceedings of the 2008 ACM Symposium on Applied Computing, pp. 1107–1111. ACM (2008)
Harper, F.M., Raban, D., Rafaeli, S., Konstan, J.A.: Predictors of answer quality in online Q&A sites. In: Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, pp. 865–874. ACM (2008)
Suryanto, M.A., Lim, E.P., Sun, A., Chiang, R.H.: Quality-aware collaborative question answering: methods and evaluation. In: Proceedings of the Second ACM International Conference on Web Search and Data Mining, pp. 142–151. ACM (2009)
Le, L.T., Shah, C., Choi, E.: Evaluating the quality of educational answers in community question-answering. In: 2016 IEEE/ACM Joint Conference on Digital Libraries (JCDL), pp. 129–138. IEEE (2016)
Liu, H., Huang, J., An, C., Fu, X.: Answer quality prediction joint textual and non-textual features. In: 13th Web Information Systems and Applications Conference, 2016, pp. 144–148. IEEE (2016)
Suggu, S.P., Goutham, K.N., Chinnakotla, M.K., Shrivastava, M.: Deep feature fusion network for answer quality prediction in community question answering. arXiv preprint arXiv:1606.07103 (2016)
Luo, M., Nie, F., Chang, X., Yang, Y., Hauptmann, A.G., Zheng, Q.: Adaptive unsupervised feature selection with structure regularization. IEEE Trans. Neural Netw. Learn. Syst. PP(99), 1–13 (2017)
Luo, M., Chang, X., Yang, Y., Nie, L., Hauptmann, A.G., Zheng, Q.: Simple to complex cross-modal learning to rank, arXiv preprint arXiv:1702.01229 (2017)
Li, B., Jin, T., Lyu, M.R., King, I., Mak, B.: Analyzing and predicting question quality in community question answering services. In: Proceedings of the 21st International Conference on World Wide Web, pp. 775–782. ACM (2012)
Agichtein, E., Castillo, C., Donato, D., Gionis, A., Mishne, G.: Finding high-quality content in social media. In: Proceedings of the 2008 International Conference on Web Search and Data Mining, pp. 183–194. ACM (2008)
Ravi, S., Pang, B., Rastogi, V., Kumar, R.: Great question! question quality in community Q&A. In: ICWSM, vol. 14, pp. 426–435 (2014)
Baltadzhieva, A., Chrupala, G., Angelova, G., Bontcheva, K., Mitkov, R.: Predicting the quality of questions on stackoverflow. In: RANLP, 2015, pp. 32–40 (2015)
Acknowledgments
This work is sponsored by The Fundamental Theory and Applications of Big Data with Knowledge Engineering under the National Key Research and Development Program of China with grant number 2016YFB1000903, Ministry of Education Innovation Research Team No. IRT 17R86, Innovative Research Group of the National Natural Science Foundation of China (61721002); National Science Foundation of China under Grant Nos. 61672419, 61532004, 61532015, the MOE Research Program for Online Education under Grant No. 2016YB166.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Zheng, Y. et al. (2017). Quality Prediction of Newly Proposed Questions in CQA by Leveraging Weakly Supervised Learning. In: Cong, G., Peng, WC., Zhang, W., Li, C., Sun, A. (eds) Advanced Data Mining and Applications. ADMA 2017. Lecture Notes in Computer Science(), vol 10604. Springer, Cham. https://doi.org/10.1007/978-3-319-69179-4_46
Download citation
DOI: https://doi.org/10.1007/978-3-319-69179-4_46
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-69178-7
Online ISBN: 978-3-319-69179-4
eBook Packages: Computer ScienceComputer Science (R0)