Abstract
This paper designs and implements a Chinese spam review detection system based on rules. The main rules include the following three types: (1) Calculating the similarity between two comments, i.e., if the similarity is larger than a specified threshold, the two comments are viewed as review spam; (2) Calculating the correlation degree between comments and the product, i.e., if the degree is smaller than a specified threshold, the comment is viewed as review spam. (3) Detecting whether stuffing exists in the keyword, meta field or keywords of the web page. If they exist, the comments are viewed as review spam. In addition, we proposed a Naive Bayes Classifier in the review detection system. We selected 500 comments randomly and signed the comments true or false manually. Then 400 comments were selected to train and the other 100 comments were used to test. Finally, precision of the algorithm was attained. Experimental results show that the operation effect of our system is satisfactory.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
China Internet Network Information Center. 2009 annual report of China online shopping market, 38–42 (2009)
Lim, E.-P., Nguyen, V.-A., Jindal, N., Liu, B., Lauw, H.W.: Detecting product review spammers using rating behaviors. In: Proceeding CIKM 2010 Proceedings of the 19th ACM International Conference on Information and Knowledge Management, pp. 939–948 (2010)
Wu, G., Greene, D., Smyth, B., Cunningham, P.: Distortion as a validation criterion in the identification of suspicious reviews. Technical Report UCD-CSI-2010-04, University College Dublin (2010)
Jindal, N., Liu, B., Lim, E.: Finding Unusual Review Patterns Using Unexpected Rules. In: CIKM 2010: 19th ACM International Conference on Information and Knowledge Management, Toronto, Ontario (2010)
Li, F., Liu, N., Jin, H., Zhao, K., Yang, Q., Zhu, X.: Incorporate Reviewer and Product Information for Review Rating Prediction. In: Proceedings of the Twenty-Second International Joint Conference on Artificial Intelligence, IJCAI 2011 (2011)
Wang, G., Xie, S., Liu, B., Yu, P.S.: Identify Online Store Review Spammers via Social Review Graph. Journal of ACM Transactions on Intelligent Systems and Technology (TIST)Â 3(4) (September 2012)
Xia, H., Liu, J.: Credibility Analysis of Comments of Virtual Community Based on Text Similarity Computing. Journal of Modern Information 31(9), 33–37 (2011)
Yang, F., Li, J.: Survey on research of opinion spam in user-generated-content. Application Research of Computers 28(10), 3601–3605 (2011)
ICTCLAS, http://ictclas.org/
Shi, C., Xu, C., Yang, X.: Study of TFIDF algorithm. Journal of Computer Application 29(6), 167–180 (2009)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Xu, X., Han, T., Xu, Z., Wang, Y., Liu, Y. (2013). Design and Implementation of Chinese Spam Review Detection System. In: Liu, P., Su, Q. (eds) Chinese Lexical Semantics. CLSW 2013. Lecture Notes in Computer Science(), vol 8229. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-45185-0_53
Download citation
DOI: https://doi.org/10.1007/978-3-642-45185-0_53
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-45184-3
Online ISBN: 978-3-642-45185-0
eBook Packages: Computer ScienceComputer Science (R0)