ABSTRACT
In the era of e-commerce, customers are trust on on-line review. Reviews help them to make right decisions to buy a product or hire a service. Spammer are writes the fake review to promote or demote the target products. Reviews are spam or non-spam; this is the biggest problem for E-commerce business. Moreover, the spam detection problem is complex task. Spammers are inventing new methods to writing spam reviews. It cannot be recognized easily. In this paper, we have extracted some new writing style features like, attractive text ratio and function word ratio and corleone-based features like, lexical validity and text like fraction and compared with already existing features. We have applied support vector machine (SVM), logistic regression, random forest, Jrip, functional tree, Naive Bayes, J48, PART algorithms for classification of review as a spam or non-spam. The SVM, random forest and logistic regression gives the 68% accuracy.
- S. Afroz, M. Brennan, and R. Greenstadt. Detecting hoaxes, frauds, and deception in writing style online. In Security and Privacy (SP), 2012 IEEE Symposium on, pages 461--475. IEEE, 2012. Google ScholarDigital Library
- S. Banerjee and A. Y. Chua. Applauses in hotel reviews: Genuine or deceptive? In Science and Information Conference (SAI), 2014, pages 938--942. IEEE, 2014.Google ScholarCross Ref
- S. Banerjee and A. Y. Chua. A linguistic framework to distinguish between genuine and deceptive online reviews. In Proceedings of the International Conference on Internet Computing and Web Services, 2014.Google Scholar
- S. Banerjee and A. Y. Chua. A study of manipulative and authentic negative reviews. In Proceedings of the 8th International Conference on Ubiquitous Information Management and Communication, page 76. ACM, 2014. Google ScholarDigital Library
- R. K. Dewang and A. Singh. Identification of fake reviews using new set of lexical and syntactic features. In Proceedings of the Sixth International Conference on Computer and Communication Technology 2015, pages 115--119. ACM, 2015. Google ScholarDigital Library
- M. Hall, E. Frank, G. Holmes, B. Pfahringer, P. Reutemann, and I. H. Witten. The weka data mining software: an update. ACM SIGKDD explorations newsletter, 11(1):10--18, 2009. Google ScholarDigital Library
- A. Heydari, M. ali Tavakoli, N. Salim, and Z. Heydari. Detection of review spam: A survey. Expert Systems with Applications, 42(7):3634--3642, 2015. Google ScholarDigital Library
- N. Hu, I. Bose, N. S. Koh, and L. Liu. Manipulation of online reviews: An analysis of ratings, readability, and sentiments. Decision Support Systems, 52(3):674--684, 2012. Google ScholarDigital Library
- N. Jindal and B. Liu. Analyzing and detecting review spam. In Data Mining, 2007. ICDM 2007. Seventh IEEE International Conference on, pages 547--552. IEEE, 2007. Google ScholarDigital Library
- N. Jindal and B. Liu. Opinion spam and analysis. In Proceedings of the 2008 International Conference on Web Search and Data Mining, pages 219--230. ACM, 2008. Google ScholarDigital Library
- R. Y. Lau, S. S. Liao, and K. Xu. An empirical study of online consumer review spam: A design science approach. In ICIS, volume 2010, pages 103--123, 2010.Google Scholar
- F. Li, M. Huang, Y. Yang, and X. Zhu. Learning to identify review spam. In IJCAI Proceedings-International Joint Conference on Artificial Intelligence, volume 22, page 2488, 2011. Google ScholarDigital Library
- Y. Lin, T. Zhu, H. Wu, J. Zhang, X. Wang, and A. Zhou. Towards online anti-opinion spam: Spotting fake reviews from the review sequence. In Advances in Social Networks Analysis and Mining (ASONAM), 2014 IEEE/ACM International Conference on, pages 261--264. IEEE, 2014.Google ScholarCross Ref
- B. Liu. Sentiment analysis and opinion mining (synthesis lectures on human language technologies). Morgan & Claypool Publishers, 2012.Google Scholar
- M. Ott, C. Cardie, and J. T. Hancock. Negative deceptive opinion spam. In HLT-NAACL, pages 497--501, 2013.Google Scholar
- M. Ott, Y. Choi, C. Cardie, and J. T. Hancock. Finding deceptive opinion spam by any stretch of the imagination. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies-Volume 1, pages 309--319. Association for Computational Linguistics, 2011. Google ScholarDigital Library
- J. Piskorski, M. Sydow, and D. Weiss. Exploring linguistic features for web spam detection: a preliminary study. In Proceedings of the 4th international workshop on Adversarial information retrieval on the web, pages 25--28. ACM, 2008. Google ScholarDigital Library
- S. Shojaee, M. A. A. Murad, A. Bin Azman, N. M. Sharef, and S. Nadali. Detecting deceptive reviews using lexical and syntactic features. In Intelligent Systems Design and Applications (ISDA), 2013 13th International Conference on, pages 53--58. IEEE, 2013.Google ScholarCross Ref
- K. Toutanova, D. Klein, C. D. Manning, and Y. Singer. Feature-rich part-of-speech tagging with a cyclic dependency network. In Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology-Volume 1, pages 173--180. Association for Computational Linguistics, 2003. Google ScholarDigital Library
- R. Zheng, J. Li, H. Chen, and Z. Huang. A framework for authorship identification of online messages: Writing-style features and classification techniques. Journal of the American Society for Information Science and Technology, 57(3):378--393, 2006. Google ScholarDigital Library
- Finding of Review Spam through "Corleone, Review Genre, Writing Style and Review Text Detail Features"
Recommendations
Detecting group review spam
WWW '11: Proceedings of the 20th international conference companion on World wide webIt is well-known that many online reviews are not written by genuine users of products, but by spammers who write fake reviews to promote or demote some target products. Although some existing works have been done to detect fake reviews and individual ...
Detection of review spam
We have extracted all types of data that can be used in spam detection techniques.We have reviewed state of the art literature in the area of detection of spam reviews.In this research, we have categorized and classified spam detection methods and ...
Review spam detection
WWW '07: Proceedings of the 16th international conference on World Wide WebIt is now a common practice for e-commerce Web sites to enable their customers to write reviews of products that they have purchased. Such reviews provide valuable sources of information on these products. They are used by potential customers to find ...
Comments