Abstract
The drastic increase of user-generated contents has exhibited a rich source for mining opinions. Unfortunately, the quality of user-generated content varies significantly from excellent to meaningless, which by general estimation, causes a great deal of difficulty in mining-related applications. In the field of low-quality review detection, many previous approaches have individually detected low-quality reviews by using the intrinsic features of the review. However, no systematic study measuring the significance of reviewer information for detecting low-quality reviews has been previously done. In this paper, the importance of reviewer information when predicting review quality is studied and how to exploit it to build low-quality review detection models is determined. The experimental results on two different domains show that reviewer information does matter when modeling and predicting the quality of reviews. It is also shown that significant performance improvements can be achieved if the reviewer information is integrated with the intrinsic features of the reviews. These findings are of the essence in solving the low-quality review detection problem and in developing review-based opinion mining applications.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Liu, B., Hu, M., Cheng, J.: Opinion observer: analyzing and comparing opinions on the web. In: Proceedings of the 14th International World Wide Web Conference (WWW-2005), 10–14 May 2005, in Chiba, Japan, pp. 342–351 (2005)
Popescu, A.-M., Etzioni, O.: Extracting product features and opinions from reviews. In: Proceedings of Conference on Empirical Methods in Natural Language Processing (EMNLP 2005), pp. 339–346 (2005)
Turney, P.D.: Thumbs up or thumbs down? semantic orientation applied to unsupervised classification of reviews. In: Proceedings of the Meeting of the Association for Computational Linguistics (ACL 2002), pp. 417–424 (2002)
Pang, B., Lee, L., Vaithyanathan, S.: . Thumbs up? sentiment classification using machine learning techniques. In: Proceedings of Conference on Empirical Methods in Natural Language Processing (EMNLP 2002), pp. 79–86 (2002)
Kim, S.M., Pantel, P., Chklovski, T., Pennacchiotti, M.: Automatically assessing review helpfulness. In: Proceedings of Conference on Empirical Methods in Natural Language Processing (EMNLP 2006), pp. 423–430 (2006)
Liu, J., Cao, Y., Lin, C.Y., Huang, Y., Zhou, M.: Low-quality product review detection in opinion summarization. In: Proceedings of Conference on Empirical Methods in Natural Language Processing (EMNLP 2007), pp. 334–342 (2007)
Zhang, Z., Balaji, V.: Utility Scoring of Product Reviews. In: Proceedings of the 2006 ACM CIKM International Conference on Information and Knowledge Management (CIKM 2006), pp. 51–57 (2006)
Agichtein, E., Castillo, C., Donato, D., Gionis, A., Mishne, G.: Finding high-quality content in Social Media. In: Proceedings of the First ACM International Conference on Web Search and Data Mining (WSDM 2008), pp. 183–194 (2008)
Jeon, J., Bruce Croft, W., Ho Lee, J., Park, S.: A framework to predict the quality of answers with nontextual features. In: Proceedings of 29th Annual International ACM SIGIR Conference on Research & Development on Information Retrieval (SIGIR 2006), pp. 228–235 (2006)
Zhou, Y., Bruce Croft, W.: Document quality models for Web Ad Hoc retrieval. In: Proceedings of the 2005 ACM CIKM International Conference on Information and Knowledge Management (CIKM 2005), pp. 331–332 (2005)
Ghose, A., Ipeirotis, P.G.: Designing novel review ranking systems: predicting the usefulness and impact of reviews. In: Proceedings of the Ninth International Conference on Electronic Commerce (ICEC 2007), pp. 303–310 (2007)
Miao, Q., Li, Q., Dai, R.: An integration strategy for mining product features and opinions. In: Proceedings of the 2008 ACM CIKM International Conference on Information and Knowledge Management (CIKM 2008), pp. 1369–1370 (2008)
Weimer, M., Gurevych, I.: Predicting the perceived quality of web forum posts. In: Proceedings of the Conference on Recent Advances in Natural Language Processing (RANLP 2007), pp. 643–648 (2007)
Chawla, N.V., Bowyer, K.W., Hall, L.O., Philip Kegelmeyer, W.: SMOTE: synthetic minority over-sampling technique. J. Artif. Intell. Res. 16, 321–357 (2002)
Yu, P.S., Li, X., Liu, B.: Adding the temporal dimension to search - a case study in publication search. In: Proceedings of the 2005 IEEE/WIC/ACM Conferences on Web Intelligence (WI 2005), pp. 543–549 (2005)
Zhang, Z.: Weighing stars: aggregating online product reviews for intelligent e-commerce applications. Intell. Syst. 23(5), 42–49 (2008)
Pang, B., Lee, L.: Opinion mining and sentiment analysis. Found. Trends Inf. Retrieval, 2(1–2), 1–135 (2008). https://doi.org/10.1561/1500000001
Wilson, T., Wiebe, J., Hwa, R.: Just how mad are you? finding strong and weak opinion clauses. In: Proceedings of the Nineteenth National Conference on Artificial Intelligence (AAAI-2004), pp.761–769 (2004)
Kim, S., Hovy, E.: Determining the sentiment of opinions. In Proceedings of the International Conference on Computational Linguistics (COLING 2004), pp. 1367–1373 (2004)
Hu, M., Liu, B.: Mining opinion features in customer reviews. In Proceeding of the 19th National Conference on Artificial Intelligence (AAAI 2004), pp. 755–760 (2004)
Hu, M., Liu, B.: Mining and summarizing customer reviews. In: Proceeding of ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD 2004), pp. 168–177 (2004)
Carenini, G., Ng, R.T., Zwart, E.: Extracting knowledge from evaluative text. In: Proceedings of Third International Conference on Knowledge Capture (K-CAP 2005), pp. 11–18 (2005)
Su, Q., Xiang, K., Wang, H., Sun, B., Yu, S.: Using pointwise mutual information to identify implicit features in customer reviews. In: Matsumoto, Y., Sproat, R.W., Wong, K.-F., Zhang, M. (eds.) ICCPOL 2006. LNCS (LNAI), vol. 4285, pp. 22–30. Springer, Heidelberg (2006). https://doi.org/10.1007/11940098_3
Shi, B., Chang, K.: Mining Chinese reviews. In: Proceedings of the Sixth IEEE International Conference on Data Mining (ICDM 2006), pp. 585–589 (2006)
Feldman, R., Fresko, M., et al.: Extracting product comparisons from discussion boards. In: Proceedings of the Seventh IEEE International Conference on Data Mining (ICDM 2007), pp. 469–474 (2007)
Ghani, R., Probst, K., et al.: Text mining for product attribute extraction. ACM SIGKDD Explor. Newslett. 8(1), 41–48 (2006)
Wang, B., Wang, H.: Bootstrapping both product properties and opinion words from Chinese reviews with cross-training. In: Processings of 2007 IEEE/WIC/ACM International Conference on Web Intelligence, pp. 259–262 (2007)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 Springer Nature Switzerland AG
About this paper
Cite this paper
Miao, Q., Hu, C., Xu, F. (2023). Using Reviewer Information to Improve Performance of Low-Quality Review Detection. In: Gelbukh, A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2018. Lecture Notes in Computer Science, vol 13397. Springer, Cham. https://doi.org/10.1007/978-3-031-23804-8_30
Download citation
DOI: https://doi.org/10.1007/978-3-031-23804-8_30
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-23803-1
Online ISBN: 978-3-031-23804-8
eBook Packages: Computer ScienceComputer Science (R0)