Skip to main content

Using Reviewer Information to Improve Performance of Low-Quality Review Detection

  • Conference paper
  • First Online:
Computational Linguistics and Intelligent Text Processing (CICLing 2018)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 13397))

  • 184 Accesses

Abstract

The drastic increase of user-generated contents has exhibited a rich source for mining opinions. Unfortunately, the quality of user-generated content varies significantly from excellent to meaningless, which by general estimation, causes a great deal of difficulty in mining-related applications. In the field of low-quality review detection, many previous approaches have individually detected low-quality reviews by using the intrinsic features of the review. However, no systematic study measuring the significance of reviewer information for detecting low-quality reviews has been previously done. In this paper, the importance of reviewer information when predicting review quality is studied and how to exploit it to build low-quality review detection models is determined. The experimental results on two different domains show that reviewer information does matter when modeling and predicting the quality of reviews. It is also shown that significant performance improvements can be achieved if the reviewer information is integrated with the intrinsic features of the reviews. These findings are of the essence in solving the low-quality review detection problem and in developing review-based opinion mining applications.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 69.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 89.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Liu, B., Hu, M., Cheng, J.: Opinion observer: analyzing and comparing opinions on the web. In: Proceedings of the 14th International World Wide Web Conference (WWW-2005), 10–14 May 2005, in Chiba, Japan, pp. 342–351 (2005)

    Google Scholar 

  2. Popescu, A.-M., Etzioni, O.: Extracting product features and opinions from reviews. In: Proceedings of Conference on Empirical Methods in Natural Language Processing (EMNLP 2005), pp. 339–346 (2005)

    Google Scholar 

  3. Turney, P.D.: Thumbs up or thumbs down? semantic orientation applied to unsupervised classification of reviews. In: Proceedings of the Meeting of the Association for Computational Linguistics (ACL 2002), pp. 417–424 (2002)

    Google Scholar 

  4. Pang, B., Lee, L., Vaithyanathan, S.: . Thumbs up? sentiment classification using machine learning techniques. In: Proceedings of Conference on Empirical Methods in Natural Language Processing (EMNLP 2002), pp. 79–86 (2002)

    Google Scholar 

  5. Kim, S.M., Pantel, P., Chklovski, T., Pennacchiotti, M.: Automatically assessing review helpfulness. In: Proceedings of Conference on Empirical Methods in Natural Language Processing (EMNLP 2006), pp. 423–430 (2006)

    Google Scholar 

  6. Liu, J., Cao, Y., Lin, C.Y., Huang, Y., Zhou, M.: Low-quality product review detection in opinion summarization. In: Proceedings of Conference on Empirical Methods in Natural Language Processing (EMNLP 2007), pp. 334–342 (2007)

    Google Scholar 

  7. Zhang, Z., Balaji, V.: Utility Scoring of Product Reviews. In: Proceedings of the 2006 ACM CIKM International Conference on Information and Knowledge Management (CIKM 2006), pp. 51–57 (2006)

    Google Scholar 

  8. Agichtein, E., Castillo, C., Donato, D., Gionis, A., Mishne, G.: Finding high-quality content in Social Media. In: Proceedings of the First ACM International Conference on Web Search and Data Mining (WSDM 2008), pp. 183–194 (2008)

    Google Scholar 

  9. Jeon, J., Bruce Croft, W., Ho Lee, J., Park, S.: A framework to predict the quality of answers with nontextual features. In: Proceedings of 29th Annual International ACM SIGIR Conference on Research & Development on Information Retrieval (SIGIR 2006), pp. 228–235 (2006)

    Google Scholar 

  10. Zhou, Y., Bruce Croft, W.: Document quality models for Web Ad Hoc retrieval. In: Proceedings of the 2005 ACM CIKM International Conference on Information and Knowledge Management (CIKM 2005), pp. 331–332 (2005)

    Google Scholar 

  11. Ghose, A., Ipeirotis, P.G.: Designing novel review ranking systems: predicting the usefulness and impact of reviews. In: Proceedings of the Ninth International Conference on Electronic Commerce (ICEC 2007), pp. 303–310 (2007)

    Google Scholar 

  12. Miao, Q., Li, Q., Dai, R.: An integration strategy for mining product features and opinions. In: Proceedings of the 2008 ACM CIKM International Conference on Information and Knowledge Management (CIKM 2008), pp. 1369–1370 (2008)

    Google Scholar 

  13. Weimer, M., Gurevych, I.: Predicting the perceived quality of web forum posts. In: Proceedings of the Conference on Recent Advances in Natural Language Processing (RANLP 2007), pp. 643–648 (2007)

    Google Scholar 

  14. Chawla, N.V., Bowyer, K.W., Hall, L.O., Philip Kegelmeyer, W.: SMOTE: synthetic minority over-sampling technique. J. Artif. Intell. Res. 16, 321–357 (2002)

    Google Scholar 

  15. Yu, P.S., Li, X., Liu, B.: Adding the temporal dimension to search - a case study in publication search. In: Proceedings of the 2005 IEEE/WIC/ACM Conferences on Web Intelligence (WI 2005), pp. 543–549 (2005)

    Google Scholar 

  16. Zhang, Z.: Weighing stars: aggregating online product reviews for intelligent e-commerce applications. Intell. Syst. 23(5), 42–49 (2008)

    Google Scholar 

  17. Pang, B., Lee, L.: Opinion mining and sentiment analysis. Found. Trends Inf. Retrieval, 2(1–2), 1–135 (2008). https://doi.org/10.1561/1500000001

  18. Wilson, T., Wiebe, J., Hwa, R.: Just how mad are you? finding strong and weak opinion clauses. In: Proceedings of the Nineteenth National Conference on Artificial Intelligence (AAAI-2004), pp.761–769 (2004)

    Google Scholar 

  19. Kim, S., Hovy, E.: Determining the sentiment of opinions. In Proceedings of the International Conference on Computational Linguistics (COLING 2004), pp. 1367–1373 (2004)

    Google Scholar 

  20. Hu, M., Liu, B.: Mining opinion features in customer reviews. In Proceeding of the 19th National Conference on Artificial Intelligence (AAAI 2004), pp. 755–760 (2004)

    Google Scholar 

  21. Hu, M., Liu, B.: Mining and summarizing customer reviews. In: Proceeding of ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD 2004), pp. 168–177 (2004)

    Google Scholar 

  22. Carenini, G., Ng, R.T., Zwart, E.: Extracting knowledge from evaluative text. In: Proceedings of Third International Conference on Knowledge Capture (K-CAP 2005), pp. 11–18 (2005)

    Google Scholar 

  23. Su, Q., Xiang, K., Wang, H., Sun, B., Yu, S.: Using pointwise mutual information to identify implicit features in customer reviews. In: Matsumoto, Y., Sproat, R.W., Wong, K.-F., Zhang, M. (eds.) ICCPOL 2006. LNCS (LNAI), vol. 4285, pp. 22–30. Springer, Heidelberg (2006). https://doi.org/10.1007/11940098_3

    Chapter  Google Scholar 

  24. Shi, B., Chang, K.: Mining Chinese reviews. In: Proceedings of the Sixth IEEE International Conference on Data Mining (ICDM 2006), pp. 585–589 (2006)

    Google Scholar 

  25. Feldman, R., Fresko, M., et al.: Extracting product comparisons from discussion boards. In: Proceedings of the Seventh IEEE International Conference on Data Mining (ICDM 2007), pp. 469–474 (2007)

    Google Scholar 

  26. Ghani, R., Probst, K., et al.: Text mining for product attribute extraction. ACM SIGKDD Explor. Newslett. 8(1), 41–48 (2006)

    Article  Google Scholar 

  27. Wang, B., Wang, H.: Bootstrapping both product properties and opinion words from Chinese reviews with cross-training. In: Processings of 2007 IEEE/WIC/ACM International Conference on Web Intelligence, pp. 259–262 (2007)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Qingliang Miao .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2023 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Miao, Q., Hu, C., Xu, F. (2023). Using Reviewer Information to Improve Performance of Low-Quality Review Detection. In: Gelbukh, A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2018. Lecture Notes in Computer Science, vol 13397. Springer, Cham. https://doi.org/10.1007/978-3-031-23804-8_30

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-23804-8_30

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-23803-1

  • Online ISBN: 978-3-031-23804-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics