Skip to main content
Log in

Explainable knowledge integrated sequence model for detecting fake online reviews

  • Published:
Applied Intelligence Aims and scope Submit manuscript

Abstract

Online reviews have a great influence on customers’ shopping decisions. However, countless fake reviews are posted on shopping platforms, which seriously interfere with customers’ shopping decisions and pollute the fair e-commerce environment. In this paper, we propose EKI-SM, an explainable knowledge integrated sequence model, to detect fake reviews. Compared with existing models, the EKI-SM displays four advantages: 1) It integrates a set of important knowledge and learns high-dimensional word embedding from reviews to guide fake review detection tasks; in addition, this knowledge explains the results of the model. 2) It learns a continuous sequence model from discrete observations with high-dimensional features, which helps to learn more discriminating fake review features. 3) It fuses the one-dimensional convolutional network, the long short-term memory network, and the residual connector to capture the local and global dependency of the sequence and make the prediction model more robust. 4) Inspired by the idea of interpretable deep learning, we explain the EKI-SM and find the important critical words for detecting fake online reviews, which derive some interesting insights. Experiments on actual fake review datasets demonstrate that the EKI-SM achieves higher accuracy in fake review detection than that of other state-of-the-art methods; indeed, it benefits from the integration of knowledge and multi-modal features.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8

Similar content being viewed by others

Code Availability

The code uses python language programming to run on PyCharm, the code is available.

References

  1. Yuan W, Wang H, Yu X, Liu N, Li Z (2020) Attention-based context-aware sequential recommendation model. Inf Sci 510:122–134

    Article  Google Scholar 

  2. Hu B, Wang H, Yu X, Yuan W, He T (2019) Sparse network embedding for community detection and sign prediction in signed social networks. Journal of Ambient Intelligence and Humanized Computing 10(1):175–186

    Article  Google Scholar 

  3. Jindal N, Liu B (2007) Analyzing and detecting review spam. In: Seventh IEEE International Conference on Data Mining (ICDM 2007). IEEE, pp 547–552

  4. Fang Y, Wang H, Zhao L, Yu F, Wang C (2020) Dynamic knowledge graph based fake-review detection. Appl Intell 50(12):4281–4295

    Article  Google Scholar 

  5. Feng S, Zhang H, Cao J, Yao Y (2019) Merging user social network into the random walk model for better group recommendation. Appl Intell 49(6):2046–2058

    Article  Google Scholar 

  6. Jindal N, Liu B (2008) Opinion spam and analysis. In: Proceedings of the 2008 International Conference on Web Search and Data Mining, pp 219–230

  7. Feng S, Zhang H, Wang L, Liu L, Xu Y (2019) Detecting the latent associations hidden in multi-source information for better group recommendation. Knowledge-Based Systems 171:56–68

    Article  Google Scholar 

  8. Li Y, Lin Y, Zhang J, Li J, Zhao L (2015) Highlighting the fake reviews in review sequence with the suspicious contents and behaviours. Journal Of Information & Computational Science 12(4):1615–1627

    Article  Google Scholar 

  9. Liu R, Wang H, Yu X (2018) Shared-nearest-neighbor-based clustering by fast search and find of density peaks. Inf Sci 450:200–226

    Article  MathSciNet  Google Scholar 

  10. Wang T, Liu L, Liu N, Zhang H, Zhang L, Feng S (2020) A multi-label text classification method via dynamic semantic representation model and deep neural network. Appl Intell 50 (8):2339–2351

    Article  Google Scholar 

  11. Hinton GE, Salakhutdinov RR (2006) Reducing the dimensionality of data with neural networks. Science 313(5786):504– 507

    Article  MathSciNet  MATH  Google Scholar 

  12. Blum A, Mitchell T (1998) Combining labeled and unlabeled data with co-training. In: Proceedings of the eleventh annual conference on computational learning theory, pp 92–100

  13. Li J, Ott M, Cardie C, Hovy E (2014) Towards a general rule for identifying deceptive opinion spam. In: Proceedings of the 52nd annual meeting of the association for computational linguistics (Volume 1: Long Papers), pp 1566–1576

  14. Dewang RK, Singh AK (2015) Identification of fake reviews using new set of lexical and syntactic features. In: Proceedings of the sixth international conference on computer and communication technology 2015, pp 115–119

  15. Li L, Qin B, Ren W, Liu T (2017) Document representation and feature combination for deceptive spam review detection. Neurocomputing 254:33–41

    Article  Google Scholar 

  16. Ren Y, Ji D (2017) Neural networks for deceptive opinion spam detection: an empirical study. Inf Sci 385:213–224

    Article  Google Scholar 

  17. Barushka A, Hajek P (2019) Review spam detection using word embeddings and deep neural networks. In: IFIP International conference on artificial intelligence applications and innovations. Springer, pp 340–350

  18. Hajek P, Barushka A, Munk M (2020) Fake consumer review detection using deep neural networks integrating word embeddings and emotion mining. Neural Comput & Applic 32(23):17259–17274

    Article  Google Scholar 

  19. Zeng Z-Y, Lin J-J, Chen M-S, Chen M-H, Lan Y-Q, Liu J-L (2019) A review structure based ensemble model for deceptive review spam. Information 10(7):243

    Article  Google Scholar 

  20. Simonyan K, Zisserman A (2014) Very deep convolutional networks for large-scale image recognition. arXiv:1409.1556

  21. Van der Maaten L, Hinton G (2008) Visualizing data using t-sne J Mach Learn Res 9(11)

  22. Gade K, Geyik S, Kenthapadi K, Mithal V, Taly A (2020) Explainable ai in industry: practical challenges and lessons learned. In: Companion proceedings of the Web conference 2020, pp 303–304

  23. Friedman JH (2001) Greedy function approximation: a gradient boosting machine. Annals of statistics

  24. Ribeiro MT, Singh S, Guestrin C (2016) Why should i trust you? explaining the predictions of any classifier. In: Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp 1135–1144

  25. Tiddi I et al (2020) Directions for explainable knowledge-enabled systems. Knowledge Graphs for eXplainable Artificial intelligence: Foundations Applications and Challenges 47:245

    Google Scholar 

  26. Bizer C, Primpeli A, Peeters R (2019) Using the semantic web as a source of training data. Datenbank-Spektrum 19(2):127–135

    Article  Google Scholar 

  27. Ahmed H, Traore I, Saad S (2018) Detecting opinion spams and fake news using text classification. Security and Privacy 1(1):9

    Article  Google Scholar 

  28. Martineau JC, Finin T (2009) Delta tfidf: an improved feature space for sentiment analysis. In: Third international AAAI conference on weblogs and social media

  29. Brown PF, Della Pietra VJ, Desouza PV, Lai JC, Mercer RL (1992) Class-based n-gram models of natural language. Computational linguistics 18(4):467–480

    Google Scholar 

  30. Pennington J, Socher R, Manning CD (2014) Glove: Global vectors for word representation. In: Proceedings of the 2014 conference on empirical methods in natural language processing (EMNLP), pp 1532–1543

  31. Ott M, Choi Y, Cardie C, Hancock JT (2011) Finding deceptive opinion spam by any stretch of the imagination. arXiv:11070.4557

  32. Li FH, Huang M, Yang Y, Zhu X (2011) Learning to identify review spam. In: Twenty-second international joint conference on artificial intelligence

  33. Cao N, Ji S, Chiu DK, He M, Sun X (2020) A deceptive review detection framework: Combination of coarse and fine-grained features. Expert Syst Appl 156:113465

    Article  Google Scholar 

  34. Kennedy S, Walsh N, Sloka K, Foster J, McCarren A (2020) Fact or factitious? contextualized opinion spam detection. arXiv:2010.15296

  35. Jiang C, Zhang X, Jin A (2020) Detecting online fake reviews via hierarchical neural networks and multivariate features. In: International conference on neural information processing. Springer, pp 730–742

  36. Neisari A, Rueda L, Saad S (2021) Spam review detection using self-organizing maps and convolutional neural networks. Computers & Security 106:102274

    Article  Google Scholar 

Download references

Acknowledgments

This work is supported by the National Nature Science Foundation of China (No.61672329, No.62072290,No. 81871508, No. 61773246); Major Program of Shandong Province Natural Science Foundation (ZR2019ZD04, No. ZR2018ZB0419); Shandong Provincial Project of Education Scientific Plan (No.SDYY18058).

Author information

Authors and Affiliations

Authors

Contributions

All authors contributed to the study conception and design. Material preparation, data collection and analysis were performed by Han Shu and Wang Hong. The first draft of the manuscript was written by Han Shu and all authors commented on previous versions of the manuscript. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Hong Wang.

Ethics declarations

Conflict of Interests

The authors declared that they have no conflicts of interest to this work.

Additional information

Availability of data and materials

The data set can be obtained at the link below http://myleott.com/op-spam.html

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Han, S., Wang, H., Li, W. et al. Explainable knowledge integrated sequence model for detecting fake online reviews. Appl Intell 53, 6953–6965 (2023). https://doi.org/10.1007/s10489-022-03822-8

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10489-022-03822-8

Keywords

Navigation