Abstract
Online reviews have a great influence on customers’ shopping decisions. However, countless fake reviews are posted on shopping platforms, which seriously interfere with customers’ shopping decisions and pollute the fair e-commerce environment. In this paper, we propose EKI-SM, an explainable knowledge integrated sequence model, to detect fake reviews. Compared with existing models, the EKI-SM displays four advantages: 1) It integrates a set of important knowledge and learns high-dimensional word embedding from reviews to guide fake review detection tasks; in addition, this knowledge explains the results of the model. 2) It learns a continuous sequence model from discrete observations with high-dimensional features, which helps to learn more discriminating fake review features. 3) It fuses the one-dimensional convolutional network, the long short-term memory network, and the residual connector to capture the local and global dependency of the sequence and make the prediction model more robust. 4) Inspired by the idea of interpretable deep learning, we explain the EKI-SM and find the important critical words for detecting fake online reviews, which derive some interesting insights. Experiments on actual fake review datasets demonstrate that the EKI-SM achieves higher accuracy in fake review detection than that of other state-of-the-art methods; indeed, it benefits from the integration of knowledge and multi-modal features.








Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Code Availability
The code uses python language programming to run on PyCharm, the code is available.
References
Yuan W, Wang H, Yu X, Liu N, Li Z (2020) Attention-based context-aware sequential recommendation model. Inf Sci 510:122–134
Hu B, Wang H, Yu X, Yuan W, He T (2019) Sparse network embedding for community detection and sign prediction in signed social networks. Journal of Ambient Intelligence and Humanized Computing 10(1):175–186
Jindal N, Liu B (2007) Analyzing and detecting review spam. In: Seventh IEEE International Conference on Data Mining (ICDM 2007). IEEE, pp 547–552
Fang Y, Wang H, Zhao L, Yu F, Wang C (2020) Dynamic knowledge graph based fake-review detection. Appl Intell 50(12):4281–4295
Feng S, Zhang H, Cao J, Yao Y (2019) Merging user social network into the random walk model for better group recommendation. Appl Intell 49(6):2046–2058
Jindal N, Liu B (2008) Opinion spam and analysis. In: Proceedings of the 2008 International Conference on Web Search and Data Mining, pp 219–230
Feng S, Zhang H, Wang L, Liu L, Xu Y (2019) Detecting the latent associations hidden in multi-source information for better group recommendation. Knowledge-Based Systems 171:56–68
Li Y, Lin Y, Zhang J, Li J, Zhao L (2015) Highlighting the fake reviews in review sequence with the suspicious contents and behaviours. Journal Of Information & Computational Science 12(4):1615–1627
Liu R, Wang H, Yu X (2018) Shared-nearest-neighbor-based clustering by fast search and find of density peaks. Inf Sci 450:200–226
Wang T, Liu L, Liu N, Zhang H, Zhang L, Feng S (2020) A multi-label text classification method via dynamic semantic representation model and deep neural network. Appl Intell 50 (8):2339–2351
Hinton GE, Salakhutdinov RR (2006) Reducing the dimensionality of data with neural networks. Science 313(5786):504– 507
Blum A, Mitchell T (1998) Combining labeled and unlabeled data with co-training. In: Proceedings of the eleventh annual conference on computational learning theory, pp 92–100
Li J, Ott M, Cardie C, Hovy E (2014) Towards a general rule for identifying deceptive opinion spam. In: Proceedings of the 52nd annual meeting of the association for computational linguistics (Volume 1: Long Papers), pp 1566–1576
Dewang RK, Singh AK (2015) Identification of fake reviews using new set of lexical and syntactic features. In: Proceedings of the sixth international conference on computer and communication technology 2015, pp 115–119
Li L, Qin B, Ren W, Liu T (2017) Document representation and feature combination for deceptive spam review detection. Neurocomputing 254:33–41
Ren Y, Ji D (2017) Neural networks for deceptive opinion spam detection: an empirical study. Inf Sci 385:213–224
Barushka A, Hajek P (2019) Review spam detection using word embeddings and deep neural networks. In: IFIP International conference on artificial intelligence applications and innovations. Springer, pp 340–350
Hajek P, Barushka A, Munk M (2020) Fake consumer review detection using deep neural networks integrating word embeddings and emotion mining. Neural Comput & Applic 32(23):17259–17274
Zeng Z-Y, Lin J-J, Chen M-S, Chen M-H, Lan Y-Q, Liu J-L (2019) A review structure based ensemble model for deceptive review spam. Information 10(7):243
Simonyan K, Zisserman A (2014) Very deep convolutional networks for large-scale image recognition. arXiv:1409.1556
Van der Maaten L, Hinton G (2008) Visualizing data using t-sne J Mach Learn Res 9(11)
Gade K, Geyik S, Kenthapadi K, Mithal V, Taly A (2020) Explainable ai in industry: practical challenges and lessons learned. In: Companion proceedings of the Web conference 2020, pp 303–304
Friedman JH (2001) Greedy function approximation: a gradient boosting machine. Annals of statistics
Ribeiro MT, Singh S, Guestrin C (2016) Why should i trust you? explaining the predictions of any classifier. In: Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp 1135–1144
Tiddi I et al (2020) Directions for explainable knowledge-enabled systems. Knowledge Graphs for eXplainable Artificial intelligence: Foundations Applications and Challenges 47:245
Bizer C, Primpeli A, Peeters R (2019) Using the semantic web as a source of training data. Datenbank-Spektrum 19(2):127–135
Ahmed H, Traore I, Saad S (2018) Detecting opinion spams and fake news using text classification. Security and Privacy 1(1):9
Martineau JC, Finin T (2009) Delta tfidf: an improved feature space for sentiment analysis. In: Third international AAAI conference on weblogs and social media
Brown PF, Della Pietra VJ, Desouza PV, Lai JC, Mercer RL (1992) Class-based n-gram models of natural language. Computational linguistics 18(4):467–480
Pennington J, Socher R, Manning CD (2014) Glove: Global vectors for word representation. In: Proceedings of the 2014 conference on empirical methods in natural language processing (EMNLP), pp 1532–1543
Ott M, Choi Y, Cardie C, Hancock JT (2011) Finding deceptive opinion spam by any stretch of the imagination. arXiv:11070.4557
Li FH, Huang M, Yang Y, Zhu X (2011) Learning to identify review spam. In: Twenty-second international joint conference on artificial intelligence
Cao N, Ji S, Chiu DK, He M, Sun X (2020) A deceptive review detection framework: Combination of coarse and fine-grained features. Expert Syst Appl 156:113465
Kennedy S, Walsh N, Sloka K, Foster J, McCarren A (2020) Fact or factitious? contextualized opinion spam detection. arXiv:2010.15296
Jiang C, Zhang X, Jin A (2020) Detecting online fake reviews via hierarchical neural networks and multivariate features. In: International conference on neural information processing. Springer, pp 730–742
Neisari A, Rueda L, Saad S (2021) Spam review detection using self-organizing maps and convolutional neural networks. Computers & Security 106:102274
Acknowledgments
This work is supported by the National Nature Science Foundation of China (No.61672329, No.62072290,No. 81871508, No. 61773246); Major Program of Shandong Province Natural Science Foundation (ZR2019ZD04, No. ZR2018ZB0419); Shandong Provincial Project of Education Scientific Plan (No.SDYY18058).
Author information
Authors and Affiliations
Contributions
All authors contributed to the study conception and design. Material preparation, data collection and analysis were performed by Han Shu and Wang Hong. The first draft of the manuscript was written by Han Shu and all authors commented on previous versions of the manuscript. All authors read and approved the final manuscript.
Corresponding author
Ethics declarations
Conflict of Interests
The authors declared that they have no conflicts of interest to this work.
Additional information
Availability of data and materials
The data set can be obtained at the link below http://myleott.com/op-spam.html
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Han, S., Wang, H., Li, W. et al. Explainable knowledge integrated sequence model for detecting fake online reviews. Appl Intell 53, 6953–6965 (2023). https://doi.org/10.1007/s10489-022-03822-8
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10489-022-03822-8