Explainable knowledge integrated sequence model for detecting fake online reviews

Han, Shu; Wang, Hong; Li, Wei; Zhang, Hui; Zhuang, Luhe

doi:10.1007/s10489-022-03822-8

Published: 12 July 2022

Volume 53, pages 6953–6965 (2023)
Applied Intelligence

Shu Han¹,
Hong Wang ORCID: orcid.org/0000-0001-5468-8400¹,
Wei Li¹,
Hui Zhang¹ &
Luhe Zhuang¹

626 Accesses
4 Citations

Abstract

Online reviews strongly influence customers’ shopping decisions. However, countless fake reviews are posted on shopping platforms; they seriously interfere with those decisions and undermine a fair e-commerce environment. In this paper, we propose EKI-SM, an explainable knowledge integrated sequence model, to detect fake reviews. Compared with existing models, EKI-SM has four advantages: 1) It integrates a set of important knowledge and learns high-dimensional word embeddings from reviews to guide the fake review detection task; in addition, this knowledge helps explain the model’s results. 2) It learns a continuous sequence model from discrete observations with high-dimensional features, which helps it learn more discriminative fake review features. 3) It fuses a one-dimensional convolutional network, a long short-term memory network, and a residual connection to capture the local and global dependencies of the sequence and make the prediction model more robust. 4) Inspired by interpretable deep learning, we explain EKI-SM and identify the critical words for detecting fake online reviews, which yields some interesting insights. Experiments on real fake review datasets show that EKI-SM achieves higher accuracy in fake review detection than other state-of-the-art methods, benefiting from the integration of knowledge and multi-modal features.
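To make the third point concrete, the following is a minimal pure-Python sketch of how a one-dimensional convolution, a residual connection, and an LSTM can be chained over a sequence of word embeddings. It is not the authors' EKI-SM implementation: the layer order, the sizes, the random untrained weights, and every function name (`conv1d_same`, `residual_add`, `lstm_last_hidden`, `fake_probability`) are assumptions chosen for illustration, and the knowledge-integration step is omitted entirely.

```python
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def conv1d_same(seq, filters, bias):
    """ReLU(1-D convolution) over time, zero-padded so the output keeps length T.

    seq:     T vectors of size D
    filters: D_out filters, each K rows of size D (K assumed odd)
    """
    k_size, dim = len(filters[0]), len(seq[0])
    pad = k_size // 2
    zero = [0.0] * dim
    padded = [zero] * pad + seq + [zero] * pad
    out = []
    for t in range(len(seq)):
        window = padded[t:t + k_size]
        row = []
        for filt, b in zip(filters, bias):
            s = b + sum(filt[k][d] * window[k][d]
                        for k in range(k_size) for d in range(dim))
            row.append(max(0.0, s))  # ReLU
        out.append(row)
    return out


def residual_add(seq, branch):
    """Residual connection: add the block input back to the block output."""
    return [[x + y for x, y in zip(xs, ys)] for xs, ys in zip(seq, branch)]


def lstm_last_hidden(seq, w_in, w_rec, bias, hidden):
    """Run a single-layer LSTM over seq and return the final hidden state.

    Gate rows are stacked in the order input, forget, candidate, output.
    """
    h = [0.0] * hidden
    c = [0.0] * hidden
    for x in seq:
        z = [bias[j]
             + sum(w * v for w, v in zip(w_in[j], x))
             + sum(w * v for w, v in zip(w_rec[j], h))
             for j in range(4 * hidden)]
        i = [sigmoid(v) for v in z[:hidden]]
        f = [sigmoid(v) for v in z[hidden:2 * hidden]]
        g = [math.tanh(v) for v in z[2 * hidden:3 * hidden]]
        o = [sigmoid(v) for v in z[3 * hidden:]]
        c = [f[k] * c[k] + i[k] * g[k] for k in range(hidden)]
        h = [o[k] * math.tanh(c[k]) for k in range(hidden)]
    return h


def rand_matrix(rng, rows, cols, scale=0.1):
    return [[rng.uniform(-scale, scale) for _ in range(cols)] for _ in range(rows)]


def fake_probability(embedded_review, seed=0):
    """Score one review given as a list of word-embedding vectors.

    Weights are random (untrained); this only shows how data flows.
    """
    rng = random.Random(seed)
    dim, k_size, hidden = len(embedded_review[0]), 3, 4
    # D_out == D so the residual addition is shape-compatible.
    filters = [rand_matrix(rng, k_size, dim) for _ in range(dim)]
    local = conv1d_same(embedded_review, filters, [0.0] * dim)  # local patterns
    fused = residual_add(embedded_review, local)                # skip connection
    h = lstm_last_hidden(fused,                                 # long-range context
                         rand_matrix(rng, 4 * hidden, dim),
                         rand_matrix(rng, 4 * hidden, hidden),
                         [0.0] * (4 * hidden), hidden)
    w_out = [rng.uniform(-0.1, 0.1) for _ in range(hidden)]
    return sigmoid(sum(w * v for w, v in zip(w_out, h)))


# Toy input: 5 "words", each a 3-dimensional embedding (made-up numbers).
review = [[0.1, -0.2, 0.3], [0.0, 0.5, -0.1], [0.4, 0.4, 0.0],
          [-0.3, 0.2, 0.1], [0.2, -0.1, -0.4]]
score = fake_probability(review)
print(f"P(fake) from untrained sketch: {score:.3f}")
```

In this sketch the convolution sees a window of three neighbouring words (local dependency), the LSTM carries state across the whole review (global dependency), and the residual addition lets the raw embeddings bypass the convolution. A trained model would learn the weights from labelled reviews instead of drawing them at random.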

Code Availability

The code is written in Python and was run in PyCharm. The code is available.

Acknowledgments

This work is supported by the National Natural Science Foundation of China (No. 61672329, No. 62072290, No. 81871508, No. 61773246); the Major Program of Shandong Province Natural Science Foundation (No. ZR2019ZD04, No. ZR2018ZB0419); and the Shandong Provincial Project of Education Scientific Plan (No. SDYY18058).

Author information

Authors and Affiliations

School of Information Science and Engineering, Shandong Normal University, Jinan, 250358, China
Shu Han, Hong Wang, Wei Li, Hui Zhang & Luhe Zhuang

Contributions

All authors contributed to the study conception and design. Material preparation, data collection, and analysis were performed by Shu Han and Hong Wang. The first draft of the manuscript was written by Shu Han, and all authors commented on previous versions of the manuscript. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Hong Wang.

Ethics declarations

Conflict of Interests

The authors declare that they have no conflicts of interest related to this work.

Additional information

Availability of data and materials

The dataset can be obtained at http://myleott.com/op-spam.html

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

About this article

Cite this article

Han, S., Wang, H., Li, W. et al. Explainable knowledge integrated sequence model for detecting fake online reviews. Appl Intell 53, 6953–6965 (2023). https://doi.org/10.1007/s10489-022-03822-8

Accepted: 27 May 2022
Published: 12 July 2022
Issue Date: March 2023
DOI: https://doi.org/10.1007/s10489-022-03822-8
