A Deep Hybrid Model for fake review detection by jointly leveraging review text, overall ratings, and aspect ratings

Duma, Ramadhani Ally; Niu, Zhendong; Nyamawe, Ally S.; Tchaye-Kondi, Jude; Yusuf, Abdulganiyu Abdu

doi:10.1007/s00500-023-07897-4

A Deep Hybrid Model for fake review detection by jointly leveraging review text, overall ratings, and aspect ratings

Data analytics and machine learning
Published: 06 March 2023

Volume 27, pages 6281–6296, (2023)
Cite this article

Soft Computing Aims and scope Submit manuscript

1003 Accesses
Explore all metrics

Abstract

Recently, product/ service reviews and online businesses have been similar to the blood–heart relationship as they greatly impact customers’ purchase decisions. There is an increasing incentive to manipulate reviews, mostly profit-motivated, as positive reviews imply high purchases and vice versa. Therefore, a suitable fake review detection approach is paramount in ensuring fair e-business competition and sustainability. Most existing methods mainly utilize discrete review features such as text similarity, rating deviation, review content, product information, the semantic meaning of reviews, and reviewer behaviors. In the matter of discourse, some recent researchers attempted multi-feature (review- and reviewer-centric features) integration. However, such approaches face two issues: (1) Review representation is extracted in an independent manner, thus ignoring correlations between them (2) Lack of a unified framework that can jointly learn latent text feature vectors, aspect ratings, and overall rating. To address the named issues, we propose a novel Deep Hybrid Model for fake review detection, which jointly learns from latent text feature vectors, aspect ratings, and overall ratings. Initially, it computes contextualized review text vectors, extracts aspects, and calculates respective rating values. Then, contextualized word vectors, overall ratings, and aspect ratings are concatenated. Finally, the model learns to classify reviews from such unified multi-dimensional feature representation. Extensive experiments on a publicly available dataset demonstrate that the proposed approach significantly outperforms state-of-the-art baseline approaches.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

DHMFRD – TER: a deep hybrid model for fake review detection incorporating review texts, emotions, and ratings

Article 26 May 2023

Fake review detection techniques, issues, and future research directions: a literature review

Article 17 May 2024

WF-CFRB: A Deep Learning Approach for Fake Review Detection Based on Weighted Fusion of Contextual Features and Reviewer Behaviors

Article 05 March 2025

Discover the latest articles, news and stories from top researchers in related subjects.

Artificial Intelligence

Data availability

Enquiries about data availability should be directed to the authors.

Notes

Abbreviations

ACB:: Attention-based CNN Bi-LSTM
AMT:: Amazon mechanical turk
AR-LOF:: Aspects rating local outlier factor
BERT:: Bidirectional encoder representations from transformers
Bi-LSTM:: Bidirectional Long Short-Term Memory
bfGAN:: Behavioral feature Generative Adversarial network
CNN:: Convolution neural network
DFFNN:: Deep feed-forward neural network
DL:: Deep learning
DSRHA:: Detection of spam reviews through a hierarchical attention architecture with N-gram CNN and Bi-LSTM
FABC:: Fuzzy artificial bee colony
$F_N$ :: False negative
$F_P$ :: False positive
IP:: Internet protocol
LARA:: Latent aspects rating analysis LDA Latent Dirichlet Allocation
LSTM:: Long short-term memory
MAC:: Media access control
ML:: Machine learning
MLP:: Multilayer perceptron
NB:: Naïve Bayes
NLTK:: Natural language toolkit
NN:: Neural network
NRC:: National research council canada
ReLU:: Rectified linear unit
RF:: Random forest
SVM:: Support vector machines
$T_N$ :: True negative
$T_P$ :: True positive
VPN:: Virtual private network

References

Ahmad SN, Laroche M (2015) How do expressed emotions affect the helpfulness of a product review? evidence from reviews using latent semantic analysis. Int J Electron Commer 20(1):76–111
Article Google Scholar
Alsharif N (2022) Fake opinion detection in an e-commerce business based on a long-short memory algorithm. Soft Comput 26:1–8
Article Google Scholar
Asghar MZ, Ullah A, Ahmad S, Khan A (2020) Opinion spam detection framework using hybrid classification scheme. Soft Comput 24(5):3475–3498
Article Google Scholar
Barbado R, Araque O, Iglesias CA (2019) A framework for fake review detection in online consumer electronics retailers. Inf Process Manag 56(4):1234–1244
Article Google Scholar
Bathla G, Singh P, Singh RK, Cambria E, Tiwari R (2022) Intelligent fake reviews detection based on aspect extraction and analysis using deep learning. Neural Comput Appl 34(22):20213–20229
Article Google Scholar
Bhuvaneshwari P, Rao AN, Robinson YH (2021) Spam review detection using self attention based cnn and bi-directional lstm. Multimed Tools Appl 80(12):18107–18124
Article Google Scholar
Budhi GS, Chiong R, Wang Z (2021) Resampling imbalanced data to detect fake reviews using machine learning classifiers and textual-based features. Multimed Tools Appl 80(9):13079–13097
Article Google Scholar
Chen W, Yeo CK, Lau CT, Lee BS (2017) A study on real-time low-quality content detection on twitter from the users’ perspective. PLoS One 12(8):e0182487
Article Google Scholar
Cheng Z, Ding Y, Zhu L, Kankanhalli M (2018) Aspect-aware latent factor model: rating prediction with ratings and reviews. In: Proceedings of the 2018 world wide web conference, pp 639–648
Dong L-Y, Ji S-J, Zhang C-J, Zhang Q, Chiu DW, Qiu L-Q, Li D (2018) An unsupervised topic-sentiment joint probabilistic model for detecting deceptive reviews. Expert Syst Appl 114:210–223
Article Google Scholar
Ellson A (2018) A third of tripadvisor reviews are fake as cheats buy five stars. the times. https://www.thetimes.co.uk/article/hotel-and-caf-cheats-are-caught-trying-to-buy-tripadvisor-stars-027fbcwc8. Accessed: 2021-12-20
Fei G, Mukherjee A, Liu B, Hsu M, Castellanos M, Ghosh R (2013) Exploiting burstiness in reviews for review spammer detection. In: Proceedings of the international AAAI conference on web and social media 7:175–184
Feng S, Banerjee R, Choi Y (2012) Syntactic stylometry for deception detection. InL Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), pp 171–175
Guo Y, Barnes SJ, Jia Q (2017) Mining meaning from online ratings and reviews: tourist satisfaction analysis using latent dirichlet allocation. Tour Manage 59:467–483
Article Google Scholar
Hajek P, Barushka A, Munk M (2020) Fake consumer review detection using deep neural networks integrating word embeddings and emotion mining. Neural Comput Appl 32(23):17259–17274
Article Google Scholar
Hajek P, Sahut J-M et al (2022) Mining behavioural and sentiment-dependent linguistic patterns from restaurant reviews for fake review detection. Technol Forecast Soc Change. https://doi.org/10.1016/j.techfore.2022.121532
Article Google Scholar
Hayat U, Saeed A, Vardag MHK, Ullah MF, Iqbal N (2022) Roman urdu fake reviews detection using stacked lstm architecture. SN Comput Sci 3(6):1–9
Article Google Scholar
Hu M, Liu B (2004) Mining opinion features in customer reviews. In AAAI 4:755–760
Google Scholar
Jacob MS, Selvi Rajendran P (2022) Fuzzy artificial bee colony-based cnn-lstm and semantic feature for fake product review classification. Concurr Comput Pract Exp 34(1):e6539
Article Google Scholar
Jindal N, Liu B (2007) Review spam detection. In: Proceedings of the 16th international conference on World Wide Web, pp 1189–1190
Jindal N, Liu B, Lim E-P (2010) Finding unusual review patterns using unexpected rules. In: Proceedings of the 19th ACM international conference on Information and knowledge management, pp 1549–1552
Kaliyar RK, Goswami A, Narang P (2021) Fakebert: fake news detection in social media with a bert-based deep learning approach. Multimed Tools Appl 80(8):11765–11788
Article Google Scholar
Khan ZY, Niu Z, Nyamawe AS, Ul Haq I (2021) A deep hybrid model for recommendation by jointly leveraging ratings, reviews and metadata information. Eng Appl Artif Intell 97:104066
Article Google Scholar
Kokate S, Tidke B (2015) Fake review and brand spam detection using j48 classifier. IJCSIT Int J Comput Sci Inf Technol 6(4):3523–3526
Google Scholar
Li H, Fei G, Wang S, Liu B, Shao W, Mukherjee A, Shao J (2017) Bimodal distribution and co-bursting in review spam detection. In: Proceedings of the 26th international conference on world wide web, pp 1063–1072
Li J, Fu Y, Liu D, Xu R (2020a). Improving fake product detection with aspect-based sentiment analysis. In: International conference on cognitive computing, pp 39–49. Springer
Li J, Lv P, Xiao W, Yang L, Zhang P (2021) Exploring groups of opinion spam using sentiment analysis guided by nominated topics. Expert Syst Appl 171:114585
Article Google Scholar
Li J, Wang X, Yang L, Zhang P, Yang D (2020) Identifying ground truth in opinion spam: an empirical survey based on review psychology. Appl Intell 50(11):3554–3569
Article Google Scholar
Lim E-P, Nguyen V-A, Jindal N, Liu B, Lauw HW (2010) Detecting product review spammers using rating behaviors. In: Proceedings of the 19th ACM international conference on Information and knowledge management, pp 939–948
Liu Y, Wang L, Shi T, Li J (2022) Detection of spam reviews through a hierarchical attention architecture with n-gram CNN and bi-LSTM. Inf Syst 103:101865
Article Google Scholar
Liu Y, Wang L, Shi T, Li J (2022) Detection of spam reviews through a hierarchical attention architecture with n-gram cnn and bi-lstm. Inf Syst 103:101865
Article Google Scholar
Luca M (2016) Reviews, reputation, and revenue: The case of yelp. com. Com (March 15, 2016). Harvard Business School NOM Unit Working Paper, (12-016)
Luo N, Deng H, Zhao L, Liu Y, Wang X, Tan Z (2017) Multi-aspect feature based neural network model in detecting fake reviews. In: 2017 4th international conference on information science and control engineering (ICISCE), pp 475–479. IEEE
Luo Y, Tang RL (2019) Understanding hidden dimensions in textual reviews on airbnb: an application of modified latent aspect rating analysis (lara). Int J Hosp Manag 80:144–154
Article Google Scholar
Manaskasemsak B, Tantisuwankul J, Rungsawang A (2021) Fake review and reviewer detection through behavioral graph partitioning integrating deep neural network. Neural Comput Appl 35:1–14
Google Scholar
Mohammad SM, Turney PD (2013) Nrc emotion lexicon. Nat Res Counc Canada 2:234
Google Scholar
Mohawesh R, Tran S, Ollington R, Xu S (2021) Analysis of concept drift in fake reviews detection. Expert Syst Appl 169:114318
Article Google Scholar
Mukherjee A, Kumar A, Liu B, Wang J, Hsu M, Castellanos M, Ghosh R (2013a) Spotting opinion spammers using behavioral footprints. In: Proceedings of the 19th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 632–640
Mukherjee A, Liu B (2012) Aspect extraction through semi-supervised modeling. In: Proceedings of the 50th annual meeting of the association for computational linguistics (Volume 1: Long Papers), pp 339–348
Mukherjee A, Liu B, Glance N (2012) Spotting fake reviewer groups in consumer reviews. In: Proceedings of the 21st international conference on World Wide Web, pp 191–200
Mukherjee A, Venkataraman V, Liu B, Glance N (2013b) What yelp fake review filter might be doing? In: Proceedings of the International AAAI Conference on Web and Social Media, (volume 7)
Noekhah S, Fouladfar E, Salim N, Ghorashi SH, Hozhabri AA (2014) A novel approach for opinion spam detection in e-commerce. In: Proceedings of the 8th IEEE international conference on E-commerce with focus on E-trust
Ochi M, Okabe M, Onai R (2011) Rating prediction using feature words extracted from customer reviews. In: Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval, pp 1205–1206
Pang B, Lee L (2005) Seeing stars: exploiting class relationships for sentiment categorization with respect to rating scales. arXiv preprint arXiv:cs/0506075
Rajamohana S, Umamaheswari K (2018) Hybrid approach of improved binary particle swarm optimization and shuffled frog leaping for feature selection. Comput Electr Eng 67:497–508
Article Google Scholar
Rayana S, Akoglu L (2015) Collective opinion spam detection: Bridging review networks and metadata. In: Proceedings of the 21th acm sigkdd international conference on knowledge discovery and data mining, pp 985–994
Rehman AU, Malik AK, Raza B, Ali W (2019) A hybrid cnn-lstm model for improving accuracy of movie reviews sentiment analysis. Multimed Tools Appl 78(18):26597–26613
Article Google Scholar
Ren J, Yeoh W, Shan Ee M, Popovič A (2018) Online consumer reviews and sales: examining the chicken-egg relationships. J Am Soc Inf Sci 69(3):449–460
Google Scholar
Ren Y, Ji D (2017) Neural networks for deceptive opinion spam detection: an empirical study. Inf Sci 385:213–224
Article Google Scholar
Rout JK, Singh S, Jena SK, Bakshi S (2017) Deceptive review detection using labeled and unlabeled data. Multimed Tools Appl 76(3):3187–3211
Article Google Scholar
Savage D, Zhang X, Yu X, Chou P, Wang Q (2015) Detection of opinion spam based on anomalous rating deviation. Expert Syst Appl 42(22):8650–8657
Article Google Scholar
Shan G, Zhou L, Zhang D (2021) From conflicts and confusion to doubts: examining review inconsistency for fake review detection. Decis Support Syst 144:113513
Article Google Scholar
Sundermeyer M, Schlüter R, Ney H (2012) Lstm neural networks for language modeling. In: 13th annual conference of the international speech communication association
Tang X, Qian T, You Z (2020) Generating behavior features for cold-start spam review detection with adversarial learning. Inf Sci 526:274–288
Article Google Scholar
Titov I, McDonald R (2008) Modeling online reviews with multi-grain topic models. In: Proceeding of the 17th international conference on World Wide Web - WWW ’08. ACM Press
Vidanagama DU, Silva TP, Karunananda AS (2020) Deceptive consumer review detection: a survey. Artif Intell Rev 53(2):1323–1352
Article Google Scholar
Wang H, Lu Y, Zhai C (2010) Latent aspect rating analysis on review text data: a rating regression approach. In: Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 783–792
Wang H, Lu Y, Zhai C (2011) Latent aspect rating analysis without aspect keyword supervision. In: Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 618–626
Wang X, Liu K, Zhao J (2017) Handling cold-start problem in review spam detection by jointly embedding texts and behaviors. In: Proceedings of the 55th annual meeting of the association for computational linguistics (Volume 1: Long Papers), pp 366–376
Weng C-H, Lin K-C, Ying J-C (2022) Detection of chinese deceptive reviews based on pre-trained language model. Appl Sci 12(7):3338
Article Google Scholar
Xie S, Wang G, Lin S, Yu PS (2012) Review spam detection via time series pattern discovery. In: Proceedings of the 21st International Conference on World Wide Web, pp 635–636
Xu Q, Zhao H (2012) Using deep linguistic features for finding deceptive opinion spam. In: Proceedings of COLING 2012: Posters, pp 1341–1350
Yao J, Zheng Y, Jiang H (2021) An ensemble model for fake online review detection based on data resampling, feature pruning, and parameter optimization. IEEE Access 9:16914–16927
Article Google Scholar
You L, Peng Q, Xiong Z, He D, Qiu M, Zhang X (2020) Integrating aspect analysis and local outlier factor for intelligent review spam detection. Futur Gener Comput Syst 102:163–172
Article Google Scholar
You Z, Qian T, Liu B (2018) An attribute enhanced domain adaptive model for cold-start spam review detection. In: Proceedings of the 27th international conference on computational linguistics, pp 1884–1895
Zhang Y, Wallace B (2015) A sensitivity analysis of (and practitioners’ guide to) convolutional neural networks for sentence classification. arXiv preprint arXiv:1510.03820
Zhu J, Zhu M, Wang H, Tsou BK (2009) Aspect-based sentence segmentation for sentiment summarization. In: Proceedings of the 1st international CIKM workshop on Topic-sentiment analysis for mass opinion, pp 65–72

Download references

Funding

This work was supported by the National Key R &D Program of China under Grants 2019YFB1406302, and the National Natural Science Foundation of China under Grant 62272048.

Author information

Authors and Affiliations

School of Computer Science and Technology, Beijing Institute of Technology, Beijing, 100081, People’s Republic of China
Ramadhani Ally Duma, Zhendong Niu, Jude Tchaye-Kondi & Abdulganiyu Abdu Yusuf
College of Informatics and Virtual Education, The University of Dodoma, Dodoma, Tanzania
Ramadhani Ally Duma & Ally S. Nyamawe
School of Computing and Information, University of Pittsburgh, Pittsburgh, PA, USA
Zhendong Niu

Authors

Ramadhani Ally Duma
View author publications
You can also search for this author inPubMed Google Scholar
Zhendong Niu
View author publications
You can also search for this author inPubMed Google Scholar
Ally S. Nyamawe
View author publications
You can also search for this author inPubMed Google Scholar
Jude Tchaye-Kondi
View author publications
You can also search for this author inPubMed Google Scholar
Abdulganiyu Abdu Yusuf
View author publications
You can also search for this author inPubMed Google Scholar

Contributions

RAD Conceptualization, Methodology, Software, Formal analysis, Investigation, Resources, Data curation, Writing—original draft, visualization. ZN Methodology, Resources, Supervision, Validation, Visualization, Writing—review & editing, Project administration. AN Conceptualization, Methodology, Writing—review & editing JT-K Software, Methodology, Data curation AY Writing—review & editing

Corresponding author

Correspondence to Zhendong Niu.

Ethics declarations

Conflict of interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Informed consent

No formal consent is required for this kind of study.

Ethical approval

For this kind of study, there is nothing that requires ethical approval.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Duma, R.A., Niu, Z., Nyamawe, A.S. et al. A Deep Hybrid Model for fake review detection by jointly leveraging review text, overall ratings, and aspect ratings. Soft Comput 27, 6281–6296 (2023). https://doi.org/10.1007/s00500-023-07897-4

Download citation

Accepted: 01 February 2023
Published: 06 March 2023
Issue Date: May 2023
DOI: https://doi.org/10.1007/s00500-023-07897-4

Keywords

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A Deep Hybrid Model for fake review detection by jointly leveraging review text, overall ratings, and aspect ratings

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

DHMFRD – TER: a deep hybrid model for fake review detection incorporating review texts, emotions, and ratings

Fake review detection techniques, issues, and future research directions: a literature review

WF-CFRB: A Deep Learning Approach for Fake Review Detection Based on Weighted Fusion of Contextual Features and Reviewer Behaviors

Explore related subjects

Data availability

Notes

Abbreviations

References

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflict of interest

Informed consent

Ethical approval

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now