Abstract
Detecting deceptive reviews can assist customers in grasping the real evaluation of products and services to make better purchase decisions and help companies satisfy timely to their customers’ expectations. Methods based on neural networks for deceptive review detection have made significant progress in recent years. Models using attention mechanisms such as BERT have demonstrated the ability to capture contextual information in review texts. However, their ability to capture global information about the word level is limited. This latter is the strength of Graph Convolutional Networks (GCNs). In this study, we propose a detection model (SGCN-BERT) based on the combination of Semantic Graph Convolutional Network (SGCN) and pre-trained model BERT. During the construction of the heterogeneous review graph, we consider both the co-occurrence relationship and semantic relationship between words to enrich the graph information. The graph embedding of the reviews are obtained through SGCN and input to BERT together with word embeddings. Global and local information containing lexical-semantic interact through different layers of BERT, allowing them to influence and build the final classification representation jointly mutually. Comprehensive tests on four public datasets show that our method outperforms previous methods and has good generalization capability.
Similar content being viewed by others
References
Filieri R, McLeay F (2013) E-wom and accommodation: an analysis of the factors that influence travelers’ adoption of information from online reviews. J Travel Res 53(1):44–57. https://doi.org/10.1177/0047287513481274
Jindal N, Liu B (2008) Opinion Spam and Analysis. In: Proceedings of the 2008 International Conference on Web Search and Data Mining, pp. 219–230 (2008 Published). https://doi.org/10.1145/1341531.1341560
Crawford M, Khoshgoftaar TM, Prusa JD, Richter AN, Najada HA (2015) Survey of review spam detection using machine learning techniques. J Big Data 2(1):23–47. https://doi.org/10.1186/S40537-015-0029-9
Ott M, Choi Y, Cardie C, Hancock JT (2011) Finding deceptive opinion spam by any stretch of the imagination. In: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, pp. 309–319 (2011 Published). https://aclanthology.org/P11-1032
Ott M, Cardie C, Hancock JT (2013) Negative Deceptive Opinion Spam. In: North American Chapter of the Association for Computational Linguistics, pp. 497–501 (2013 Published). https://aclanthology.org/N13-1053
Li J, Cardie C, Li S (2013) TopicSpam: a Topic-Model Based Approach for Spam Detection. In: Meeting of the Association for Computational Linguistics, pp. 217–221 (2013 Published). https://aclanthology.org/P13-2039
Li J, Ott M, Cardie C, Hovy E (2014) Towards a General Rule for Identifying Deceptive Opinion Spam. In: Meeting of the Association for Computational Linguistics, pp. 1566–1576 (2014 Published). https://doi.org/10.3115/V1/P14-1147
Yingjie T, Mahboubeh M, Peyman T, Hosseini SM (2020) Bamakan: A non-convex semi-supervised approach to opinion spam detection by ramp-one class svm. Inf Process Manage 57:102381. https://doi.org/10.1016/j.ipm.2020.102381
Li L, Qin B, Ren W, Liu T (2017) Document representation and feature combination for deceptive spam review detection. Neurocomputing 254(254):33–41. https://doi.org/10.1016/J.NEUCOM.2016.10.080
Ren Y, Ji D (2017) Neural networks for deceptive opinion spam detection: an empirical study. Inf Sci 385:213–224. https://doi.org/10.1016/J.INS.2017.01.015
Battaglia PW, Hamrick JB, Bapst V, Sanchez-Gonzalez A, Zambaldi VF, Malinowski M, Tacchetti A, Raposo D, Santoro A, Faulkner R, Glehre, Song HF, Ballard AJ, Gilmer J, Dahl GE, Vaswani A, Allen KR, Nash C, Langston V, Dyer C, Heess N, Wierstra D, Kohli P, Botvinick M, Vinyals O, Li Y, Pascanu R. (2018) Relational inductive biases, deep learning, and graph networks. CoRR arxiv:abs/1806.01261
Kipf TN, Welling M (2017) Semi-Supervised Classification with Graph Convolutional Networks. In: International Conference on Learning Representations (2017 Published). https://doi.org/10.1109/ICDM.2019.00070
Liu X, You X, Zhang X, Wu J, Lv P (2020) Tensor Graph Convolutional Networks for Text Classification. In: Proceedings of the AAAI Conference on Artificial Intelligence, 34: 8409–8416 (2020 Published). https://doi.org/10.1609/AAAI.V34I05.6359
Yao L, Mao C, Luo Y (2019) Graph Convolutional Networks for Text Classification. In: Proceedings of the AAAI Conference on Artificial Intelligence, 33: 7370–7377 (2019 Published). https://doi.org/10.1609/AAAI.V33I01.33017370
Hai Z, Zhao P, Cheng P, Yang P, Li X-L, Li G (2016) Deceptive Review Spam Detection Via Exploiting Task Relatedness and Unlabeled Data. In: Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, pp. 1817–1826 (2016 Published). https://doi.org/10.18653/V1/D16-1187
Devlin J, Chang M-W, Lee K, Toutanova KN (2018) BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics, 1: 4171–4186 (2018 Published). https://doi.org/10.18653/V1/N19-1423
Li L, Ren W, Qin B, Liu T (2015) Learning Document Representation for Deceptive Opinion Spam Detection. In: Proceedings of the 4th China National Conference on Chinese Computational Linguistics, pp. 393–404 (2015 Published). https://doi.org/10.1007/978-3-319-25816-4_32
Wang X, Liu K, He S, Zhao J (2016) Learning to Represent Review with Tensor Decomposition for Spam Detection. In: Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, pp. 866–875 (2016 Published). https://doi.org/10.18653/V1/D16-1083
Ren Y, Zhang Y (2016) Deceptive Opinion Spam Detection Using Neural Network. In: Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers, pp. 140–150 (2016 Published). https://aclanthology.org/C16-1014
O’Shea J, Crockett K, Khan W, Kindynis P, Antoniades A, Boultadakis G (2018) Intelligent Deception Detection Through Machine Based Interviewing. In: 2018 International Joint Conference on Neural Networks (IJCNN), pp. 1–8. https://doi.org/10.1109/IJCNN.2018.8489392
Wasiq K, Keeley C, James O, Abir H, Bilal MK (2021) Deception in the eyes of deceiver: a computer vision and machine learning based automated deception detection. Expert Syst Appl 169:114341. https://doi.org/10.1016/j.eswa.2020.114341
Vaswani A, Shazeer N, Parmar N, Uszkoreit J, Jones L, Gomez AN, Kaiser L, Polosukhin I (2017) Attention Is All You Need. In: Proceedings of the 31st International Conference on Neural Information Processing Systems, 30: 5998–6008 (2017 Published)
Zhang Y, Fan Y, Ye Y, Zhao L, Shi C (2019) Key Player Identification in Underground Forums over Attributed Heterogeneous Information Network Embedding Framework. In: Proceedings of the 28th ACM International Conference on Information and Knowledge Management, pp. 549–558 (2019 Published). https://doi.org/10.1145/3357384.3357876
Wang D, Qi Y, Lin J, Cui P, Jia Q, Wang Z, Fang Y, Yu Q, Zhou J, Yang S (2019) A Semi-Supervised Graph Attentive Network for Financial Fraud Detection. In: 2019 IEEE International Conference on Data Mining (ICDM), pp. 598–607 (2019 Published). https://doi.org/10.1109/ICDM.2019.00070
Li A, Qin Z, Liu R, Yang Y, Li D (2019) Spam Review Detection with Graph Convolutional Networks. In: Proceedings of the 28th ACM International Conference on Information and Knowledge Management, pp. 2703–2711 (2019 Published). https://doi.org/10.1145/3357384.3357820
Wang G, Xie S, Liu B, Yu PS (2011) Review Graph Based Online Store Review Spammer Detection. In: 2011 IEEE 11th International Conference on Data Mining, pp. 1242–1247 (2011 Published). https://doi.org/10.1109/ICDM.2011.124
Yilmaz CM, Durahim AO (2018) SPR2EP: a Semi-supervised Spam Review Detection Framework. In: Proceedings of the 2018 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, pp. 306–313 (2018 Published). https://doi.org/10.1109/ASONAM.2018.8508314
Wang J, Wen R, Wu C, Huang Y, Xion J (2019) FdGars: Fraudster Detection Via Graph Convolutional Networks in Online App Review System. In: Companion of the 2019 World Wide Web Conference, pp. 310–316 (2019 Published). https://doi.org/10.1145/3308560.3316586
Manaskasemsak B, Chanmakho C, Klainongsuang J, Rungsawang A (2019) Opinion Spam Detection Through User Behavioral Graph Partitioning Approach. In: Proceedings of the 2019 3rd International Conference on Intelligent Systems, Metaheuristics and Swarm Intelligence, pp. 73–77 (2019 Published). https://doi.org/10.1145/3325773.3325783
Mukherjee A, Kumar A, Liu B, Wang J, Hsu M, Castellanos M, Ghosh R (2013) Spotting Opinion Spammers Using Behavioral Footprints. In: Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 632–640. https://doi.org/10.1145/2487575.2487580
Zeng ZY, Lu XY, Xu SJ (2020) Spam review detection base on deep learning model of multi-layer attention mechanism. Comput Appl Softw 37(5):177–182
Hajek P, Barushka A, Munk M (2020) Fake consumer review detection using deep neural networks integrating word embeddings and emotion mining. Neural Comput Appl 32(23):17259–17274. https://doi.org/10.1007/s00521-020-04757-2
Neisari A, Rueda L, Saad S (2021) Spam review detection using self-organizing maps and convolutional neural networks. Comput Secur. https://doi.org/10.1016/J.COSE.2021.102274
Liu Y, Ott M, Goyal N, Du J, Joshi M, Chen D, Levy O, Lewis M, Zettlemoyer L, Stoyanov V (2019) Roberta: A robustly optimized bert pretraining approach. CoRR arxiv:abs/1907.11692 (2019)
Cao N, Ji S, Chiu DK, Gong M (2022) A deceptive reviews detection model: separated training of multi-feature learning and classification. Expert Syst Appl. https://doi.org/10.1016/j.eswa.2021.115977
Acknowledgements
This work is supported by the Fundamental Research Funds for the Central Universities (Grant No. 2572019BH03).
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no conflicts of interest to report regarding the present study.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Li, S., Cheng, W. Augmenting the global semantic information between words to heterogeneous graph for deception detection. Neural Comput & Applic 34, 19079–19090 (2022). https://doi.org/10.1007/s00521-022-07492-y
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00521-022-07492-y