Skip to main content
Log in

Augmenting the global semantic information between words to heterogeneous graph for deception detection

  • Original Article
  • Published:
Neural Computing and Applications Aims and scope Submit manuscript

Abstract

Detecting deceptive reviews can assist customers in grasping the real evaluation of products and services to make better purchase decisions and help companies satisfy timely to their customers’ expectations. Methods based on neural networks for deceptive review detection have made significant progress in recent years. Models using attention mechanisms such as BERT have demonstrated the ability to capture contextual information in review texts. However, their ability to capture global information about the word level is limited. This latter is the strength of Graph Convolutional Networks (GCNs). In this study, we propose a detection model (SGCN-BERT) based on the combination of Semantic Graph Convolutional Network (SGCN) and pre-trained model BERT. During the construction of the heterogeneous review graph, we consider both the co-occurrence relationship and semantic relationship between words to enrich the graph information. The graph embedding of the reviews are obtained through SGCN and input to BERT together with word embeddings. Global and local information containing lexical-semantic interact through different layers of BERT, allowing them to influence and build the final classification representation jointly mutually. Comprehensive tests on four public datasets show that our method outperforms previous methods and has good generalization capability.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1
Fig. 2

Similar content being viewed by others

References

  1. Filieri R, McLeay F (2013) E-wom and accommodation: an analysis of the factors that influence travelers’ adoption of information from online reviews. J Travel Res 53(1):44–57. https://doi.org/10.1177/0047287513481274

    Article  Google Scholar 

  2. Jindal N, Liu B (2008) Opinion Spam and Analysis. In: Proceedings of the 2008 International Conference on Web Search and Data Mining, pp. 219–230 (2008 Published). https://doi.org/10.1145/1341531.1341560

  3. Crawford M, Khoshgoftaar TM, Prusa JD, Richter AN, Najada HA (2015) Survey of review spam detection using machine learning techniques. J Big Data 2(1):23–47. https://doi.org/10.1186/S40537-015-0029-9

    Article  Google Scholar 

  4. Ott M, Choi Y, Cardie C, Hancock JT (2011) Finding deceptive opinion spam by any stretch of the imagination. In: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, pp. 309–319 (2011 Published). https://aclanthology.org/P11-1032

  5. Ott M, Cardie C, Hancock JT (2013) Negative Deceptive Opinion Spam. In: North American Chapter of the Association for Computational Linguistics, pp. 497–501 (2013 Published). https://aclanthology.org/N13-1053

  6. Li J, Cardie C, Li S (2013) TopicSpam: a Topic-Model Based Approach for Spam Detection. In: Meeting of the Association for Computational Linguistics, pp. 217–221 (2013 Published). https://aclanthology.org/P13-2039

  7. Li J, Ott M, Cardie C, Hovy E (2014) Towards a General Rule for Identifying Deceptive Opinion Spam. In: Meeting of the Association for Computational Linguistics, pp. 1566–1576 (2014 Published). https://doi.org/10.3115/V1/P14-1147

  8. Yingjie T, Mahboubeh M, Peyman T, Hosseini SM (2020) Bamakan: A non-convex semi-supervised approach to opinion spam detection by ramp-one class svm. Inf Process Manage 57:102381. https://doi.org/10.1016/j.ipm.2020.102381

    Article  Google Scholar 

  9. Li L, Qin B, Ren W, Liu T (2017) Document representation and feature combination for deceptive spam review detection. Neurocomputing 254(254):33–41. https://doi.org/10.1016/J.NEUCOM.2016.10.080

    Article  Google Scholar 

  10. Ren Y, Ji D (2017) Neural networks for deceptive opinion spam detection: an empirical study. Inf Sci 385:213–224. https://doi.org/10.1016/J.INS.2017.01.015

    Article  Google Scholar 

  11. Battaglia PW, Hamrick JB, Bapst V, Sanchez-Gonzalez A, Zambaldi VF, Malinowski M, Tacchetti A, Raposo D, Santoro A, Faulkner R, Glehre, Song HF, Ballard AJ, Gilmer J, Dahl GE, Vaswani A, Allen KR, Nash C, Langston V, Dyer C, Heess N, Wierstra D, Kohli P, Botvinick M, Vinyals O, Li Y, Pascanu R. (2018) Relational inductive biases, deep learning, and graph networks. CoRR arxiv:abs/1806.01261

  12. Kipf TN, Welling M (2017) Semi-Supervised Classification with Graph Convolutional Networks. In: International Conference on Learning Representations (2017 Published). https://doi.org/10.1109/ICDM.2019.00070

  13. Liu X, You X, Zhang X, Wu J, Lv P (2020) Tensor Graph Convolutional Networks for Text Classification. In: Proceedings of the AAAI Conference on Artificial Intelligence, 34: 8409–8416 (2020 Published). https://doi.org/10.1609/AAAI.V34I05.6359

  14. Yao L, Mao C, Luo Y (2019) Graph Convolutional Networks for Text Classification. In: Proceedings of the AAAI Conference on Artificial Intelligence, 33: 7370–7377 (2019 Published). https://doi.org/10.1609/AAAI.V33I01.33017370

  15. Hai Z, Zhao P, Cheng P, Yang P, Li X-L, Li G (2016) Deceptive Review Spam Detection Via Exploiting Task Relatedness and Unlabeled Data. In: Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, pp. 1817–1826 (2016 Published). https://doi.org/10.18653/V1/D16-1187

  16. Devlin J, Chang M-W, Lee K, Toutanova KN (2018) BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics, 1: 4171–4186 (2018 Published). https://doi.org/10.18653/V1/N19-1423

  17. Li L, Ren W, Qin B, Liu T (2015) Learning Document Representation for Deceptive Opinion Spam Detection. In: Proceedings of the 4th China National Conference on Chinese Computational Linguistics, pp. 393–404 (2015 Published). https://doi.org/10.1007/978-3-319-25816-4_32

  18. Wang X, Liu K, He S, Zhao J (2016) Learning to Represent Review with Tensor Decomposition for Spam Detection. In: Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, pp. 866–875 (2016 Published). https://doi.org/10.18653/V1/D16-1083

  19. Ren Y, Zhang Y (2016) Deceptive Opinion Spam Detection Using Neural Network. In: Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers, pp. 140–150 (2016 Published). https://aclanthology.org/C16-1014

  20. O’Shea J, Crockett K, Khan W, Kindynis P, Antoniades A, Boultadakis G (2018) Intelligent Deception Detection Through Machine Based Interviewing. In: 2018 International Joint Conference on Neural Networks (IJCNN), pp. 1–8. https://doi.org/10.1109/IJCNN.2018.8489392

  21. Wasiq K, Keeley C, James O, Abir H, Bilal MK (2021) Deception in the eyes of deceiver: a computer vision and machine learning based automated deception detection. Expert Syst Appl 169:114341. https://doi.org/10.1016/j.eswa.2020.114341

    Article  Google Scholar 

  22. Vaswani A, Shazeer N, Parmar N, Uszkoreit J, Jones L, Gomez AN, Kaiser L, Polosukhin I (2017) Attention Is All You Need. In: Proceedings of the 31st International Conference on Neural Information Processing Systems, 30: 5998–6008 (2017 Published)

  23. Zhang Y, Fan Y, Ye Y, Zhao L, Shi C (2019) Key Player Identification in Underground Forums over Attributed Heterogeneous Information Network Embedding Framework. In: Proceedings of the 28th ACM International Conference on Information and Knowledge Management, pp. 549–558 (2019 Published). https://doi.org/10.1145/3357384.3357876

  24. Wang D, Qi Y, Lin J, Cui P, Jia Q, Wang Z, Fang Y, Yu Q, Zhou J, Yang S (2019) A Semi-Supervised Graph Attentive Network for Financial Fraud Detection. In: 2019 IEEE International Conference on Data Mining (ICDM), pp. 598–607 (2019 Published). https://doi.org/10.1109/ICDM.2019.00070

  25. Li A, Qin Z, Liu R, Yang Y, Li D (2019) Spam Review Detection with Graph Convolutional Networks. In: Proceedings of the 28th ACM International Conference on Information and Knowledge Management, pp. 2703–2711 (2019 Published). https://doi.org/10.1145/3357384.3357820

  26. Wang G, Xie S, Liu B, Yu PS (2011) Review Graph Based Online Store Review Spammer Detection. In: 2011 IEEE 11th International Conference on Data Mining, pp. 1242–1247 (2011 Published). https://doi.org/10.1109/ICDM.2011.124

  27. Yilmaz CM, Durahim AO (2018) SPR2EP: a Semi-supervised Spam Review Detection Framework. In: Proceedings of the 2018 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, pp. 306–313 (2018 Published). https://doi.org/10.1109/ASONAM.2018.8508314

  28. Wang J, Wen R, Wu C, Huang Y, Xion J (2019) FdGars: Fraudster Detection Via Graph Convolutional Networks in Online App Review System. In: Companion of the 2019 World Wide Web Conference, pp. 310–316 (2019 Published). https://doi.org/10.1145/3308560.3316586

  29. Manaskasemsak B, Chanmakho C, Klainongsuang J, Rungsawang A (2019) Opinion Spam Detection Through User Behavioral Graph Partitioning Approach. In: Proceedings of the 2019 3rd International Conference on Intelligent Systems, Metaheuristics and Swarm Intelligence, pp. 73–77 (2019 Published). https://doi.org/10.1145/3325773.3325783

  30. Mukherjee A, Kumar A, Liu B, Wang J, Hsu M, Castellanos M, Ghosh R (2013) Spotting Opinion Spammers Using Behavioral Footprints. In: Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 632–640. https://doi.org/10.1145/2487575.2487580

  31. Zeng ZY, Lu XY, Xu SJ (2020) Spam review detection base on deep learning model of multi-layer attention mechanism. Comput Appl Softw 37(5):177–182

    Google Scholar 

  32. Hajek P, Barushka A, Munk M (2020) Fake consumer review detection using deep neural networks integrating word embeddings and emotion mining. Neural Comput Appl 32(23):17259–17274. https://doi.org/10.1007/s00521-020-04757-2

    Article  Google Scholar 

  33. Neisari A, Rueda L, Saad S (2021) Spam review detection using self-organizing maps and convolutional neural networks. Comput Secur. https://doi.org/10.1016/J.COSE.2021.102274

    Article  Google Scholar 

  34. Liu Y, Ott M, Goyal N, Du J, Joshi M, Chen D, Levy O, Lewis M, Zettlemoyer L, Stoyanov V (2019) Roberta: A robustly optimized bert pretraining approach. CoRR arxiv:abs/1907.11692 (2019)

  35. Cao N, Ji S, Chiu DK, Gong M (2022) A deceptive reviews detection model: separated training of multi-feature learning and classification. Expert Syst Appl. https://doi.org/10.1016/j.eswa.2021.115977

    Article  Google Scholar 

Download references

Acknowledgements

This work is supported by the Fundamental Research Funds for the Central Universities (Grant No. 2572019BH03).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Wenfeng Cheng.

Ethics declarations

Conflict of interest

The authors declare that they have no conflicts of interest to report regarding the present study.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Li, S., Cheng, W. Augmenting the global semantic information between words to heterogeneous graph for deception detection. Neural Comput & Applic 34, 19079–19090 (2022). https://doi.org/10.1007/s00521-022-07492-y

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00521-022-07492-y

Keywords

Navigation