Fake reviews classification using deep learning ensemble of shallow convolutions

Javed, Muhammad Saad; Majeed, Hammad; Mujtaba, Hasan; Beg, Mirza Omer

doi:10.1007/s42001-021-00114-y

Fake reviews classification using deep learning ensemble of shallow convolutions

Research Article
Published: 27 March 2021

Volume 4, pages 883–902, (2021)
Cite this article

Journal of Computational Social Science Aims and scope Submit manuscript

Muhammad Saad Javed¹,
Hammad Majeed¹,
Hasan Mujtaba¹ &
…
Mirza Omer Beg ORCID: orcid.org/0000-0001-5789-2933¹

923 Accesses
23 Citations
Explore all metrics

Abstract

Online reviews have a decisive impact on consumers’ purchasing decisions. This opens the doors for spammers and scammers to post fake reviews for promoting non-existent products or undermine competitor products to affect social behavior. Thus, the identification of reviews as fake and real has become ever more important. Traditional approaches for text classification use a bag-of-words model to represent text which causes sparsity and word representations learnt from neural networks with limited ability to handle unknown words. In this paper, we propose a technique based on three different models trained on the idea of a multi-view learning technique and create an ensemble of all models by employing an aggregation technique for generating final predictions. The core idea of our methodology is to extract rich information from the text of reviews by combining bag-of-n-grams and parallel convolution neural networks(CNNs). By using an n-gram embedding layer with small kernel sizes we can use local context with the same computation power as required to train deep and complex CNNs. Our CNN-based architecture consumes n-gram embeddings as input and uses the parallel convolutional blocks to extract richer feature representations from text. Our approach for the detection of fake reviews also combines textual linguistic features and non-textual features related to reviewer behavior. We evaluate our approach on publically available Yelp Filtered Dataset and achieve F1 scores of up to 92% for classifying fake reviews.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Factitious or fact? Learning textual representations for fake online review detection

Article 28 September 2023

Fake consumer review detection using deep neural networks integrating word embeddings and emotion mining

Article 01 February 2020

Unmasking deception: a CNN and adaptive PSO approach to detecting fake online reviews

Article 03 June 2023

Notes

References

Anwar, T., & Baig, O. (2020). Tac at semeval-2020 task 12: Ensembling approach for multilingual offensive language identification in social media. In: Proceedings of the Fourteenth Workshop on Semantic Evaluation, pp 2177–2182.
Asad, M., Asim, M., Javed, T., Beg, M. O., Mujtaba, H., & Abbas, S. (2020). Deepdetect: detection of distributed denial of service attacks using deep learning. The Computer Journal, 63(7), 983–994.
Article Google Scholar
Awan, M. N., & Beg, M. O. (2020). Top-rank: A topicalpostionrank for extraction and classification of keyphrases in text. Computer Speech & Language, p 101116.
Bengio, Y., Ducharme, R., Vincent, P., & Jauvin, C. (2003). A neural probabilistic language model. Journal of Machine Learning Research, 3(Feb), 1137–1155.
Google Scholar
Cardoso, Emerson F., Silva, Renato M., & Almeida, Tiago A. (2018). Towards automatic filtering of fake reviews. Neurocomputing, 309, 106–116.
Article Google Scholar
Deng, H., Zhao, L., Luo, N., Liu, Y., Guo, G., Wang, X., Tan, Z., Wang, S., & Zhou, F. (2017). Semi-supervised learning based fake review detection. In: 2017 IEEE International Symposium on Parallel and Distributed Processing with Applications and 2017 IEEE International Conference on Ubiquitous Computing and Communications (ISPA/IUCC), pp. 1278–1280.
Dilawar, N., Majeed, H., Beg, M. O., Ejaz, N., Muhammad, K., Mehmood, I., & Nam, Y. (2018). Understanding citizen issues through reviews: A step towards data informed planning in smart cities. Applied Sciences, 8(9), 1589.
Article Google Scholar
Grave, E., Mikolov, T., Joulin, A., & Bojanowski, P. (2017). Bag of tricks for efficient text classification. In: Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, EACL.
Hovy, D. (2016). The enemy in your own camp: How well can we detect statistically-generated fake reviews–an adversarial study. In: Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), volume 2, pp. 351–356.
Javed, A. R., Beg, M. O., Asim, M., Baker, T., & Al-Bayatti, A. Hi. (2020). Alphalogger: Detecting motion-based side-channel attack using smartphone keystrokes. Journal of Ambient Intelligence and Humanized Computing, pp 1–14.
Jia, S., Zhang, X., Wang, X., & Liu, Y. (2018). Fake reviews detection based on lda. In: 2018 4th International Conference on Information Management (ICIM), pp 280–283. IEEE.
Jindal, N., & Liu, B. (2008). Opinion spam and analysis. In: Proceedings of the 2008 international conference on web search and data mining, pp. 219–230.
Khawaja, H. S., Beg, M. O., & Qamar, S. (2018). Domain specific emotion lexicon expansion. In: 2018 14th International Conference on Emerging Technologies (ICET), pp 1–5.
Majeed, A., Mujtaba, H., & Beg, M. O. (2020). Emotion detection in roman urdu text using machine learning. In: Proceedings of the 35th IEEE/ACM International Conference on Automated Software Engineering Workshops, pp. 125–130.
Mikolov, T., Chen, K., Corrado, G., & Dean, J. (2013). Efficient estimation of word representations in vector space. arXiv preprint arXiv:1301.3781.
Mukherjee, A., Liu, B., Wang, J., Glance, N., & Jindal, N. (2011). Detecting group review spam. In: Proceedings of the 20th international conference companion on World wide web, pp 93–94.
Mukherjee, A., Venkataraman, V., Liu, B., & Glance, N. (2013). What yelp fake review filter might be doing? In: Seventh international AAAI conference on weblogs and social media.
Naeem, B., Khan, A., Beg, M. O., & Mujtaba, H. (2020). A deep learning framework for clickbait detection on social area network using natural language cues. Journal of Computational Social Science, pp. 1–13.
Narayan, Rohit, Rout, Jitendra Kumar, & Jena, Sanjay Kumar. (2018). Review spam detection using semi-supervised technique. Progress in Intelligent Computing Techniques: Theory, Practice, and Applications (pp. 281–286). Berlin: Springer.
Chapter Google Scholar
Ott, M., Choi, Y., Cardie, C., & Hancock, J. T. (2011). Finding deceptive opinion spam by any stretch of the imagination. In: Proceedings of the 49th annual meeting of the association for computational linguistics: Human language technologies-volume 1, Association for Computational Linguistics, pp. 309–319
Peters, M.E., Neumann, M., Iyyer, M., Gardner, M., Clark, C., Lee, K., & Zettlemoyer, L. (2018). Deep contextualized word representations. In: Proc. of NAACL.
Qamar, S., Mujtaba, H., Majeed, H., & Beg, M. O. (2021). Relationship identification between conversational agents using emotion analysis. Cognitive Computation, pp. 1–15.
Ren, Y., & Zhang, Y. (2016). Deceptive opinion spam detection using neural network. In: Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers, pp. 140–150.
Rout, J. K., Dalmia, A., Choo, K.-K. R., Bakshi, S., & Jena, S. K. (2017). Revisiting semi-supervised learning for online deceptive review detection. IEEE Access, 5, 1319–1327.
Article Google Scholar
Sahar, H., Bangash, A. A., & Beg, M. O. (2019). Towards energy aware object-oriented development of android applications. Sustainable Computing: Informatics and Systems, 21, 28–46.
Google Scholar
Socher, R., Huang, E. H., Pennington, J., Ng, A. Y., & Manning, C. D. (2011). Dynamic pooling and unfolding recursive autoencoders for paraphrase detection. In: Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems (pp. 801–809). Spain.: Granada.
Speer, R., Chin, J., & Havasi, C. (2017). Conceptnet 5.5: An open multilingual graph of general knowledge. In: Thirty-First AAAI Conference on Artificial Intelligence.
Tariq, M., Majeed, H., Beg, M. O., Khan, F. A., & Derhab, A. (2019). Accurate detection of sitting posture activities in a secure IOT based assisted living environment. Future Generation Computer Systems, 92, 745–757.
Article Google Scholar
Uzair, A., Beg, M. O., Mujtaba, H., & Majeed, H. (2019). Weec: Web energy efficient computing: A machine learning approach. Sustainable Computing: Informatics and Systems, 22, 230–243.
Google Scholar
Wang, S., Manning, C. D. (2012). Baselines and bigrams: Simple, good sentiment and topic classification. In Proceedings of the 50th annual meeting of the association for computational linguistics: Short papers-volume 2, pp 90–94. Association for Computational Linguistics.
Wang, X., Liu, K., He, S., & Zhao, J. (2016). Learning to represent review with tensor decomposition for spam detection. In: Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, pp. 866–875.
Wang, X., Liu, K., & Zhao, J. (2017). Handling cold-start problem in review spam detection by jointly embedding texts and behaviors. In: Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp. 366–376.
Ye, J., Kumar, S., & Akoglu, L. (2016). Temporal opinion spam detection by multivariate indicative signals. In: Tenth International AAAI Conference on Web and Social Media.
Zahid, R., Idrees, M. O., Mujtaba, H., & Beg, M. O. (2020). Roman urdu reviews dataset for aspect based opinion mining. In: 2020 35th IEEE/ACM International Conference on Automated Software Engineering Workshops (ASEW), pp. 138–143.

Download references

Author information

Authors and Affiliations

National University of Computer and Emerging Sciences, A.K Brohi Road, Sector H-11/4, Islamabad, Pakistan
Muhammad Saad Javed, Hammad Majeed, Hasan Mujtaba & Mirza Omer Beg

Authors

Muhammad Saad Javed
View author publications
You can also search for this author in PubMed Google Scholar
Hammad Majeed
View author publications
You can also search for this author in PubMed Google Scholar
Hasan Mujtaba
View author publications
You can also search for this author in PubMed Google Scholar
Mirza Omer Beg
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Mirza Omer Beg.

Ethics declarations

Conflict of interest

On behalf of all authors, the corresponding author states that there is no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Javed, M.S., Majeed, H., Mujtaba, H. et al. Fake reviews classification using deep learning ensemble of shallow convolutions. J Comput Soc Sc 4, 883–902 (2021). https://doi.org/10.1007/s42001-021-00114-y

Download citation

Received: 18 October 2020
Accepted: 17 March 2021
Published: 27 March 2021
Issue Date: November 2021
DOI: https://doi.org/10.1007/s42001-021-00114-y

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fake reviews classification using deep learning ensemble of shallow convolutions

Abstract

Access this article

Similar content being viewed by others

Factitious or fact? Learning textual representations for fake online review detection

Fake consumer review detection using deep neural networks integrating word embeddings and emotion mining

Unmasking deception: a CNN and adaptive PSO approach to detecting fake online reviews

Notes

References

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Fake reviews classification using deep learning ensemble of shallow convolutions

Abstract

Access this article

Similar content being viewed by others

Factitious or fact? Learning textual representations for fake online review detection

Fake consumer review detection using deep neural networks integrating word embeddings and emotion mining

Unmasking deception: a CNN and adaptive PSO approach to detecting fake online reviews

Notes

References

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation