Skip to main content
Log in

BERT-LSTM model for sarcasm detection in code-mixed social media post

  • Published:
Journal of Intelligent Information Systems Aims and scope Submit manuscript

Abstract

Sarcasm is the acerbic use of words to mock someone or something, mostly in a satirical way. Scandal or mockery is used harshly, often crudely and contemptuously, for destructive purposes in sarcasm. To extract the actual sentiment of a sentence for code-mixed language is complex because of the unavailability of sufficient clues for sarcasm. In this work, we proposed a model consisting of Bidirectional Encoder Representations from Transformers (BERT) stacked with Long Short Term Memory (LSTM) (BERT-LSTM). A pre-trained BERT model is used to create embedding for the code-mixed dataset. These embedding vectors were used by an LSTM network consisting of a single layer to identify the nature of a sentence, i.e., sarcastic or non-sarcastic. The experiments show that the proposed BERT-LSTM model detects sarcastic sentences more effectively compared to other models on the code-mixed dataset, with an improvement of up to 6 % in terms of F1-score.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4

Similar content being viewed by others

Data availability

The dataset is available at https://github.com/sahilswami96/SarcasmDetection_CodeMixed

Notes

  1. https://www.oxfordlearnersdictionaries.com/definition/english/sarcasm.

  2. www.goodreads.com

  3. https://github.com/sloria/TextBlob

  4. https://github.com/rajnish8807riday/Sarcasm_multilingual/

References

Download references

Acknowledgements

The authors acknowledge the dataset creators for their support.

Author information

Authors and Affiliations

Authors

Contributions

Rajnish Pandey and Jyoti Prakash Singh conceived the idea of the BERT embedding and LSTM network to classify the text. The experiments and initial draft were developed by Rajnish Pandey. Jyoti Prakash Singh corrected the initial draft. All authors reviewed the manuscript.

Corresponding author

Correspondence to Jyoti Prakash Singh.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Pandey, R., Singh, J.P. BERT-LSTM model for sarcasm detection in code-mixed social media post. J Intell Inf Syst 60, 235–254 (2023). https://doi.org/10.1007/s10844-022-00755-z

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10844-022-00755-z

Keywords

Navigation