Abstract
The goal of Target Sentiment Analysis (TSA) is to predict a user's sentiment towards specific targets in review sentences. However, prediction quality often suffers from the sparsity of training data. Data augmentation is an effective technique for alleviating the impact of imperfect training data by deriving additional samples from the original ones. Unfortunately, there is hardly any data augmentation approach tailored to TSA. To address this problem, we propose a low-cost and effective data augmentation method called SubCrime, which constructs auxiliary sentences in two steps: Substitute and disCriminate. The former substitutes plausible targets into the observed sentences using a masked language model, while the latter filters the restructured sentences with a constrained objective. SubCrime requires neither external knowledge nor tedious manual annotation. We design SubCrime to answer the key counterfactual question: "If the review target in the sentence changed, would its sentiment be different?" Experiments show that SubCrime improves F1 scores by 2 to 4 points on average over methods without augmentation on four datasets. Moreover, SubCrime also outperforms data augmentation methods widely used in other Natural Language Processing (NLP) tasks.
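To make the two-step idea concrete, the following is a minimal, illustrative Python sketch, not the authors' implementation. It approximates the Substitute step with an off-the-shelf Hugging Face fill-mask pipeline and stands in for the paper's constrained discrimination objective with a simple confidence check from a generic sentiment classifier. The model choices, the threshold min_conf, and the helper functions substitute_targets and discriminate are assumptions introduced purely for illustration.

```python
# Illustrative sketch of a substitute-then-discriminate augmentation loop.
# NOTE: the confidence-threshold "discriminator" below is an assumption,
# not the constrained objective described in the paper.
from transformers import pipeline

fill_mask = pipeline("fill-mask", model="bert-base-uncased")
sentiment = pipeline("sentiment-analysis")  # stand-in discriminator

def substitute_targets(sentence: str, target: str, top_k: int = 5):
    """Step 1 (Substitute): mask the review target and let the masked LM propose replacements."""
    masked = sentence.replace(target, fill_mask.tokenizer.mask_token, 1)
    candidates = fill_mask(masked, top_k=top_k)
    # Drop candidates that merely reproduce the original target.
    return [c["token_str"].strip() for c in candidates
            if c["token_str"].strip().lower() != target.lower()]

def discriminate(sentence: str, target: str, label: str, min_conf: float = 0.8):
    """Step 2 (disCriminate): keep only restructured sentences whose predicted
    sentiment still matches the original label with high confidence."""
    kept = []
    for new_target in substitute_targets(sentence, target):
        augmented = sentence.replace(target, new_target, 1)
        pred = sentiment(augmented)[0]
        # Crude label match ("positive" vs. "POSITIVE") plus a confidence floor.
        if pred["label"].lower().startswith(label.lower()[:3]) and pred["score"] >= min_conf:
            kept.append(augmented)
    return kept

if __name__ == "__main__":
    # Hypothetical restaurant-review example.
    print(discriminate("The pizza was absolutely delicious.", "pizza", "positive"))
```

In this sketch the augmented sentences keep the original sentiment label, which mirrors the counterfactual question the paper poses: substitutions that would plausibly flip the sentiment are filtered out by the discrimination step.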
Acknowledgments
This work was supported in part by the Natural Science Foundation of Chongqing, China under Grant cstc2021jcyi-bshX0168, the Intelligent Medical Project of Chongqing Medical University under Grant ZHYXQNRC202101, and the Graduate Research and Innovation Foundation of Chongqing, China (Grant No. CYC21072).
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Chen, W., Wang, L., Du, J., He, Z. (2022). SubCrime: Counterfactual Data Augmentation for Target Sentiment Analysis. In: Pimenidis, E., Angelov, P., Jayne, C., Papaleonidas, A., Aydin, M. (eds) Artificial Neural Networks and Machine Learning – ICANN 2022. ICANN 2022. Lecture Notes in Computer Science, vol 13530. Springer, Cham. https://doi.org/10.1007/978-3-031-15931-2_26
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-15930-5
Online ISBN: 978-3-031-15931-2
eBook Packages: Computer Science, Computer Science (R0)