Skip to main content
Log in

Topic enhanced sentiment co-attention BERT

  • Published:
Journal of Intelligent Information Systems Aims and scope Submit manuscript

Abstract

In the sentiment analysis tasks, the idea of introducing external information to improve the prediction performance is sprouting up. The Topic Sentiment Joint Model shows that as the carrier of fine-grained sentiment, topics have an auxiliary effect on sentiment analysis. However, the current research on fine-grained view focuses on recognizing sentiment, but pays little attention to their impact on the overall feeling. To use topic information to assist sentiment analysis, this paper proposes a topic-enhanced sentiment analysis model. Through the multi-task learning framework, the topic information is learned and used to guide the model for sentiment classification. The multi-stage learning strategy guarantees the accuracy of prior knowledge. With the help of the alternating co-attention mechanism, the essential topics and emotional expression are concerned. In this paper, the labels of Chinese bank forum data and restaurant review data are automatically transformed to obtain supervised topic information. The experimental results show that the model has achieved state-of-the-art over the baseline in two datasets. The Kappa coefficient of sentiment prediction has increased by more than 1%. The model also has apparent advantages in interpretability and noise resistance.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8

Similar content being viewed by others

Data availability

The data and source code is available at https://github.com/WithMeteor/TescaBERT

Notes

  1. https://www.datafountain.cn/competitions/529/

References

  • Ali, S.M., Noorian, Z., Bagheri, E., & et al. (2020). Topic and sentiment aware microblog summarization for twitter. Journal of Intelligent Information Systems, 54(1), 129–156. https://doi.org/10.1007/s10844-018-0521-8.

    Article  Google Scholar 

  • Blei, D.M., Ng, A.Y., & Jordan, M.I. (2003). Latent dirichlet allocation. Journal of Machine Learning Research, 3, 993–1022. https://doi.org/10.1162/jmlr.2003.3.4-5.993.

    MATH  Google Scholar 

  • Bu, J., Ren, L., Zheng, S., & et al. (2021). Asap: A chinese review dataset towards aspect category sentiment analysis and rating prediction. arXiv:2103.06605.

  • Cambria, E., Poria, S., Gelbukh, A., & et al. (2018). Sentiment analysis is a big suitcase. Intelligent Systems, IEEE, 32(6), 74–80. https://doi.org/10.1109/MIS.2017.4531228.

    Article  Google Scholar 

  • Chintalapudi, N., Battineni, G., Di Canio, M., & et al. (2021). Text mining with sentiment analysis on seafarers’ medical documents. International Journal of Information Management Data Insights, 1(1), 1–9. https://doi.org/10.1016/j.jjimei.2020.100005.

    Article  Google Scholar 

  • Cho, K., Van Merriënboer, B., Gulcehre, C., & et al. (2014). Learning phrase representations using rnn encoder-decoder for statistical machine translation. arXiv:1406.1078.

  • Cui, Y., Che, W., Liu, T., & et al. (2021). Pre-training with whole word masking for Chinese Bert. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 29, 3504–3514. https://doi.org/10.1109/TASLP.2021.3124365.

    Article  Google Scholar 

  • Davidov, D., Tsur, O., & Rappoport, A. (2010). Enhanced sentiment learning using twitter hashtags and smileys. In Coling 2010: Posters (pp. 241–249).

  • Devlin, J., Chang, M.-W., Lee, K., & et al. (2018). Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv:1810.04805.

  • Fu, X., Sun, X., Wu, H., & et al. (2018). Weakly supervised topic sentiment joint model with word embeddings. Knowledge-Based Systems, 147, 43–54. https://doi.org/10.1016/j.knosys.2018.02.012.

    Article  Google Scholar 

  • Hamouda, A., & Rohaim, M. (2011). Reviews classification using sentiwordnet lexicon. In World congress on computer science and information technology, (Vol. 23 pp. 104–105).

  • Haque, T.U., Saber, N.N., & Shah, F.M. (2018). Sentiment analysis on large scale amazon product reviews. In 2018 IEEE International Conference on Innovative Research and Development (ICIRD). https://doi.org/10.1109/ICIRD.2018.8376299 (pp. 1–6). IEEE.

  • Hofmann, T. (1999). Probabilistic latent semantic indexing. In Proceedings of the 22nd Annual International ACM SIGIR conference on research and development in information retrieval. https://doi.org/10.1145/312624.312649 (pp. 50–57).

  • Johnson, R., & Zhang, T. (2017). Deep pyramid convolutional neural networks for text categorization. In Proceedings of the 55th annual meeting of the association for computational linguistics (Volume 1: Long Papers). https://doi.org/10.18653/v1/P17-1052 (pp. 562–570).

  • Joulin, A., Grave, E., Bojanowski, P., & et al. (2016). Bag of tricks for efficient text classification. arXiv:1607.01759.

  • Ke, P., Ji, H., Liu, S., & et al. (2019). Sentilare: Sentiment-aware language representation learning with linguistic knowledge. arXiv:1911.02493.

  • Khan, F.H., Qamar, U., & Bashir, S. (2016). esap: A decision support framework for enhanced sentiment analysis and polarity classification. Information Sciences, 367, 862–873. https://doi.org/10.1016/j.ins.2016.07.028.

    Article  Google Scholar 

  • Kim, Y. (2014). Convolutional neural networks for sentence classification. Eprint Arxiv. https://doi.org/10.3115/v1/D14-1181.

  • Li, M., Chen, L., Zhao, J., & et al. (2021). Sentiment analysis of chinese stock reviews based on bert model. Applied Intelligence, 51(7), 5016–5024. https://doi.org/10.1007/s10489-020-02101-8.

    Article  Google Scholar 

  • Li, F., Huang, M., & Zhu, X. (2010). Sentiment analysis with global topics and local dependency. In Proceedings of the AAAI conference on artificial intelligence. https://doi.org/10.1609/aaai.v24i1.7523, (Vol. 24 pp. 1371–1376).

  • Li, C., Zhang, J., Sun, J.-T., & et al. (2013). Sentiment topic model with decomposed prior. In Proceedings of the 2013 SIAM international conference on data mining. https://doi.org/10.1137/1.9781611972832.85 (pp. 767–775). SIAM.

  • Li, S., Zhao, Z., Hu, R., & et al. (2018). Analogical reasoning on chinese morphological and semantic relations. arXiv:1805.06504.

  • Lin, C., He, Y., Everson, R., & et al. (2011). Weakly supervised joint sentiment-topic detection from text. IEEE Transactions on Knowledge and Data Engineering, 24(6), 1134–1145. https://doi.org/10.1109/TKDE.2011.48.

    Article  Google Scholar 

  • Liu, P., Qiu, X., Chen, X., & et al. (2015). Multi-timescale long short-term memory neural network for modelling sentences and documents. In Proceedings of the 2015 conference on empirical methods in natural language processing. https://doi.org/10.18653/v1/D15-1280 (pp. 2326–2335).

  • Lu, J., Yang, J., Batra, D., & et al. (2016). Hierarchical co-attention for visual question answering. Advances in Neural Information Processing Systems (NIPS) 2. https://doi.org/10.48550/arXiv.1606.00061.

  • McAuley, J.J., & Leskovec, J. (2013). From amateurs to connoisseurs: modeling the evolution of user expertise through online reviews. In Proceedings of the 22nd international conference on World Wide Web. https://doi.org/10.1145/2488388.2488466(pp. 897–908).

  • Mei, Q., Ling, X., Wondra, M., & et al. (2007). Topic sentiment mixture: modeling facets and opinions in weblogs. In Proceedings of the 16th international conference on World Wide Web. https://doi.org/10.1145/1242572.1242596 (pp. 171–180).

  • Nazir, A., Yuan, R., Wu, L., & et al. (2020). Issues and challenges of aspect-based sentiment analysis: A comprehensive survey. IEEE Transactions on Affective Computing, PP(99), 1–1. https://doi.org/10.1109/TAFFC.2020.2970399.

    Google Scholar 

  • Ohana, B., & Tierney, B. (2009). Sentiment classification of reviews using sentiwordnet. Proceedings of IT&T, 22–23. https://doi.org/10.13140/2.1.4547.0089.

  • Ramaswamy, S.L., & Jayakumar, C. (2022). Recognet-lstm+cnn: a hybrid network with attention mechanism for aspect categorization and sentiment classification. Journal of Intelligent Information Systems, 58(2), 379–404. https://doi.org/10.1007/s10844-021-00692-3.

    Article  Google Scholar 

  • Ren, Y., Wang, R., & Ji, D. (2016). A topic-enhanced word embedding for twitter sentiment classification. Information Sciences, 369, 188–198. https://doi.org/10.1016/j.ins.2016.06.040.

    Article  Google Scholar 

  • Sharma, P., & Moh, T.-S. (2016). Prediction of indian election using sentiment analysis on hindi twitter. In 2016 IEEE international conference on big data (big Data). https://doi.org/10.1109/BigData.2016.7840818 (pp. 1966–1971). IEEE.

  • Singh, L.G., & Singh, S.R. (2021). Empirical study of sentiment analysis tools and techniques on societal topics. Journal of Intelligent Information Systems, 56(2), 379–407. https://doi.org/10.1007/s10844-020-00616-7.

    Article  Google Scholar 

  • Socher, R., Perelygin, A., Wu, J., & et al. (2013). Recursive deep models for semantic compositionality over a sentiment treebank. In Proceedings of the 2013 conference on empirical methods in natural language processing (pp. 1631–1642).

  • Sun, Y., Wang, S., Li, Y., & et al. (2019). Ernie: Enhanced representation through knowledge integration. arXiv:1904.09223.

  • Tai, K.S., Socher, R., & Manning, C.D. (2015). Improved semantic representations from tree-structured long short-term memory networks. arXiv:1503.00075.

  • Tian, H., Gao, C., Xiao, X., & et al. (2020). Skep: Sentiment knowledge enhanced pre-training for sentiment analysis. arXiv:2005.05635.

  • Wang, B. (2018). Disconnected recurrent neural networks for text categorization. In Proceedings of the 56th annual meeting of the association for computational linguistics (Volume 1: Long Papers). https://doi.org/10.18653/v1/P18-1215 (pp. 2311–2320).

  • Wang, Y., Huang, M., Zhu, X., & et al. (2016). Attention-based lstm for aspect-level sentiment classification. In Proceedings of the 2016 conference on empirical methods in natural language processing. https://doi.org/10.18653/v1/D16-1058 (pp. 606–615).

  • Wang, R., Li, Z., Cao, J., & et al. (2019). Convolutional recurrent neural networks for text classification. In 2019 international joint conference on neural networks (IJCNN). https://doi.org/10.1109/IJCNN.2019.8852406 (pp. 1–6). IEEE.

  • Wang, X., Wei, F., Liu, X., & et al. (2011). Topic sentiment analysis in twitter: a graph-based hashtag sentiment classification approach. In Proceedings of the 20th ACM international conference on information and knowledge management. https://doi.org/10.1145/2063576.2063726 (pp. 1031–1040).

  • Wu, D., Yang, R.X., & Shen, C. (2021). Sentiment word co-occurrence and knowledge pair feature extraction based lda short text clustering algorithm. Journal of Intelligent Information Systems, 56(1), 1–23. https://doi.org/10.1007/s10844-020-00597-7.

    Article  Google Scholar 

  • Xuan, J., Lu, J., Zhang, G., & et al. (2017). Nonparametric relational topic models through dependent gamma processes. IEEE Transactions on Knowledge & Data Engineering, 29(7), 1357–1369. https://doi.org/10.1109/TKDE.2016.2636182.

    Article  Google Scholar 

  • Yadav, A., & Vishwakarma, D.K. (2020). Sentiment analysis using deep learning architectures: a review. Artificial Intelligence Review, 53(6), 4335–4385. https://doi.org/10.1007/s10462-019-09794-5.

    Article  Google Scholar 

  • Yang, Z., Yang, D., Dyer, C., & et al. (2016). Hierarchical attention networks for document classification. In Proceedings of the 2016 Conference of the North American chapter of the association for computational linguistics: Human language technologies. https://doi.org/10.18653/v1/N16-1174 (pp. 1480–1489).

  • Yang, C., Zhang, H., Jiang, B., & et al. (2019). Aspect-based sentiment analysis with alternating coattention networks. Information Processing & Management, 56(3), 463–478. https://doi.org/10.1016/j.ipm.2018.12.004.

    Article  Google Scholar 

  • Yin, D., Meng, T., & Chang, K.-W. (2020). Sentibert: A transferable transformer-based architecture for compositional sentiment semantics. arXiv:2005.04114.

  • Yue, L., Chen, W., Li, X., & et al. (2019). A survey of sentiment analysis in social media. Knowledge and Information Systems, 60(2), 617–663. https://doi.org/10.1007/s10115-018-1236-4.

    Article  Google Scholar 

Download references

Acknowledgements

This research is supported by the Science and Technology Research Program of the Department of Science and Technology of Henan Province (approval No.: 222102210081)

Author information

Authors and Affiliations

Authors

Contributions

Shiyu Wang: Data curation, Writing- Original draft preparation, Software. Gang Zhou: Conceptualization, Methodology, Validation. Jicang Lu: Writing - Reviewing and Editing, Investigation, Supervision. Jing Chen: Funding acquisition, Investigation, Supervision. Yi Xia: Formal analysis, Visualization.

Corresponding author

Correspondence to Gang Zhou.

Ethics declarations

Ethics approval and consent to participate

Not Applicable.

Consent for publication

Not Applicable.

Competing interests

The authors declare no competing interests.

Conflict of interests

The authors declare that they have no conflicts of interest.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Jicang Lu and Jing Chen contributed equally to this work.

Appendix 1: Training Algorithm

Appendix 1: Training Algorithm

The algorithm for training TescaBERT is described as Algorithm 1.

Algorithm 1
figure a

Training of TescaBERT.

Rights and permissions

Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Wang, S., Zhou, G., Lu, J. et al. Topic enhanced sentiment co-attention BERT. J Intell Inf Syst 60, 175–197 (2023). https://doi.org/10.1007/s10844-022-00749-x

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10844-022-00749-x

Keywords

Navigation