
Few-shot relation classification based on the BERT model, hybrid attention and fusion networks

Published in: Applied Intelligence

Abstract

Relation classification (RC) is an essential task in information extraction. Distant supervision (DS) can exploit large amounts of unlabeled data and so alleviate the shortage of training data for RC; however, it suffers from long-tail and noise problems, which few-shot learning (FSL) can intuitively address. Our work aims to improve both the accuracy and the convergence speed of few-shot RC. We believe entity pairs play an essential role in this task, so we propose a new context encoder, built on the bidirectional encoder representations from transformers (BERT) model, that fuses entity pairs and their dependency information within each instance. We also design a hybrid attention mechanism comprising support instance-level and query instance-level attention. Support instance-level attention dynamically assigns a weight to each instance in the support set, remedying a shortcoming of prototypical networks, which weight all sentences equally. Query instance-level attention dynamically weights query instances by their similarity to the prototype. An ablation study confirms the effectiveness of the proposed components. In addition, a fusion network replaces the Euclidean distance matching of previous works during class matching, speeding up convergence and making the model more suitable for industrial applications. Experimental results show that the proposed model achieves higher accuracy than several competing models.
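The support instance-level attention described above can be illustrated with a minimal sketch. This is not the paper's implementation (which encodes instances with BERT and matches classes with a learned fusion network); it assumes plain dot-product attention scores, precomputed embeddings, and hypothetical function names, with ordinary Euclidean matching standing in for the fusion network:

```python
import numpy as np

def softmax(x):
    # numerically stable softmax over a 1-D score vector
    e = np.exp(x - x.max())
    return e / e.sum()

def attentive_prototype(support, query):
    # support: (K, d) embeddings of one class's K support instances
    # query:   (d,) embedding of the query instance
    # Score each support instance against the query, then form a
    # weighted prototype instead of the plain mean used by vanilla
    # prototypical networks.
    scores = support @ query          # (K,) dot-product attention scores
    weights = softmax(scores)         # support instance-level attention
    return weights @ support          # (d,) attention-weighted prototype

def classify(supports, query):
    # supports: list of (K, d) arrays, one per class
    # Returns the index of the class whose prototype is nearest
    # to the query (Euclidean matching, as in prototypical networks).
    protos = np.stack([attentive_prototype(s, query) for s in supports])
    dists = np.linalg.norm(protos - query, axis=1)
    return int(np.argmin(dists))
```

With this weighting, support instances that resemble the query pull the prototype toward themselves, which is what compensates for the equal-weight averaging of standard prototypical networks.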



Data availability

Data openly available in a public repository.

Notes

  1. https://github.com/thunlp/FewRel


Acknowledgements

This paper was supported by the Exercise and Health Laboratory of the Institute of Intelligent Machinery, Chinese Academy of Sciences. Thanks to Tsinghua University for developing the FewRel dataset. We would like to thank the anonymous reviewers for their helpful comments.

Funding

This research was supported by the National Key Research and Development Program of China (2022YFC2010200, 2020YFC2005603), the National Natural Science Foundation of China (NSFC) (grant number 61701482), the Key Projects of the Natural Science Foundation of Universities in Anhui Province (grant number KJ2020A0112), the Major Special Projects of Anhui Province (grant number 202103a07020004), the Natural Science Foundation of Anhui Province, China (grant number 1808085MF191), the Education Research Project of Anhui Province, China (grant number 2020jyxm1573), and the High-level Talents Research Start-up Fund of Hefei Normal University (grant number 2020rcjj45). In addition, we would like to thank the anonymous reviewers who have helped to improve the paper.

Author information

Authors and Affiliations

Authors

Corresponding authors

Correspondence to Zenghui Ding or Zuchang Ma.

Ethics declarations

Conflict of interest

We declare that we have no financial or personal relationships with other people or organizations that could inappropriately influence our work, and no professional or other personal interest of any nature or kind in any product, service, and/or company that could be construed as influencing the position presented in, or the review of, this manuscript.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.


About this article


Cite this article

Li, Y., Ding, Z., Ma, Z. et al. Few-shot relation classification based on the BERT model, hybrid attention and fusion networks. Appl Intell 53, 21448–21464 (2023). https://doi.org/10.1007/s10489-023-04634-0

