Few-Shot NER in Marine Ecology Using Deep Learning

Wang, Jian; Liu, Ming; Zhao, Danfeng; Shi, Shuai; Song, Wei

doi:10.1007/978-981-99-8145-8_2

Part of the book series: Communications in Computer and Information Science (CCIS, volume 1965)

Included in the following conference series:

International Conference on Neural Information Processing

Abstract

Named entity recognition (NER) in marine ecology faces three challenges: limited domain-specific text, weak semantic representations of input vectors, and the neglect of local features. To address these challenges in a low-resource setting, a deep learning-based few-shot NER model was proposed. First, Sequence Generative Adversarial Nets (SeqGAN) was trained on the original text to generate new text, thereby expanding the original corpus. Next, a BERT-IDCNN-BiLSTM-CRF model was introduced to extract marine ecological entities. BERT (Bidirectional Encoder Representations from Transformers) was pre-trained on the expanded corpus. The embeddings produced by BERT were then fed into Iterated Dilated Convolutional Neural Networks (IDCNN) and Bidirectional Long Short-Term Memory networks (BiLSTM) for feature extraction. Finally, a Conditional Random Field (CRF) enforced label sequence constraints and yielded the final results. The proposed method was compared against the BiLSTM-CRF, IDCNN-CRF, BERT-IDCNN-CRF and BERT-BiLSTM-CRF models, both across models and across the original and expanded corpora. The results show that BERT-IDCNN-BiLSTM-CRF outperforms BERT-BiLSTM-CRF by 2.48 percentage points in F1-score on the original corpus. On the expanded corpus, BERT-IDCNN-BiLSTM-CRF achieves an F1-score 2.65 percentage points higher than on the original corpus. The approach improves entity extraction in the marine ecology domain and lays a foundation for downstream tasks such as constructing marine ecological knowledge graphs and supporting ecological governance.
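
The role of the final CRF layer, enforcing label sequence constraints, can be illustrated with a minimal sketch. It is not the chapter's implementation: the tag set (`O`, `B-ENT`, `I-ENT`), the emission scores, and the helper names `allowed` and `viterbi` are all assumptions made for illustration. A trained CRF learns soft transition scores, whereas this sketch uses a hard allow/forbid rule for BIO tags.

```python
import math

# Hypothetical BIO tag set; the chapter's actual entity types are not given here.
LABELS = ["O", "B-ENT", "I-ENT"]


def allowed(prev, cur):
    """BIO constraint: an I-X tag may only follow B-X or I-X."""
    if cur.startswith("I-"):
        return prev in ("B-" + cur[2:], "I-" + cur[2:])
    return True


def viterbi(emissions, labels=LABELS):
    """Return the highest-scoring label path that respects the BIO constraint.

    emissions: one dict per token, mapping label -> score (made-up numbers
    standing in for the per-token scores of an upstream encoder).
    """
    neg_inf = -math.inf
    # A sequence may not start with an I- tag.
    score = {l: (neg_inf if l.startswith("I-") else emissions[0][l]) for l in labels}
    back = []
    for em in emissions[1:]:
        new_score, pointers = {}, {}
        for cur in labels:
            best_prev, best = None, neg_inf
            for prev in labels:
                if allowed(prev, cur) and score[prev] > best:
                    best_prev, best = prev, score[prev]
            new_score[cur] = best + em[cur] if best_prev is not None else neg_inf
            pointers[cur] = best_prev
        score = new_score
        back.append(pointers)
    # Backtrack from the best final label.
    path = [max(labels, key=lambda l: score[l])]
    for pointers in reversed(back):
        path.append(pointers[path[-1]])
    return path[::-1]


# Illustrative scores: picking the top label per token would give
# I-ENT, I-ENT, O, which is invalid because it starts with an I- tag.
emissions = [
    {"O": 0.1, "B-ENT": 0.2, "I-ENT": 2.0},
    {"O": 0.3, "B-ENT": 0.1, "I-ENT": 1.5},
    {"O": 1.0, "B-ENT": 0.2, "I-ENT": 0.4},
]
greedy = [max(em, key=em.get) for em in emissions]
decoded = viterbi(emissions)
print(greedy)   # ['I-ENT', 'I-ENT', 'O']
print(decoded)  # ['B-ENT', 'I-ENT', 'O']
```

In this toy case the constrained decoder replaces the invalid leading `I-ENT` with `B-ENT`, which is the kind of correction a sequence-level layer can make and a per-token classifier cannot.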

Author information

Authors and Affiliations

College of Information Technology, Shanghai Ocean University, Shanghai, China
Jian Wang, Ming Liu, Danfeng Zhao & Wei Song
College of Electrical Engineering, Shanghai University of Electric Power, Shanghai, China
Shuai Shi

Corresponding author

Correspondence to Wei Song.

Editor information

Editors and Affiliations

School of Automation, Central South University, Changsha, China
Biao Luo
Institute of Automation, Chinese Academy of Sciences, Beijing, China
Long Cheng
Institute of Cyber-Systems and Control, Zhejiang University, Hangzhou, China
Zheng-Guang Wu
School of Automation, Guangdong University of Technology, Guangzhou, China
Hongyi Li
School of Electrical Engineering and Telecommunications, UNSW Sydney, Sydney, NSW, Australia
Chaojie Li

About this paper

Cite this paper

Wang, J., Liu, M., Zhao, D., Shi, S., Song, W. (2024). Few-Shot NER in Marine Ecology Using Deep Learning. In: Luo, B., Cheng, L., Wu, ZG., Li, H., Li, C. (eds) Neural Information Processing. ICONIP 2023. Communications in Computer and Information Science, vol 1965. Springer, Singapore. https://doi.org/10.1007/978-981-99-8145-8_2

DOI: https://doi.org/10.1007/978-981-99-8145-8_2
Published: 27 November 2023
Publisher Name: Springer, Singapore
Print ISBN: 978-981-99-8144-1
Online ISBN: 978-981-99-8145-8
eBook Packages: Computer Science, Computer Science (R0)
