Few-shot learning for name entity recognition in geological text based on GeoBERT

Liu, Hao; Qiu, Qinjun; Wu, Liang; Li, Wenjia; Wang, Bin; Zhou, Yuan

doi:10.1007/s12145-022-00775-x

Few-shot learning for name entity recognition in geological text based on GeoBERT

Research Article
Published: 11 March 2022

Volume 15, pages 979–991, (2022)
Cite this article

Earth Science Informatics Aims and scope Submit manuscript

Hao Liu¹,
Qinjun Qiu^1,2,3,
Liang Wu ORCID: orcid.org/0000-0002-1304-6353^1,2,
Wenjia Li¹,
Bin Wang¹ &
…
Yuan Zhou²

1420 Accesses
15 Citations
Explore all metrics

Abstract

Geological reports are records of the geological elements and survey contents found in geological exploration, but it is difficult to extract useful concepts from such reports. In the process of information extraction, accurately identification of entities in unstructured geotext is a foundational task that is known as geological named entity recognition (Geo-NER). However, the existing methods generally require a large number of annotated corpora, and face problems with long entity recognition. Therefore, this paper proposes a two-stage fine-tuning method. In the first fine-tuning stage, we use a bidirectional encoder representations from transformers language model with geological domain knowledge (GeoBERT), which combines geological domain knowledge, on a pretrained BERT model, and in the second stage, we use a small number of samples to complete the NER task in the geological report based on GeoBERT. Our proposed model achieves a very high F1-score compared to baseline models on the constructed dataset.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Less is More: A Prototypical Framework for Efficient Few-Shot Named Entity Recognition

KSRE-CNER: A Knowledge and Semantic Relation Enhancement Framework for Chinese NER

Context Enhanced and Data Augmented $$\mathrm W^{2}NER$$ System for Named Entity Recognition

References

Akkasi A, Varoglu E (2017) Improving biochemical named entity recognition using PSO classifier selection and Bayesian combination methods. IEEE/ACM Transactions on Computational Biology and Bioinformatics 14(6):1327–1338. https://doi.org/10.1109/TCBB.2016.2570216
Article Google Scholar
Atkinson J, Bull V (2012) A multi-strategy approach to biological named entity recognition. Expert Syst Appl 39(17):12968–12974. https://doi.org/10.1016/j.eswa.2012.05.033
Article Google Scholar
Bao Y, Wu M, Chang S, Barzilay R (2019) Few-shot text classification with distributional signatures. ArXiv. http://arxiv.org/abs/1908.06039
Chu D, Wan B, Li H, Fang F, Wang R (2020) Geological entity recognition based on ELMO-CNN-BiLSTM-CRF model. Earth Sci:1–22
Dai Z, Wang X, Ni P, Li Y, Li G, Bai X (2019) Named entity recognition using BERT BiLSTM CRF for Chinese electronic health records. In: 2019 12th international congress on image and signal processing, BioMedical engineering and informatics (CISP-BMEI), pp 1–5. https://doi.org/10.1109/CISP-BMEI48845.2019.8965823
Chapter Google Scholar
Devlin J, Chang MW, Lee K, Toutanova K (2018) BERT: pre-training of deep bidirectional transformers for language understanding. ArXiv. https://arxiv.org/abs/1810.04805v2
Fan R, Wang L, Yan J, Song W, Zhu Y, Chen X (2019) Deep learning-based named entity recognition and knowledge graph construction for geological hazards. ISPRS Int J Geo Inf 9(1):15. https://doi.org/10.3390/ijgi9010015
Article Google Scholar
Fu R, Qin B, Liu T (2014) Generating Chinese named entity data from parallel corpora. Front Comput Sci 8(4):629–641. https://doi.org/10.1007/s11704-014-3127-5
Article Google Scholar
Hofer M, Kormilitzin A, Goldberg P, Nevado-Holgado A (2018) Few-shot learning for named entity recognition in medical text. ArXiv. http://arxiv.org/abs/1811.05468
Howard J, Ruder S (2018) Universal language model fine-tuning for text classification. ArXiv. http://arxiv.org/abs/1801.06146
Huang C, Li Y, Zhu X (2006) Tokenization Guidelines of 67
Huang Z, Xu W, Yu K (2015) Bidirectional LSTM-CRF models for sequence tagging. ArXiv:1508.01991 [Cs]. http://arxiv.org/abs/1508.01991
Ji B, Liu R, Xu WS, Li SS, Tang JT, Yu J, Li Q (2018) A BILSTM-CRF method to Chinese electronic medical record named entity recognition. In: ACM international conference proceeding series, pp 1–6. https://doi.org/10.1145/3302425.3302465
Chapter Google Scholar
Ju Z, Wang J, Zhu F (2011) Named entity recognition from biomedical text using SVM. In: 5th international conference on bioinformatics and biomedical engineering, ICBBE 2011, pp 1–4. https://doi.org/10.1109/icbbe.2011.5779984
Chapter Google Scholar
Liu W, Yu B, Zhang C, Wang H, Pan K (2018) Chinese named entity recognition based on rules and conditional random field. In: ACM international conference proceeding series, pp 268–272. https://doi.org/10.1145/3297156.3297196
Chapter Google Scholar
Liu H, Jun G, Zheng Y (2021) Chinese named entity recognition model based on BERT. MATEC Web of Conferences 336:06021. https://doi.org/10.1051/MATECCONF/202133606021
Article Google Scholar
Luo L, Yang Z, Yang P, Zhang Y, Wang L, Lin H, Wang J (2018) An attention-based BiLSTM-CRF approach to document-level chemical named entity recognition. Bioinformatics 34(8):1381–1388. https://doi.org/10.1093/bioinformatics/btx761
Article Google Scholar
Mikolov T, Chen K, Corrado G, Dean J (2013) Efficient estimation of word representations in vector space. In: 1st international conference on learning representations, ICLR 2013 - workshop track proceedings http://arxiv.org/abs/1301.3781
Google Scholar
Naacl A (2019) Few-shot text classification with induction network. Naacl:1–10
Nadeau D, Sekine S (2007) A survey of named entity recognition and classification. International Journal of Linguistics and Language Resources Lingvisticæ Investigationes 30(1):3–26. https://doi.org/10.1075/li.30.1.03nad
Article Google Scholar
Pauls A, Klein D (2011) Faster and smaller n-gram language models. In: ACL-HLT 2011 - proceedings of the 49th annual meeting of the association for computational linguistics: human language technologies (Vol. 1) http://code.google.com/p/berkeleylm/
Google Scholar
Qiu Q, Xie Z, Wu L, Tao L (2019a) GNER: a generative model for geological named entity recognition without labeled data using deep learning. Earth and Space Science 6(6):931–946. https://doi.org/10.1029/2019EA000610
Article Google Scholar
Qiu Q, Xie Z, Wu L, Tao L, Li W (2019b) BiLSTM-CRF for geological named entity recognition from the geoscience literature. Earth Sci Inf 12(4):565–579. https://doi.org/10.1007/s12145-019-00390-3
Article Google Scholar
Ratinov L, Roth D (2009) Design challenges and misconceptions in named entity recognition. In: CoNLL 2009 - proceedings of the thirteenth conference on computational natural language learning, pp 147–155. https://doi.org/10.3115/1596374.1596399
Chapter Google Scholar
Santoro A, Bartunov S, Botvinick M, Wierstra D, Lillicrap T (2016) Meta-learning with memory-augmented neural networks. In: International conference on machine learning, pp 1842–1850
Google Scholar
Shen Y, Yun H, Lipton ZC, Kronrod Y, Anandkumar A (2018) Deep active learning for named entity recognition. In: 6th international conference on learning representations, ICLR 2018 - conference track proceedings http://arxiv.org/abs/1707.05928
Google Scholar
Singh A, Thakur N, Sharma A (2016) A review of supervised machine learning algorithms. In: Proceedings of the 10th INDIACom; 2016 3rd international conference on computing for sustainable global development, INDIACom 2016, pp 1310–1315
Google Scholar
Sobhana N, Mitra P, Ghosh SK (2010) Conditional random field based named entity recognition in geological text. Int J Comput Appl 1(3):143–147. https://doi.org/10.5120/72-166
Article Google Scholar
Strubell E, Verga P, Belanger D, McCallum A (2017) Fast and accurate entity recognition with iterated dilated convolutions. ArXiv. http://arxiv.org/abs/1702.02098
Tang S, Zhang N, Zhang J, Wu F, Zhuang Y (2017) NITE: a neural inductive teaching framework for domain-specific NER. In: EMNLP 2017 - conference on empirical methods in natural language processing, proceedings, pp 2652–2657. https://doi.org/10.18653/v1/d17-1280
Chapter Google Scholar
Wang YX, Hebert M (2016) Learning to learn: model regression networks for easy small sample learning. In: Lecture notes in computer science (including subseries lecture notes in artificial intelligence and lecture notes in bioinformatics), 9910 LNCS, pp 616–634. https://doi.org/10.1007/978-3-319-46466-4_37
Chapter Google Scholar
Wang C, Ma X, Chen J, Chen J (2018a) Information extraction and knowledge graph construction from geoscience literature. Comput Geosci 112:112–120. https://doi.org/10.1016/j.cageo.2017.12.007
Article Google Scholar
Wang S, Zhang X, Ye P, Du M (2018b) Deep Belief Networks Based Toponym Recognition for Chinese Text. ISPRS International Journal of Geo-Information 2018 7(6):217. https://doi.org/10.3390/IJGI7060217
Article Google Scholar
Wang Y, Yao Q, Kwok JT, Ni LM (2019) Generalizing from a few examples: a survey on few-shot learning. ACM Comput Surv 53(3). https://doi.org/10.1145/3386252
Wang J, Hu Y, Joseph K (2020a) NeuroTPR: a neuro-net toponym recognition model for extracting locations from social media messages. Trans GIS 24(3):719–735. https://doi.org/10.1111/TGIS.12627
Article Google Scholar
Wang, Yu, Sun Y, Ma Z, Gao L, Xu Y (2020b) An ERNIE-based joint model for Chinese named entity recognition. Appl Sci 10(16):5711. https://doi.org/10.3390/APP10165711
Article Google Scholar
Wu L, Liu L, Li H, Gao Y (2017) A Chinese toponym recognition method based on conditional random field. Wuhan Daxue Xuebao (Xinxi Kexue Ban)/Geomatics and Information Science of Wuhan University 42(2):150–156. https://doi.org/10.13203/J.WHUGIS20141009
Article Google Scholar
Xiang X, Shi X, Applications H. Z.-C., & 2005, U. (2005) Chinese named entity recognition system using statistics-based and rules-based method. En.Cnki.Com.Cn
Yang SM, Yoo SY, Jeong OR (2020) DeNERT-KG: named entity and relation extraction model using DQN, knowledge graph, and BERT. Appl Sci 10(18):6429. https://doi.org/10.3390/APP10186429
Article Google Scholar
Zaremba W, Sutskever I, Vinyals O (2014) Recurrent neural network regularization. ArXiv:1409.2329 [Cs]. http://arxiv.org/abs/1409.2329
Zhang X, Ye P, Wang S, Du M (2018) Geological entity recognition method based on deep belief networks. Yanshi Xuebao/Acta Petrologica Sinica 34(2):343–351
Google Scholar
Zhou P, Shi W, Tian J, Qi Z, Li B, Hao H, Xu B (2016) Attention-based bidirectional long short-term memory networks for relation classification. In: 54th annual meeting of the Association for Computational Linguistics, ACL 2016 - short papers, pp 207–212. https://doi.org/10.18653/v1/p16-2034
Chapter Google Scholar
Zhou F, Cao C, Zhong T, Geng J (2021) Learning meta-knowledge for few-shot image emotion recognition. Expert Syst Appl 168:114274. https://doi.org/10.1016/j.eswa.2020.114274
Article Google Scholar

Download references

Acknowledgments

This study was supported by the National Science Foundation of China (Grant No. 41871311, 42050101), Open Research Project of The Hubei Key Laboratory of Intelligent Geo-Information Processing (No. KLIGIP-2021A01), and the Fundamental Research Funds for the Central Universities, China University of Geosciences (Wuhan) (No. CUG2106116)). The authors thank the Development and Research Center of the China Geological Survey for providing technical support. We thank the National Engineering Research Center of Geographic Information System for providing hardware support.

Author information

Authors and Affiliations

School of Geography and Information Engineering, China University of Geosciences, Wuhan, 430074, China
Hao Liu, Qinjun Qiu, Liang Wu, Wenjia Li & Bin Wang
National Engineering Research Center of Geographic Information System, Wuhan, 430074, China
Qinjun Qiu, Liang Wu & Yuan Zhou
Hubei Key Laboratory of Intelligent Geo-Information Processing, China University of Geosciences, Wuhan, 430074, China
Qinjun Qiu

Authors

Hao Liu
View author publications
You can also search for this author in PubMed Google Scholar
Qinjun Qiu
View author publications
You can also search for this author in PubMed Google Scholar
Liang Wu
View author publications
You can also search for this author in PubMed Google Scholar
Wenjia Li
View author publications
You can also search for this author in PubMed Google Scholar
Bin Wang
View author publications
You can also search for this author in PubMed Google Scholar
Yuan Zhou
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Liang Wu.

Ethics declarations

Declaration of interests

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Additional information

Communicated by: H. Babaie

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

• The study proposes a two-stage fine-tuning method for name entity recognition.

• Considering the problem of small sample datasets and long entity identification.

• Capturing long-distance dependency features within longer geological entities.

• The method has achieved better performances compared to other models.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Liu, H., Qiu, Q., Wu, L. et al. Few-shot learning for name entity recognition in geological text based on GeoBERT. Earth Sci Inform 15, 979–991 (2022). https://doi.org/10.1007/s12145-022-00775-x

Download citation

Received: 11 July 2021
Accepted: 29 January 2022
Published: 11 March 2022
Issue Date: June 2022
DOI: https://doi.org/10.1007/s12145-022-00775-x

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Few-shot learning for name entity recognition in geological text based on GeoBERT

Abstract

Access this article

Similar content being viewed by others

Less is More: A Prototypical Framework for Efficient Few-Shot Named Entity Recognition

KSRE-CNER: A Knowledge and Semantic Relation Enhancement Framework for Chinese NER

Context Enhanced and Data Augmented $$\mathrm W^{2}NER$$ System for Named Entity Recognition

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Declaration of interests

Additional information

Publisher’s note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Few-shot learning for name entity recognition in geological text based on GeoBERT

Abstract

Access this article

Similar content being viewed by others

Less is More: A Prototypical Framework for Efficient Few-Shot Named Entity Recognition

KSRE-CNER: A Knowledge and Semantic Relation Enhancement Framework for Chinese NER

Context Enhanced and Data Augmented $$\mathrm W^{2}NER$$ System for Named Entity Recognition

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Declaration of interests

Additional information

Publisher’s note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation