DOI: 10.1145/3640771.3643045
research-article

Research on Named Entity Recognition in the Steel Industry Based on MacBERT

Published: 29 March 2024

ABSTRACT

To address the lack of large-scale, high-quality named entity recognition (NER) datasets and of NER research in the steel industry, this paper constructs a steel-industry NER dataset of 4835 data items annotated with four entity categories: device, material, process, and product. An NER method based on the MacBERT_large-BiLSTM-CRF model is then built. The method first uses the MacBERT model to generate semantically rich dynamic word vectors, then feeds these vectors into a BiLSTM network to obtain global features, and finally uses a CRF model to impose constraints on the predicted labels so that the generated label sequences are valid. Compared with three other models, the proposed model achieved a precision of 90.01%, a recall of 91.02%, and an F1 value of 90.51%, outperforming all three.
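As a quick consistency check on the reported metrics, the F1 value can be recomputed from the stated precision and recall. The sketch below assumes the standard definition of F1 as the harmonic mean of precision and recall; the function name `f1_score` is illustrative and does not come from the paper.

```python
def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall (same units as the inputs)."""
    if precision + recall == 0:
        return 0.0  # avoid division by zero when both metrics are zero
    return 2 * precision * recall / (precision + recall)


# Figures reported in the abstract, in percent.
precision, recall = 90.01, 91.02
f1 = f1_score(precision, recall)
print(f"F1 = {f1:.2f}%")  # prints "F1 = 90.51%"
```

The recomputed value agrees with the 90.51% given in the abstract.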


  • Published in

    ISCAI '23: Proceedings of the 2023 2nd International Symposium on Computing and Artificial Intelligence
    October 2023
    120 pages
    ISBN: 9798400708954
    DOI: 10.1145/3640771

    Copyright © 2023 ACM

    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from permissions@acm.org.

    Publisher

    Association for Computing Machinery

    New York, NY, United States


    Qualifiers

    • research-article
    • Research
    • Refereed limited
