skip to main content
10.1145/3573428.3573736acmotherconferencesArticle/Chapter ViewAbstractPublication PageseitceConference Proceedingsconference-collections
research-article

Chinese News Headline Classification Model Based on ERNIE and Deep Learning Algorithm

Published:15 March 2023Publication History

ABSTRACT

Although the Word2Vec model can solve the problem of sparse features and high dimensionality in text representation, it cannot handle the problem of multiple meaning words in Chinese vocabulary. Therefore, this paper proposes a text classification model (EBGM) based on a combination of ERNIE, bidirectional gated recurrent unit (BiGRU) and maximum pool processing. First, more contextual semantic representation of Chinese text is performed by ERNIE pre-training model; then, to enhance the relevance of contextual semantics, contextual semantic information is extracted by BiGRU; finally, maximum pooling is performed to obtain important information of the text. The final results on the experimental dataset show that the model has good performance in the Chinese news headline classification task, which proves the feasibility of the model.

References

  1. Minaee S, Kalchbrenner N, Cambria E, Deep Learning Based Text Classification: A Comprehensive Review [J]. 2020.Google ScholarGoogle Scholar
  2. Kim Y. Convolutional Neural Networks for Sentence Classification [J]. Eprint Arxiv, 2014: 35-44.Google ScholarGoogle Scholar
  3. LU Ling, YANG Wu, WANG Yuanlun, LEI Zijian, LI Ying. Long text classification combined with attention mechanism [J]. Computer applications, 2018, 38 (05): 1272-1277.Google ScholarGoogle Scholar
  4. Liu Z, You F. TextGCN based on data enhancement [C]// EITCE 2020: 2020 4th International Conference on Electronic Information Technology and Computer Engineering. 2020.Google ScholarGoogle Scholar
  5. Hochreiter S, Schmidhuber J. Long Short-Term Memory [J]. Neural Computation, 1997, 9(8): 1735-1780.Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. CHEN Kejia, LIU Hui.Chinese text classification method based on improved BiGRU-CNN [J]. Computer engineering, 2022, 48 (05): 59-66+73.Google ScholarGoogle Scholar
  7. DUAN Dandan, TANG Jiashan, WEN Yong, Chinese short text classification algorithm based on BERT model [J]. Computer Engineering, 2021, 47(01): 79-86.Google ScholarGoogle Scholar
  8. Li Tianhao, Huo Qilun, Yan Yue, Xu Yuanchao. A Chinese relation extraction model integrating ERNIE and attention mechanism [J]. Small Microcomputer System, 2022, 43 (06): 1226-1231. DOI: 10.20009/j.cnki.21-1106/TP.2021-0918Google ScholarGoogle Scholar
  9. Devlin J, Chang M W, Lee K, BERT:Pre-training of Deep Bidirectional Transformers for Language Understanding [C]//Proceedings of NAACL-HIL. 2019: 4171-4186.Google ScholarGoogle Scholar
  10. Vaswani A, Shazeer N, Parmar N, Attention is All You Need [C]// Proceedings of the 31st International Conference on Neural Information Processing Systems. 2017: 5998-6008.Google ScholarGoogle Scholar
  11. Sun Y, Wang S, Li Y, ERNIE: Enhanced Representation through Knowledge Integration [J]. 2019.Google ScholarGoogle Scholar
  12. Cho K, Merrienboer B V, Gulcehre C, Learning phrase representations using RNN Encoder-Decoder for statistical machine translation [J]. Association for Computational Linguistics, 2014, 14(1): 1724-1734.Google ScholarGoogle Scholar
  13. LAI Siwei, XU Liheng, LIU Kang, Recurrent convolutional neural networks for text classification [C]// Proceedings of the 29th AAAI Conference on Artificial Intelligence. [S.L.]: AAAI, 2015: 2267-2273.Google ScholarGoogle Scholar
  14. Collobert R, Weston J, Bottou L, Natural Language Processing (almost) from Scratch [J]. Journal of Machine Learning Research, 2011, 12(1): 2493-2537.Google ScholarGoogle ScholarDigital LibraryDigital Library

Recommendations

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Sign in
  • Published in

    cover image ACM Other conferences
    EITCE '22: Proceedings of the 2022 6th International Conference on Electronic Information Technology and Computer Engineering
    October 2022
    1999 pages
    ISBN:9781450397148
    DOI:10.1145/3573428

    Copyright © 2022 ACM

    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    • Published: 15 March 2023

    Permissions

    Request permissions about this article.

    Request Permissions

    Check for updates

    Qualifiers

    • research-article
    • Research
    • Refereed limited

    Acceptance Rates

    Overall Acceptance Rate508of972submissions,52%
  • Article Metrics

    • Downloads (Last 12 months)20
    • Downloads (Last 6 weeks)1

    Other Metrics

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format .

View HTML Format