DOI: 10.1145/3655532.3655553
Research article

Chinese News Classification Based on ERNIE and Attention Fusion Features

Published: 28 June 2024

Abstract

To improve the classification of Chinese news while correlating the features extracted by multiple network models, we propose a deep network model based on the pre-trained model ERNIE and attention-based feature fusion. ERNIE serves as the word embedding layer and produces dynamic word vectors; a self-attention mechanism (SAT) combined with a DPCNN captures long-distance dependencies in the text; and a bidirectional gated recurrent unit (BiGRU) with a soft attention mechanism (AT) captures contextual temporal features. The [CLS] token output by ERNIE, the last hidden state of the BiGRU, and the two features above are combined into a feature sequence, and an attention mechanism assigns a weight to each feature. Finally, the weighted and summed feature vector is classified by a fully connected output layer. Trained on the public Chinese dataset THUCNews, the model improves text classification accuracy over the comparison models in our experiments.
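The attention fusion step the abstract describes can be sketched as follows. This is a minimal, hypothetical illustration, not the authors' implementation: four feature vectors (ERNIE's [CLS] output, the SAT+DPCNN feature, the BiGRU+AT feature, and the BiGRU's last hidden state) are scored by a learned attention vector, softmax-normalized, summed, and passed through a fully connected layer. All dimensions, names, and parameters here are assumptions.

```python
import numpy as np

def softmax(x):
    # Numerically stable softmax over a 1-D score vector.
    e = np.exp(x - x.max())
    return e / e.sum()

def fuse_and_classify(features, w_attn, w_fc, b_fc):
    """features: (4, d) stacked feature vectors; w_attn: (d,) attention
    scoring vector; w_fc: (d, n_classes) classifier weights."""
    scores = features @ w_attn      # one score per feature vector -> (4,)
    alpha = softmax(scores)         # attention weights, sum to 1
    fused = alpha @ features        # weighted sum of features -> (d,)
    logits = fused @ w_fc + b_fc    # fully connected output layer
    return alpha, logits

# Toy shapes: feature dimension 8, 10 news categories (an assumption;
# the THUCNews benchmark subset commonly uses 10 classes).
rng = np.random.default_rng(0)
d, n_classes = 8, 10
features = rng.normal(size=(4, d))  # [CLS], SAT+DPCNN, BiGRU+AT, BiGRU last state
w_attn = rng.normal(size=d)
w_fc = rng.normal(size=(d, n_classes))
b_fc = np.zeros(n_classes)

alpha, logits = fuse_and_classify(features, w_attn, w_fc, b_fc)
```

In the paper's setting the weights would be learned end-to-end; the design lets the model downweight whichever feature stream is less informative for a given input.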



Published In

ICRSA '23: Proceedings of the 2023 6th International Conference on Robot Systems and Applications
September 2023
335 pages
ISBN:9798400708039
DOI:10.1145/3655532

Publisher

Association for Computing Machinery

New York, NY, United States


Author Tags

  1. BIGRU
  2. DPCNN
  3. ERNIE
  4. attention mechanism
  5. text classification

Qualifiers

  • Research-article
  • Research
  • Refereed limited

