Research article · DOI: 10.1145/3507548.3507574

Short Text Classification Model Based on BERT and Fusion Network

Published: 09 March 2022

Abstract

Short texts lack contextual information, arrive in large volumes, and have sparse features, and traditional text feature representations cannot dynamically capture the key classification signals of polysemous words or contextual semantics. To address this, this paper proposes B-BAtt-MPC (BERT-BiLSTM-Attention-Max-Pooling-Concat), a network model built on the pre-trained BERT language model that integrates a BiLSTM, an attention mechanism, and max-pooling. First, the BERT model extracts multi-dimensional, rich feature information such as contextual semantics and grammar. Second, the BERT output vectors are passed through the BiLSTM, attention, and max-pooling layers to obtain the most salient feature information. To further optimize the classifier, the BERT and BiLSTM output vectors are fused and fed into the max-pooling layer. Finally, the two max-pooled feature vectors are fused to produce the classification result. Experimental results on two datasets show that the proposed model captures the important, semantically rich features needed for short text classification and improves classification performance.
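The authors' implementation is not included on this page. The NumPy sketch below illustrates one plausible reading of the fusion head the abstract describes: attention pooling over BiLSTM states, max-pooling over token-wise fused BERT+BiLSTM vectors, and concatenation of the two summaries for classification. The encoder outputs, dimensions, and weight matrices are all illustrative stand-ins, not the paper's actual model.

```python
import numpy as np

rng = np.random.default_rng(0)

T, d = 8, 16          # sequence length and hidden size (BERT uses 768 in practice)
n_classes = 2

# Stand-ins for encoder outputs: in the paper these would come from BERT
# and from a BiLSTM run over the BERT token vectors.
h_bert = rng.normal(size=(T, d))   # BERT token representations
h_lstm = rng.normal(size=(T, d))   # BiLSTM token representations (projected to d dims)

def softmax(x, axis=-1):
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

# 1) Attention over the BiLSTM states: score each token, pool by weighted sum.
w_att = rng.normal(size=(d,))
alpha = softmax(h_lstm @ w_att)      # (T,) attention weights over tokens
att_vec = alpha @ h_lstm             # (d,) attention-pooled summary

# 2) Fuse BERT and BiLSTM outputs token-wise, then max-pool over the sequence.
fused = np.concatenate([h_bert, h_lstm], axis=1)   # (T, 2d) fused token vectors
pool_vec = fused.max(axis=0)                       # (2d,) max-pooled summary

# 3) Concatenate both summaries and classify with a linear layer + softmax.
feat = np.concatenate([att_vec, pool_vec])         # (3d,) final feature vector
w_cls = rng.normal(size=(feat.shape[0], n_classes))
probs = softmax(feat @ w_cls)                      # (n_classes,) class probabilities
```

In a trained model the attention weights `w_att` and classifier weights `w_cls` would be learned jointly with the encoders; here they are random only to make the data flow and tensor shapes concrete.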


Cited By

  • (2022) A Generative Adversarial Net Assisted Method for User Intention Recognition on Imbalanced Dataset. 2022 IEEE International Conference on Knowledge Graph (ICKG), pp. 157-163. DOI: 10.1109/ICKG55886.2022.00027. Online publication date: Nov 2022.


Published In

CSAI '21: Proceedings of the 2021 5th International Conference on Computer Science and Artificial Intelligence
December 2021
437 pages
ISBN:9781450384155
DOI:10.1145/3507548

Publisher

Association for Computing Machinery

New York, NY, United States

Author Tags

  1. Attention
  2. BERT
  3. BiLSTM
  4. Max-Pooling
  5. Vector Fusion

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

CSAI 2021
