skip to main content
10.1145/3501409.3501636acmotherconferencesArticle/Chapter ViewAbstractPublication PageseitceConference Proceedingsconference-collections
research-article

Classification of News Texts Based on Bayes Algorithm

Published: 31 December 2021 Publication History

Abstract

At present, in order to obtain valuable information from the mixed information in a short time, the text classification in data mining emerges with the time going. we used the naive Bayes algorithm to solve the problem of text classification in data mining. Firstly, we need further preprocessing to reflect this feature of the text, which is TF-ID. Secondly, through the integrated environment anaconda platform of Python, the principle of naive Bayesian classification model is mastered and the whole process of news classification is displayed. Finally, the characteristic extraction with TF-IDF value, the accuracy of Bayesian classifier was significantly improved. The emphasis is on the working principle of naive Bayes model with multinomial as a priori and the improvement on the methodof text feature extraction.: the extraction method of TF-IDF is more reasonable.

References

[1]
Zhang, H (Zhang, Huan)[1]; Jiang, LX (Jiang, Liangxiao)[ 1, 2]; Yu, LJ (Yu, Liangjun)[3] Attribute and instance weighted naive Bayes. PATTERN RECOGNITION. vol: 111
[2]
Khajenezhad, A; Bashiri, MA; Beigy, H. A distributed density estimation algorithm and its application to naive Bayes classification. APPLIED SOFT COMPUTING. vol: 98: 10.1016/j.asoc.2020.106837.
[3]
Khajenezhad Ahmad, Bashiri Mohammad Ali, Beigy Hamid A distributed density estimation algorithm and its application to naive Bayes classification[J] Applied Soft Computing, 2020.
[4]
Kurniawan Yogiek Indra; Cahyono Teguh; Nofiyati; Maryanto Eddy; Fadli Ari; Indraswari Naisha Rahma. Preprocessing Using Correlation Based Features Selection on Naive Bayes Classification. IEEE Access. vol:8 Page: 145381--145400.
[5]
Marcos de Moraes Ronei, Soares Elaine Anita de Melo Gomes, Machado Liliane dos Santos et al. A double weighted fuzzy gamma naive bayes classifier[J] Journal of Intelligent & Fuzzy Systems, 2020, 38(1).
[6]
Rukmawan S H, Aszhari F R, Rustam Z et al. Cerebral Infarction Classification Using the K-Nearest Neighbor and Naive Bayes Classifier[J] Journal of Physics: Conference Series, 2021, 1752(1).
[7]
Hongpo Zhang, Ning Cheng, Yang Zhang et al. Label flipping attacks against Naive Bayes on spam filtering systems[J] Applied Intelligence, 2021(prepublish).
[8]
Dwi Andini Putri, Putri Dwi Andini, Kristiyanti Dinar Ajeng et al. Comparison of Naive Bayes Algorithm and Support Vector Machine using PSO Feature Selection for Sentiment Analysis on E-Wallet Review[J] Journal of Physics: Conference Series, 2020, 1641(1).

Cited By

View all
  • (2023)BAG: Text Classification Based on Attention Mechanism Combining BERT and GCNSoftware Engineering and Applications10.12677/SEA.2023.12202312:02(230-241)Online publication date: 2023
  • (2023)On the class separability of contextual embeddings representations – or “The classifier does not matter when the (text) representation is so good!”Information Processing and Management: an International Journal10.1016/j.ipm.2023.10333660:4Online publication date: 1-Jul-2023

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
EITCE '21: Proceedings of the 2021 5th International Conference on Electronic Information Technology and Computer Engineering
October 2021
1723 pages
ISBN:9781450384322
DOI:10.1145/3501409
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 31 December 2021

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Data Mining
  2. Naive-Bayes Classifiers
  3. TF-IDF
  4. Text Classification

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

EITCE 2021

Acceptance Rates

EITCE '21 Paper Acceptance Rate 294 of 531 submissions, 55%;
Overall Acceptance Rate 508 of 972 submissions, 52%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)6
  • Downloads (Last 6 weeks)0
Reflects downloads up to 03 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2023)BAG: Text Classification Based on Attention Mechanism Combining BERT and GCNSoftware Engineering and Applications10.12677/SEA.2023.12202312:02(230-241)Online publication date: 2023
  • (2023)On the class separability of contextual embeddings representations – or “The classifier does not matter when the (text) representation is so good!”Information Processing and Management: an International Journal10.1016/j.ipm.2023.10333660:4Online publication date: 1-Jul-2023

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media