Abstract
This paper researches the technology of text classification, and proposes a two-layer structure automatic text classification based on HMM and SVM. The given text is classified with HMM classifiers first to select the most likely two classes. Then these classes as SVM input are processed. Finally the given text is classified into the corresponding category with SVM classifier. The experimental results show that this method is more efficient for text classification in recognition ratio.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Liu Y-Z, Lin Y-P, Chen Z-P (2004) Text information extraction based on hidden markov model. J Syst Simul 16(3):507–509
Zhou S-X, Lin Y-P, Wang Y-N (2007) Text information extraction based on the second-order hidden markov model. ACTA Electronica Sinica 11:2226–2231
Luo S-H, Ouyang W-M (2007) HMM based text categorization. Comput Eng Appl 10:179–181
Yang J, Wang H-H (2010) Text classification algorithm based on hidden Markov model. J Comput Appl 9:2348–2351
Luo X, Xia D-L, Yan P-L (2005) Improved feature selection method and TF-IDF formula based on word frequency differentia. J Comput Appl 5(9):2031–2033
Sebastiani F (2002) Machine learning in automated text categorization. J ACM Comput Surv 34(1):1–47
Tsang IW, Kwok JT, Cheung P-M (2005) Core vector machines: fast SVM training on very large data sets. J Mach Learning Res 363–392
Gulinazi L, Sun T-L (2011) The application of multi-class SVM based binary tree in web text categorization. J XinJiang Univ (2):100–104
Acknowledgments
This work is supported by the provincial natural science research projects (KJ2011Z097) of Anhui.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Wang, L., Li, L. (2013). Automatic Text Classification Based on Hidden Markov Model and Support Vector Machine. In: Yin, Z., Pan, L., Fang, X. (eds) Proceedings of The Eighth International Conference on Bio-Inspired Computing: Theories and Applications (BIC-TA), 2013. Advances in Intelligent Systems and Computing, vol 212. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-37502-6_27
Download citation
DOI: https://doi.org/10.1007/978-3-642-37502-6_27
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-37501-9
Online ISBN: 978-3-642-37502-6
eBook Packages: EngineeringEngineering (R0)