Skip to main content

Automatic Text Classification Based on Hidden Markov Model and Support Vector Machine

  • Conference paper
  • First Online:

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 212))

Abstract

This paper researches the technology of text classification, and proposes a two-layer structure automatic text classification based on HMM and SVM. The given text is classified with HMM classifiers first to select the most likely two classes. Then these classes as SVM input are processed. Finally the given text is classified into the corresponding category with SVM classifier. The experimental results show that this method is more efficient for text classification in recognition ratio.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   259.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   329.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Liu Y-Z, Lin Y-P, Chen Z-P (2004) Text information extraction based on hidden markov model. J Syst Simul 16(3):507–509

    Google Scholar 

  2. Zhou S-X, Lin Y-P, Wang Y-N (2007) Text information extraction based on the second-order hidden markov model. ACTA Electronica Sinica 11:2226–2231

    Google Scholar 

  3. Luo S-H, Ouyang W-M (2007) HMM based text categorization. Comput Eng Appl 10:179–181

    Google Scholar 

  4. Yang J, Wang H-H (2010) Text classification algorithm based on hidden Markov model. J Comput Appl 9:2348–2351

    Google Scholar 

  5. Luo X, Xia D-L, Yan P-L (2005) Improved feature selection method and TF-IDF formula based on word frequency differentia. J Comput Appl 5(9):2031–2033

    Google Scholar 

  6. Sebastiani F (2002) Machine learning in automated text categorization. J ACM Comput Surv 34(1):1–47

    Google Scholar 

  7. Tsang IW, Kwok JT, Cheung P-M (2005) Core vector machines: fast SVM training on very large data sets. J Mach Learning Res 363–392

    Google Scholar 

  8. Gulinazi L, Sun T-L (2011) The application of multi-class SVM based binary tree in web text categorization. J XinJiang Univ (2):100–104

    Google Scholar 

Download references

Acknowledgments

This work is supported by the provincial natural science research projects (KJ2011Z097) of Anhui.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Li Wang .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Wang, L., Li, L. (2013). Automatic Text Classification Based on Hidden Markov Model and Support Vector Machine. In: Yin, Z., Pan, L., Fang, X. (eds) Proceedings of The Eighth International Conference on Bio-Inspired Computing: Theories and Applications (BIC-TA), 2013. Advances in Intelligent Systems and Computing, vol 212. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-37502-6_27

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-37502-6_27

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-37501-9

  • Online ISBN: 978-3-642-37502-6

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics