Skip to main content

Text Classification Methods Based on SVD and FCM

  • Conference paper
  • First Online:
  • 1035 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 11268))

Abstract

In order to find key and useful messages among massive online resources, this paper propose a method to classify documents about soybean metabolism based on Singular Value Decomposition (SVD) and Fuzzy c-Means(FCM). Singular Value Decomposition (SVD) is an important way of matrix decomposition, which can represent a complex matrix by dividing it into smaller and simpler submatrices that describe important properties of matrices. After the dimension reduction, the Fuzzy c-Means (FCM) is used for clustering, which makes the objects divided into the same cluster have the highest similarity, while the object between different clusters have the lowest similarity. Besides, term frequency (TF) and entropy weight method (EWM) can also be used to construct matrix.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Carrera-Trejo, V., Sidorov, G., Miranda-Jimnez, S., Ibarra, M.M., Martnez, R.C.: Latent dirichlet allocation complement in the vector space model for multi-label text classification. Cancer Biol. Ther. 7(7), 1095–1097 (2015)

    Google Scholar 

  2. Cortes, C., Vapnik, V.: Support-Vector Networks. Kluwer Academic Publishers (1995)

    Google Scholar 

  3. Dempster, A.P., Laird, N.M., Rubin, D.B.: Maximum likelihood from incomplete data via the EM algorithm. J. R. Stat. Soc. 39(1), 1–38 (1977)

    MathSciNet  MATH  Google Scholar 

  4. Guha, S., Rastogi, R.: Cure: an efficient clustering algorithm for large database. Inf. Syst. 26(1), 35–58 (2001)

    Article  Google Scholar 

  5. Li, C.H., Park, S.C.: Neural network for text classification based on singular value decomposition. In: IEEE International Conference on Computer and Information Technology, pp. 47–52 (2007)

    Google Scholar 

  6. Ng, R.T., Han, J.: Efficient and Effective Clustering Methods for Spatial Data Mining. University of British Columbia (1994)

    Google Scholar 

  7. Nigam, K., Mccallum, A.K., Thrun, S., Mitchell, T.: Text classification from labeled and unlabeled documents using EM. Mach. Learn. 39(2–3), 103–134 (2000)

    Article  Google Scholar 

  8. Roul, R.K., Sahay, S.K.: K-means and wordnet based feature selection combined with extreme learning machines for text classification. In: International Conference on Distributed Computing and Internet Technology, pp. 103–112 (2016)

    Google Scholar 

  9. Symeonidis, P., Kehayov, I., Manolopoulos, Y.: Text classification by aggregation of SVD eigenvectors. In: East European Conference on Advances in Databases and Information Systems, pp. 385–398 (2012)

    Chapter  Google Scholar 

  10. Tran, T.N., Drab, K., Daszykowski, M.: Revised DBSCAN algorithm to cluster data with dense adjacent clusters. Chemom. Intell. Lab. Syst. 120(2), 92–96 (2013)

    Article  Google Scholar 

  11. Wang, W., Yang, J., Muntz, R.: Sting+: an approach to active spatial data mining. In: ICDE, p. 116 (1999)

    Google Scholar 

  12. Wang, W., Yang, J., Muntz, R.R.: Sting: a statistical information grid approach to spatial data mining. In: Proceedings of the 23rd Very Large Database Conference, pp. 186–195 (1997)

    Google Scholar 

  13. Zhang, Y.T., Gong, L., Wang, Y.C.: An improved TF-IDF approach for text classification. J. Zhejiang Univ. Sci. A 6A(1), 49–55 (2005)

    Article  Google Scholar 

Download references

Acknowledge

This work was supported by grants from the Fundamental Research Funds for the Key Research Programm of Chongqing Science & Technology Commission (grant no. cstc2017rgzn-zdyf0064), the Chongqing Provincial Human Resource and Social Security Department (grant no. cx2017092), the Central Universities in China (grant nos. 2018CDXYRJ0030, CQU0225001104447). The authors would like to express their gratitude to all the subjects that participated in the experiments. This study is supported by Science and Technology Innovation Project of Foshan City, China (Grant No. 2015IT100095), the Fundamental Research Funds for the Central Universities (Grant No. lzujbky-2016-br03), CERNET Innovation Project (Grant No. NGII20150603) and Science and Technology Planning Project of Guangdong Province, China (Grant No. 2016B010108002).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Yi Yang .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Yang, N., Li, S., Sun, R., Yang, Y. (2018). Text Classification Methods Based on SVD and FCM. In: U, L., Xie, H. (eds) Web and Big Data. APWeb-WAIM 2018. Lecture Notes in Computer Science(), vol 11268. Springer, Cham. https://doi.org/10.1007/978-3-030-01298-4_11

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-01298-4_11

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-01297-7

  • Online ISBN: 978-3-030-01298-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics