Text Classification Methods Based on SVD and FCM

Yang, Ning; Li, Shuaibing; Sun, Rong; Yang, Yi

doi:10.1007/978-3-030-01298-4_11

Conference paper
First Online: 21 October 2018

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 11268)

Abstract

To find key and useful messages among massive online resources, this paper proposes a method for classifying documents about soybean metabolism based on Singular Value Decomposition (SVD) and Fuzzy c-Means (FCM). SVD is an important matrix decomposition technique that represents a complex matrix as a product of smaller, simpler submatrices describing its important properties. After the dimension reduction, FCM is used for clustering, so that objects assigned to the same cluster have the highest similarity, while objects in different clusters have the lowest similarity. In addition, term frequency (TF) and the entropy weight method (EWM) can be used to construct the matrix.
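The pipeline outlined in the abstract can be sketched roughly as follows. This is a minimal illustration and not the authors' implementation: it assumes a plain term-frequency matrix built from whitespace tokens, a truncated SVD computed with NumPy, and the textbook FCM update rules with fuzzifier m = 2. The helper names (`term_frequency_matrix`, `svd_reduce`, `fuzzy_c_means`), the toy documents, and all parameter values are hypothetical, and the entropy weight method is left out.

```python
import numpy as np

def term_frequency_matrix(docs):
    """Build a documents-by-terms count matrix from whitespace tokens."""
    vocab = sorted({w for d in docs for w in d.lower().split()})
    index = {w: j for j, w in enumerate(vocab)}
    X = np.zeros((len(docs), len(vocab)))
    for i, d in enumerate(docs):
        for w in d.lower().split():
            X[i, index[w]] += 1.0
    return X, vocab

def svd_reduce(X, k):
    """Keep document coordinates along the top-k singular directions."""
    U, s, Vt = np.linalg.svd(X, full_matrices=False)
    return U[:, :k] * s[:k]

def fuzzy_c_means(Z, c, m=2.0, n_iter=100, tol=1e-6, seed=0):
    """Textbook FCM: alternate centre and membership updates until stable."""
    rng = np.random.default_rng(seed)
    U = rng.random((Z.shape[0], c))
    U /= U.sum(axis=1, keepdims=True)          # each row is a membership vector
    for _ in range(n_iter):
        W = U ** m
        centers = (W.T @ Z) / W.sum(axis=0)[:, None]
        dist = np.linalg.norm(Z[:, None, :] - centers[None, :, :], axis=2)
        dist = np.fmax(dist, 1e-12)            # guard against division by zero
        inv = dist ** (-2.0 / (m - 1.0))
        U_new = inv / inv.sum(axis=1, keepdims=True)
        converged = np.abs(U_new - U).max() < tol
        U = U_new
        if converged:
            break
    return U, centers

# Toy corpus (invented for illustration only).
docs = [
    "soybean isoflavone metabolism pathway",
    "isoflavone metabolism in soybean seed",
    "stock market price index",
    "market index and stock price",
]
X, vocab = term_frequency_matrix(docs)
Z = svd_reduce(X, k=2)                 # dimension reduction
U, centers = fuzzy_c_means(Z, c=2)     # soft cluster memberships
labels = U.argmax(axis=1)              # hard labels, if needed
```

Each row of `U` holds one document's degrees of membership in the clusters and sums to 1; taking the `argmax` is just one way to turn the soft assignment into a class label.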

Acknowledgements

This work was supported by grants from the Fundamental Research Funds for the Key Research Program of Chongqing Science & Technology Commission (grant no. cstc2017rgzn-zdyf0064), the Chongqing Provincial Human Resource and Social Security Department (grant no. cx2017092), the Central Universities in China (grant nos. 2018CDXYRJ0030, CQU0225001104447). The authors would like to express their gratitude to all the subjects who participated in the experiments. This study was also supported by the Science and Technology Innovation Project of Foshan City, China (Grant No. 2015IT100095), the Fundamental Research Funds for the Central Universities (Grant No. lzujbky-2016-br03), the CERNET Innovation Project (Grant No. NGII20150603) and the Science and Technology Planning Project of Guangdong Province, China (Grant No. 2016B010108002).

Author information

Authors and Affiliations

School of Information Science and Engineering, Lanzhou University, Gansu, 730000, China
Ning Yang, Shuaibing Li & Yi Yang
Silk Road Economic Belt Research Center of Lanzhou University, Gansu, 730000, China
Yi Yang
School of Mathematics, Jilin University, Jilin, 130000, China
Rong Sun

Corresponding author

Correspondence to Yi Yang.

Editor information

Editors and Affiliations

University of Macau, Macao, China
Leong Hou U
Education University of Hong Kong, Hong Kong, China
Haoran Xie

About this paper

Cite this paper

Yang, N., Li, S., Sun, R., Yang, Y. (2018). Text Classification Methods Based on SVD and FCM. In: U, L., Xie, H. (eds) Web and Big Data. APWeb-WAIM 2018. Lecture Notes in Computer Science, vol 11268. Springer, Cham. https://doi.org/10.1007/978-3-030-01298-4_11

DOI: https://doi.org/10.1007/978-3-030-01298-4_11
Published: 21 October 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-01297-7
Online ISBN: 978-3-030-01298-4
eBook Packages: Computer Science, Computer Science (R0)
