Short Text Mapping Based on Fast Clustering Using Minimum Spanning Trees

Li, Pingrong

doi:10.1007/978-3-030-26766-7_50

Pingrong Li¹¹

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 11645))

Included in the following conference series:

International Conference on Intelligent Computing

Abstract

Due to short length and limited content, short text representation has the problem of high-dimension and high-sparsity. For the purpose of achieving the goal of reducing the dimension and eliminate the sparseness while preserve the semantics of the information in the text to be represented, a method of short text mapping based on fast clustering using minimum spanning trees is proposed. First, we remove the irrelevant terms, then a clustering method based on minimum spanning tree is adopted to identify the relevant term set and remove the redundant terms to get the short text mapping space. Finally, a matrix mapping method is designed to represent the original short text on a highly correlated and non-redundant short text mapping space. The proposed method not only has low time complexity but also produces higher quality short text mapping space. The experiments prove that our method is feasible and effective.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Evaluating the Performance of Transformers-Based Semantic Similarity Measures in Short-Text Clustering

Effectively Representing Short Text via the Improved Semantic Feature Space Mapping

Locality-Sensitive Term Weighting for Short Text Clustering

References

Yong, Z., Li, Y., Xia, S.: An improved KNN text classification algorithm based on clustering. J. Comput. 4(3), 230–237 (2009)
Google Scholar
Cai, Y., et al.: Semi-supervised short text categorization based on attribute selection. J. Comput. Appl. 30(4), 1015–1018 (2010)
Google Scholar
Li, P., Wang, H., Zhu, K.Q., et al.: A large probabilistic semantic network based approach to compute term similarity. IEEE Trans. Knowl. Data Eng. 27(10), 2604–2617 (2015)
Article Google Scholar
Kumar, S., Rengarajan, P., Annie, A.X.: Using wikipedia category network to generate topic trees. In: AAAI 2017, pp. 4951–4952 (2017)
Google Scholar
Piao, G.Y., Breslin, J.G.: User modeling on Twitter with WordNet synsets and DBpedia concepts for personalized recommendations. In: CIKM 2016, pp. 2057–2060 (2016)
Google Scholar
Sun, A.: Short text classification using very few words. In: Proceedings of the 35th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 1145–1146. ACM (2012)
Google Scholar
Usama, M.F., Irani, K.B.: Multi-interval discretization of continuous valued attributes for classification learning. In: Proceedings of 13th International Joint Conference on AI, pp. 1022–1027 (1993)
Google Scholar
John, G.H., Kohavi, R., Pfleger, K.: Irrelevant features and the subset selection problem. In: The Proceedings of the Eleventh International Conference on Machine Learning, pp. 121–129 (1994)
Chapter Google Scholar
Song, Q., Ni, J., Wang, G.: A fast clustering-based feature subset selection algorithm for high-dimensional data. IEEE Trans. Knowl. Data Eng. 25(1), 1–14 (2013)
Article Google Scholar
Garey, M.R., Johnson, D.S.: Computers and Intractability: A Guide to the Theory of NP-Completeness. W. H. Freeman & Co., New York (1979)
MATH Google Scholar
Prim, R.C.: Shortest connection networks and some generalizations. Bell Syst. Tech. J. 36, 1389–1401 (1957)
Article Google Scholar
Gao, L., Zhou, S., Guan, J.: Effectively classifying short texts by structured sparse representation with dictionary filtering. Inf. Sci. 323, 130–142 (2015)
Article MathSciNet Google Scholar

Download references

Author information

Authors and Affiliations

College of Electronic Commerce, Longnan Teachers College, Longnan, 746000, China
Pingrong Li

Authors

Pingrong Li
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Pingrong Li .

Editor information

Editors and Affiliations

Tongji University, Shanghai, China
De-Shuang Huang
Nanchang Institute of Technology, Nanchang, China
Zhi-Kai Huang
Liverpool John Moores University, Liverpool, UK
Abir Hussain

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Li, P. (2019). Short Text Mapping Based on Fast Clustering Using Minimum Spanning Trees. In: Huang, DS., Huang, ZK., Hussain, A. (eds) Intelligent Computing Methodologies. ICIC 2019. Lecture Notes in Computer Science(), vol 11645. Springer, Cham. https://doi.org/10.1007/978-3-030-26766-7_50

Download citation

DOI: https://doi.org/10.1007/978-3-030-26766-7_50
Published: 24 July 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-26765-0
Online ISBN: 978-3-030-26766-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics