Abstract
To solve multidimensional scaling (MDS) efficiently, we previously proposed an algorithm that applies a stochastic gradient algorithm to minimize well-known MDS criteria [1]. In this paper, this efficient MDS algorithm is applied to text mining and compared with the self-organizing map (SOM) [2]. The results verify the validity of our algorithm for the analysis of a massive document collection: it found some interesting structures in about 100,000 Usenet (NetNews) articles.
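The abstract does not give the criterion or the update rule of the algorithm in [1]. As a rough illustration of the general idea only, the sketch below applies stochastic gradient descent to the raw stress criterion, updating one randomly sampled pair of items per step. The function names (`raw_stress`, `sgd_mds`), the choice of raw stress, the linearly decaying learning rate, and all parameter values are assumptions made for this example and are not taken from the paper.

```python
import math
import random


def raw_stress(target, embedding):
    """Sum over pairs of (target distance - embedded distance)^2."""
    n = len(embedding)
    total = 0.0
    for i in range(n):
        for j in range(i + 1, n):
            total += (target[i][j] - math.dist(embedding[i], embedding[j])) ** 2
    return total


def sgd_mds(target, dim=2, steps=20000, lr=0.05, seed=0):
    """Hypothetical sketch: minimize raw stress one sampled pair at a time."""
    rng = random.Random(seed)
    n = len(target)
    # Random initial layout; each item gets `dim` coordinates.
    y = [[rng.uniform(-1.0, 1.0) for _ in range(dim)] for _ in range(n)]
    for step in range(steps):
        i, j = rng.sample(range(n), 2)           # one random pair per update
        diff = [a - b for a, b in zip(y[i], y[j])]
        d = math.sqrt(sum(c * c for c in diff)) or 1e-12  # avoid division by zero
        rate = lr * (1.0 - step / steps)         # assumed linear decay schedule
        # Negative gradient of (target - d)^2 with respect to y[i] is
        # 2 * (target - d) * diff / d; y[j] receives the opposite step.
        scale = rate * 2.0 * (target[i][j] - d) / d
        for k in range(dim):
            y[i][k] += scale * diff[k]
            y[j][k] -= scale * diff[k]
    return y


if __name__ == "__main__":
    # Toy dissimilarities: the side lengths of a 3-4-5 right triangle.
    target = [[0.0, 3.0, 4.0],
              [3.0, 0.0, 5.0],
              [4.0, 5.0, 0.0]]
    layout = sgd_mds(target)
    print("stress after optimization:", raw_stress(target, layout))
```

Each update touches only two items, so the cost per step does not depend on the size of the collection. That property is what makes pairwise stochastic updates attractive for large collections, although this toy version makes no claim about matching the efficiency reported in the paper.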
References
1. Matsuda, Y., Yamaguchi, K.: Global mapping analysis: stochastic gradient algorithm in SSTRESS and classical MDS stress. In: ICONIP 2001 Proceedings, Shanghai, China, pp. 102–107 (2001)
2. Kohonen, T., Kaski, S., Lagus, K., Salojarvi, J., Honkela, J., Paatero, V., Saarela, A.: Self-organization of a massive document collection. IEEE Transactions on Neural Networks 11, 574–585 (2000)
3. Cox, T.F., Cox, M.A.A.: Multidimensional scaling. Chapman & Hall, London (1994)
4. Mardia, K.V.: Some properties of classical multidimensional scaling. Communications in Statistics: A, Theory and Method A7, 1233–1241 (1978)
5. Takane, Y., Young, F.W., de Leeuw, J.: Nonmetric individual differences multidimensional scaling: an alternating least squares method with optimal scaling features. Psychometrika 42, 7–67 (1977)
6. Matsuda, Y., Yamaguchi, K.: Global mapping analysis: stochastic approximation for multidimensional scaling. International Journal of Neural Systems 11, 419–426 (2001)
7. Matsuda, Y., Yamaguchi, K.: An efficient MDS-based topographic mapping algorithm. Neurocomputing 64, 285–299 (2005)
8. Oja, E.: Principal components, minor components, and linear neural networks. Neural Networks 5, 927–935 (1992)
9. Fellbaum, C. (ed.): WordNet: an electronic lexical database. MIT Press, Cambridge (1998)
10. Kohonen, T., Hynninen, J., Kangas, J., Laaksonen, J.: SOM_PAK: the self-organizing map program package. Technical Report A31, Helsinki University of Technology, Laboratory of Computer and Information Science (1996)
11. Baldi, P., Frasconi, P., Smyth, P.: Modeling the Internet and the Web: Probabilistic Methods and Algorithms. John Wiley & Sons, New York (2003)
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Matsuda, Y., Yamaguchi, K. (2005). An Efficient MDS Algorithm for the Analysis of Massive Document Collections. In: Khosla, R., Howlett, R.J., Jain, L.C. (eds) Knowledge-Based Intelligent Information and Engineering Systems. KES 2005. Lecture Notes in Computer Science, vol 3682. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11552451_140
DOI: https://doi.org/10.1007/11552451_140
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-28895-4
Online ISBN: 978-3-540-31986-3
eBook Packages: Computer Science, Computer Science (R0)