Skip to main content

An Efficient MDS Algorithm for the Analysis of Massive Document Collections

  • Conference paper
Knowledge-Based Intelligent Information and Engineering Systems (KES 2005)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 3682))

Abstract

In order to solve multidimensional scaling (MDS) efficiently, we proposed an algorithm, which apply stochastic gradient algorithm to minimizing well-known MDS criteria [1]. In this paper, the efficient MDS algorithm is applied to the text mining and compared with the SOM [2]. The results verified the validity of our algorithm in the analysis of a massive document collection. Our algorithm could find out some interesting structures from about 100000 articles in Usenet (NetNews).

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Matsuda, Y., Yamaguchi, K.: Global mapping analysis: stochastic gradient algorithm in SSTRESS and classical MDS stress. In: ICONIP 2001 Proceedings, Shanghai, China, pp. 102–107 (2001)

    Google Scholar 

  2. Kohonen, T., Kaski, S., Lagus, K., Salojarvi, J., Honkela, J., Paatero, V., Saarela, A.: Self-organization of a massive document collection. IEEE Transactions on Neural Networks 11, 574–585 (2000)

    Article  Google Scholar 

  3. Cox, T.F., Cox, M.A.A.: Multidimensional scaling. Chapman & Hall, London (1994)

    MATH  Google Scholar 

  4. Mardia, K.V.: Some properties of classical multidimensional scaling. Communications in Statistics: A, Theory and Method A7, 1233–1241 (1978)

    Article  MathSciNet  Google Scholar 

  5. Takane, Y., Young, F.W., Deleeuw, J.: Nonmetric individual-differences multidimensional-scaling - alternating least-squares method with optimal scaling features. Psychometrika 42, 7–67 (1977)

    Article  MATH  Google Scholar 

  6. Matsuda, Y., Yamaguchi, K.: Global mapping analysis: stochastic approximation for multidimensional scaling. International Journal of Neural Systems 11, 419–426 (2001)

    Google Scholar 

  7. Matsuda, Y., Yamaguchi, K.: An efficient MDS-based topographic mapping algorithm. Neurocomputing 64, 285–299 (2005)

    Article  Google Scholar 

  8. Oja, E.: Principal components, minor components, and linear neural networks. Neural Networks 5, 927–935 (1992)

    Article  Google Scholar 

  9. Fellbaum, C. (ed.): WordNet: an electronic lexical database. MIT Press, Cambridge (1998)

    MATH  Google Scholar 

  10. Kohonen, T., Hynninen, J., Kangas, J., Laaksonen, J.: SOM_PAK: the selforganizing map program package. Technical Report A31, Helsinki University of Technology, Laboratory of Computer and Information Science (1996)

    Google Scholar 

  11. Baldi, P., Frasconi, P., Smyth, P.: Modeling the Internet and theWeb: Probabilistic Methods and Algorithms. John Wiley & Sons, New York (2003)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Matsuda, Y., Yamaguchi, K. (2005). An Efficient MDS Algorithm for the Analysis of Massive Document Collections. In: Khosla, R., Howlett, R.J., Jain, L.C. (eds) Knowledge-Based Intelligent Information and Engineering Systems. KES 2005. Lecture Notes in Computer Science(), vol 3682. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11552451_140

Download citation

  • DOI: https://doi.org/10.1007/11552451_140

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-28895-4

  • Online ISBN: 978-3-540-31986-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics