Abstract
While the focus of research concerning electronic document archives still is on information retrieval, the importance of interactive exploration has been realized and is gaining importance. The map metaphor, where documents are organized on a map according to their contents, has proven particularly useful as an interface to such a collection. The self-organizing map has shown to produce stable topically ordered organizations of documents on such a 2-dimensional map display. However, the characteristics of these topical clusters are not being made explicit. In this paper we present the LabelSOM method which takes the applicability of the self-organizing map for document archive organization one step further by automatically labeling the various topical clusters found in the map. This allows the user to get an instant overview of the various topics covered by a document collection.
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
M. Hascoët and X. Soinard. Using maps as a user interface to a digital library. In Proc. Int’l ACM SIGIR Conference on Research and Development in Information Retrieval, Melbourne, Australia, 1998.
T. Honkela, S. Kaski, K. Lagus, and T. Kohonen. WEBSOM-Self-organizing maps of document collections. In Proc. Workshop on Self-Organizing Maps (WSOM97), Espoo, Finland, 1997.
T. Kohonen. Self-organized formation of topologically correct feature maps. Biological Cybernetics, 43, 1982. Springer Verlag, Berlin, Heidelberg, New York.
T. Kohonen. Self-Organizing Maps. Springer Verlag, Berlin, Germany, 1995.
X. Lin, D. Soergel, and G. Marchionini. A self-organizing semantic map for information retrieval. In Proc. Int’l ACM SIGIR Conf. on R & D in Information Retrieval, Chicago, IL, 1991.
D. Merkl. A connectionist view on document classification. In Proceedings of the 6th Australasian Database Conf. (ADC’95), Adelaide, SA, 1995.
D. Merkl. Exploration of document collections with self-organizing maps: A novel approach to similarity visualization. In Europ. Symp. on Principles of Data Mining and Knowledge Discovery (PKDD’97), Trondheim, Norway, 1997.
D. Merkl and A. Rauber. Alternative ways for cluster visualization in self-organizing maps. In Proc. of the Workshop on Self-Organizing Maps (WSOM97), Helsinki, Finland, 1997.
D. Merkl and A. Rauber. Cluster connections-a visualization technique to reveal cluster boundaries in self-organizing maps. In Proc 9th Italian Workshop on Neural Nets (WIRN97), Vietri sul Mare, Italy, 1997.
D. Merkl and A. Rauber. CIA’s view of the world and what neural networks learn from it: A comparison of geographical document space representation metaphors. In Proc. 9th International Conf. on Database and Expert Systems (DEXA98), Vienna, Austria, 1998.
D. Merkl and E. Schweighofer. En route to data mining in legal text corpora-Clustering, neural computation, and international treaties. In Proc. International Workshop on Database and Expert Systems Applications, Toulouse, France, 1997.
A. Rauber. SOMLib: A distributed digital library system based on self-organizing maps. In Proc. 10th Italian Workshop on Neural Nets (WIRN98), Vietri sul Mare, Italy, 1998.
A. Rauber and D. Merkl. Creating an order in distributed digital libraries by integrating independent self-organizing maps. In Proc. Int’l Conf. on Artificial Neural Networks (ICANN’98), Skövde, Sweden, 1998.
A. Rauber and D. Merkl. Finding structure in text archives. In Proc. European Symp. on Artificial Neural Networks (ESANN98), Bruges, Belgium, 1998.
A. Rauber and D. Merkl. Automatic labeling of self-organizing maps: Making a treasure map reveal its secrets. In Proc. 4th Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD99), Beijing, China, 1999.
G. Salton. Automatic Text Processing: The Transformation, Analysis, and Retrieval of Information by Computer. Addison-Wesley, Reading, MA, 1989.
A. Ultsch. Self-organizing neural networks for visualization and classification. In Information and Classification. Concepts, Methods and Application. Springer Verlag, 1993.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 1999 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Rauber, A., Merkl, D. (1999). Using Self-organizing Maps to Organize Document Archives and to Characterize Subject Matters: How to Make a Map Tell the News of the World. In: Bench-Capon, T.J., Soda, G., Tjoa, A.M. (eds) Database and Expert Systems Applications. DEXA 1999. Lecture Notes in Computer Science, vol 1677. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-48309-8_28
Download citation
DOI: https://doi.org/10.1007/3-540-48309-8_28
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-66448-2
Online ISBN: 978-3-540-48309-0
eBook Packages: Springer Book Archive