Skip to main content

Towards Content-Related Indexing in Databases

  • Conference paper
Book cover Datenbanksysteme in Büro, Technik und Wissenschaft

Part of the book series: Informatik aktuell ((INFORMAT))

Abstract

Modern business appliications require huge volumes of highdimensioinal data to be Stored. explorative queries, typically used in these applications, select groups of objects with similar attributes or attribute combinations. In contrast to multidimensional index structures designed for spatial data that assume dimension independence and very often a uniform distribution, we have developed a new database indexing concept that discovers correlation patterns and takes the nonuniform distribution into consideration. The corresponding analysis is done on the subsymbolic level by applying a hierarchical artificial neural network. The trained neural network organises the data into a hierarchy of clusters. The clusters can be interpreted as groups of similar objects on the symbolic level. The hierarchy is finally used to derive the Intelligent Cluster Index (ICIx). In this paper we present a description of the Intelligent Cluster Index, it’s creation and application as multidimensional index and as heuristic for a logical distribution schema. We describe first experimental results, showing that this new approach can significantly speed up the system performance.

This work is supported by the Deutsche Forschungsgemeinschaft (DFG) under contract reference SFB 457-00.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 54.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 69.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. N. Beckmann et al., The R*-tree: An Efficient and Robust Access Method for Points and Rectangles, in: SIGMOD Record Vol. 19(2), ACM Press, 1990

    Google Scholar 

  2. W. Benn, O. Görlitz, Semantic Navigation Maps for Information Agents, 2nd Int’l. Workshop on Cooperative Information Agents CIA-98. Paris,.July 1998.

    Google Scholar 

  3. S. Berclitold et al., The X-tree: An Index Structure for High-Dimensional Data, in: Proc. of the 22nd VLDB Conf., Mumbay, India, Morgan Kaufmann 1996

    Google Scholar 

  4. S. Berclitold et al., Independent Quantization: An Index Compression Technique for High-Dimensional Data Spaces, in: Proc. of the 16th Int’l. Conf. on Data Engineering (ICDE), San Diego, USA, IEEE Computer Society, 2000

    Google Scholar 

  5. E. Bertino et al., Indexing Techniques for Advanced Database Systems, Kluwer Academic Publishers Boston, Dordrecht, London 1997

    Book  MATH  Google Scholar 

  6. K. Beyer et al., When Is “(Nearest Neighbor“ Meaningful? in: Lecture Notes in Computer Science 1540, pp. 217–235, Springer 1999

    Google Scholar 

  7. J. P. Bigus, Data Mining with Neural Networks, The McGraw-Hill Companies, 1996

    Google Scholar 

  8. V. Burzevski, C. K. Mohan, Hierarchical Growing Cell Structures, Proc. IEEE Int’l. Conf. on Neural Networks, pp. 1658–63, 1996

    Google Scholar 

  9. P. Dadam Verteilte Datenbanken und Client/Server-Systeme: Grundlagen, Konzepte, Rcalisierungsfoimen, 1996, Springer Verlag Berlin

    Book  Google Scholar 

  10. B. Fritzke, A growing neural gas network learns topologies, 1994, Proc. 1994 NIPS Conf., Denver

    Google Scholar 

  11. B. Fritzke, Vektorbasierte Neuronale Netze, 1998, Shaker Verlag

    Google Scholar 

  12. S. Gilg, R. Neubert, Schemavergleich mit Hilfe Neuronaler Netze, 1998, Studienarbeit, TU Chemnitz

    Google Scholar 

  13. O. Görlitz, R. Neubert, W. Benn, Access to Distributed Environmental Databases with ICIx Technology, Online Information Review Journal, Vol. 24 Issue 5, 2000 MCB University Press

    Google Scholar 

  14. A. Guttman, R-Trees: A Dynamic Index Structure For Spatial Searching, in: SIGMOD Record 14(2), pp. 47–57, ACM Press 1984

    Article  Google Scholar 

  15. A. Henrich, Der LSD-Baum: eine mehrdimensionale Zugriffsstruktur und ihre Einsatzmöglichkeiten in Datenbanksystemen, 1990, Dissertation, FernUniversität-Gesamthochschule—Hagen

    Google Scholar 

  16. A. Henrich, H.-W. Six, How to split buckets in spatial data structures, 1991, Proc. Int’l. Conf. on Geographic Database Management Systems, Esprit Basic Research Series DG XIII, Springer Verlag

    Google Scholar 

  17. A. Henrich, A. Hilbert, P. Widmayer, Anbindung einer räumlich clustemden Zugriffsstruktur für geometrische Attribute an ein Standard-Datenbanksystem am Bsp. Oracle, 1991, Proc. Gl-Fachtagung BTW 91, Springer Informatik Fachbericht 270, pp. 161–177

    Google Scholar 

  18. N. Katayama, S. Satoh, SR-Tree: an index structure for nearest neighbor searching of highdimensional point data, 1998, Journal: Systems and Computers in Japan, Vol. 29, Iss. 5, pp. 59–73

    Article  Google Scholar 

  19. T. Kohoncn, Self-Organization of Very Large Document Collections: State of the Art, 1998, Proc. Int’l. Conf. on Arlifical Neural Networks 1CANN-98

    Google Scholar 

  20. S. Krumbiegel, Performanzvergleich künstlicher Neuronaler Netze bei unterschiedlicher Hardwareunterstützung, 1999, Studienarbeit, TU Chemnitz

    Google Scholar 

  21. W. Li, C. Clifton, Semantic Integration in Heterogeneous Databases Using Neural Networks, Proc. of the 20th VLDB Conf., Santiago, Chile, 1994

    Google Scholar 

  22. T. Martinetz, K. Schulten, A “neural gas“ network learns topologies, 1991, Artifical Neural Networks, North-Holland, Amsterdam, pp. 397–402

    Google Scholar 

  23. D. Merkl, Exploration of Text Collections with Hierarchical Feature Maps, 1997, Proc. ACM SIGIR 97, Philadelphia, USA

    Google Scholar 

  24. R. Neubert, O. Görlitz, W. Benn, Incorporating Knowledge Technology in Databases-the Intelligent Cluster index, KnowTech 2000 Conference and Exihibition, http://www.knowtech.net

  25. J. Nievergelt, II. Hinterberger, K. C. Sevcik, The grid file: An adaptable, symmetric multikey file structure, 1984, ACM Transactions on Database Systems, Vol 9(1), pp. 38–71

    Article  Google Scholar 

  26. OASIS: Open Architecture Server for Information Search, http://www.oasis-europe.org/

  27. J. Rahmel, SplitNet: Learning of Tree Structured Kohonen Chains, 1996, Proc. ICNN 96, Washington

    Google Scholar 

  28. J. Rahmel, On the Role of Topology for Neural Network Interpretation, 1996 Proc. ECAI 96, John Wiley&Sons Ltd.

    Google Scholar 

  29. H. Ritter, T. Kohonen, Self-Organizing Semantic Maps, 1989, Biological Cybernetics 61:(4), pp. 241–254, Springer Verlag

    Article  Google Scholar 

  30. Y. Sakurai et al., The A-tree: An Index Structure for High-Dimensional Spaces Using Relative Approximation, Proc. of the 26th VLDB Conf., Cairo, Egypt, 2000

    Google Scholar 

  31. P. Schcucnnann, Wen-Syan Li, C. Clifton, Multidatabase Query Processing with Uncertainly in Global Keys and Attribute Values, 1998, Journal of the American Society for Information Science Vol. 49(3), pp. 283–301

    Article  Google Scholar 

  32. J. Zavrel, Neural Information Retrieval, 1995, Thesis, University of Amsterdam

    Google Scholar 

  33. J. Zavrel, Neural Navigation Interfaces for Information Retrieval: Are they more than an Appealing Idea, 1996, Artificial Intelligence Review 10, pp. 477–504

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2001 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Neubert, R., Görlitz, O., Benn, W. (2001). Towards Content-Related Indexing in Databases. In: Heuer, A., Leymann, F., Priebe, D. (eds) Datenbanksysteme in Büro, Technik und Wissenschaft. Informatik aktuell. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-56687-5_23

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-56687-5_23

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-41707-1

  • Online ISBN: 978-3-642-56687-5

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics