Abstract
A large number of high-dimensional index structures suffer from the so called ’dimensional curse’ problem, i.e., the retrieval performance becomes increasingly degraded as the dimensionality is increased. To solve this problem, the cell-based filtering scheme has been proposed, but it shows a linear decrease in performance as the dimensionality is increased. In this paper, we propose a parallel high-dimensional index structure using the cell-based filtering for multimedia data so as to cope with the linear decrease in retrieval performance. In addition, we devise data insertion, range query and k-NN query processing algorithms which are suitable for the cluster-based parallel architecture. Finally, we show that our parallel index structure achieves good retrieval performance in proportion to the number of servers in the cluster-based architecture and it outperforms a parallel version of the VA-File when the dimensionality is over 10.
This work is financially supported by the Ministry of Education and Human Resources Development(MOE), the Ministry of Commerce, Industry and Energy(MOCIE) and the Ministry of Labor(MOLAB) though the fostering project of the Lab of Excellency.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Robinson, J.T.: The K-D-B-tree: A Search Structure for Large Multidimensional Dynamic Indexes. In: Proc. of Int. Conf. on Management of Data, pp. 10–18 (1981)
White, D.A., Jain, R.: Similarity Indexing: Algorithms and Performance. In: Proc. of the SPIE: Storage and Retrieval for Image and Video Databases IV, vol. 2670, pp. 62–75 (1996)
Lin, H.I., Jagadish, H., Faloutsos, C.: The TV-tree: An Index Structure for High Di-men-sional Data. VLDB Journal 3, 517–542 (1995)
Berchtold, S., Keim, D.A., Kriegel, H.-P.: The X-tree: An Index Structure for High-Dimensional Data. In: Proceedings of the 22nd VLDB Conference, pp. 28–39 (1996)
Arya, S., Mount, D.M., Narayan, O.: Accounting for Boundary Effects in Nearest Neighbor Searching. In: Proc. of 11th Annaual Symp. on Computational Geometry, Vancouver, Canada, pp. 336–344 (1995)
Berchtold, S., Bohm, C., Keim, D., Kriegel, H.-P.: A Cost Model for Nearest Neighbor Search in High-Dimensional Data Space. In: ACM PODS Symposium on Principles of Data-bases Systems, Tucson, Arizona (1997)
Weber, R., Schek, H.-J., Blott, S.: A Quantitative Analysis and Perform-ance Study for Similarity-Search Methods in High-Dimensional Spaces. In: Proc. of 24th International Conference on Very Large Data Bases, pp. 24–27 (1998)
Han, S.-G., Chang, J.-W.: A New High-Dimensional Index Structure Using a Cell-Based Filtering Technique. In: Masunaga, Y., Thalheim, B., Štuller, J., Pokorný, J. (eds.) ADBIS 2000 and DASFAA 2000. LNCS, vol. 1884, pp. 79–92. Springer, Heidelberg (2000)
Faloutsos, C.: Design of a Signature File Method that Accounts for Non-Uniform Occur-rence and Query Frequencies. In: ACM SIGMOD, pp. 165–170 (1985)
Kim, J.-K., Chang, J.-W.: Horizontally-divided Signature File on a Parallel Machine Architecture. Journal of Systems Architecture 44(9-10), 723–735 (1998)
Kim, J.-K., Chang, J.-W.: Vertically-partitioned Parallel Signature File Method. Journal of Systems Architecture 46(8), 655–673 (2000)
Roussopoulos, N., Kelley, S., Vincent, F.: Nearest Neighbor Queries. In: Proc. of ACM Int. Conf. on Management of Data (SIGMOD), pp. 71–79 (1995)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Chang, JW., Kim, YK., Kim, YJ. (2006). Parallel High-Dimensional Index Structure Using Cell-Based Filtering for Multimedia Data. In: Min, G., Di Martino, B., Yang, L.T., Guo, M., Rünger, G. (eds) Frontiers of High Performance Computing and Networking – ISPA 2006 Workshops. ISPA 2006. Lecture Notes in Computer Science, vol 4331. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11942634_80
Download citation
DOI: https://doi.org/10.1007/11942634_80
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-49860-5
Online ISBN: 978-3-540-49862-9
eBook Packages: Computer ScienceComputer Science (R0)