Abstract
A number of problems in computer science can be solved efficiently with the so called memory based or kernel methods. Among this problems (relevant to the AI community) are multimedia indexing, clustering, non supervised learning and recommendation systems. The common ground to this problems is satisfying proximity queries with an abstract metric database.
In this paper we introduce a new technique for making practical indexes for metric range queries. This technique improves existing algorithm based on pivots and signatures, and introduce a new data structure, the Fixed Queries Trie to speedup metric range queries. The result is an O(n) construction time index, with query complexity O(n α), α≤ 1. The indexing algorithm uses only a few bits of storage for each database element.
Partially supported by CONACyT grant R-36911A and CYTED VII.19 RIBIDI.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Baeza-Yates, R.: Searching: an algorithmic tour. In: Kent, A., Williams, J. (eds.) Encyclopedia of Computer Science and Technology, vol. 37, pp. 331–359. Marcel Dekker Inc., New York (1997)
Baeza-Yates, R., Cunto, W., Manber, U., Wu, S.: Proximity matching using fixedqueries trees. In: Crochemore, M., Gusfield, D. (eds.) CPM 1994. LNCS, vol. 807, pp. 198–212. Springer, Heidelberg (1994)
Baeza-Yates, R., Ribeiro-Neto, B.: Modern Information Retrieval. Addison-Wesley, Reading (1999)
Burkhard, W., Keller, R.: Some approaches to best-match file searching. Comm. of the ACM 16(4), 230–236 (1973)
Chávez, E., Marroquín, J.L., Navarro, G.: Fixed queries array: A fast and economical data structure for proximity searching. Multimedia Tools and Applications (MTAP) 14(2), 113–135 (2001)
Chávez, E., Navarro, G., Baeza-Yates, R., Marroquín, J.L.: Proximity searching in metric spaces. ACM Computing Surveys 33(3), 273–321 (2001)
Derthick, M., Harrison, J., Moore, A., Roth, S.: Efficient multiobject dynamic query histograms. In: Proceedings of the IEEE Symposium on Information Visualization, pp. 84–91. IEEE Press, Los Alamitos (1999)
Guttman, A.: R-trees: a dynamic index structure for spatial searching. In: Proc. ACM SIGMOD International Conference on Management of Data, pp. 47–57 (1984)
Micó, L., Oncina, J., Vidal, E.: A new version of the nearest-neighbor approximating and eliminating search (AESA) with linear preprocessing-time and memory requirements. Pattern Recognition Letters 15, 9–17 (1994)
Yianilos, P.: Data structures and algorithms for nearest neighbor search in general metric spaces. In: Proc. 4th ACM-SIAM Symposium on Discrete Algorithms (SODA 1993), pp. 311–321 (1993)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Chávez, E., Figueroa, K. (2004). Faster Proximity Searching in Metric Data. In: Monroy, R., Arroyo-Figueroa, G., Sucar, L.E., Sossa, H. (eds) MICAI 2004: Advances in Artificial Intelligence. MICAI 2004. Lecture Notes in Computer Science(), vol 2972. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-24694-7_23
Download citation
DOI: https://doi.org/10.1007/978-3-540-24694-7_23
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-21459-5
Online ISBN: 978-3-540-24694-7
eBook Packages: Springer Book Archive