Abstract
Given two datasets \(\mathcal{D}_{A}\) and \(\mathcal{D}_{B}\) the closest-pair query (CPQ) retrieves the pair (a,b), where \(a \epsilon \mathcal{D}_{A}\) and \(b \epsilon \mathcal{D}_{B}\), having the smallest distance between all pairs of objects. An extension to this problem is to generate the k closest pairs of objects (k-CPQ). In several cases spatial constraints are applied, and object pairs that are retrieved must also satisfy these constraints. Although the application of spatial constraints seems natural towards a more focused search, only recently they have been studied for the CPQ problem with the restriction that \(\mathcal{D}_{A}\) = \(\mathcal{D}_{B}\). In this work we focus on constrained closest-pair queries (CCPQ), between two distinct datasets \(\mathcal{D}_{A}\) and \(\mathcal{D}_{B}\), where objects from \(\mathcal{D}_{A}\) must be enclosed by a spatial region R. A new algorithm is proposed, which is compared with a modified closest-pair algorithm. The experimental results demonstrate that the proposed approach is superior with respect to CPU and I/O costs.
Research supported by ARCHIMEDES project 2.2.14, “Management of Moving Objects and the WWW”, of the Technological Educational Institute of Thessaloniki (EPEAEK II), and by the 2003-2005 Serbian-Greek joint research and technology program.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Beckmann, N., Kriegel, H.-P., Schneider, R., Seeger, B.: The R*-tree: an Efficient and Robust Access Method for Points and Rectangles. In: Proc. ACM SIGMOD, Atlantic City, NJ, May 1990, pp. 322–331 (1990)
Bohm, C., Krebs, F.: Supporting KDD Applications by the K-Nearest Neighbor Join. In: Proceedings of the 14th International Conference on Database and Expert System Applications (DEXA 2003), Prague, Czech Republic, pp. 504–516 (2003)
Bohm, C., Krebs, F.: The k-Nearest Neighbor Join: Turbo Charging the KDD Process. Knowledge and Information Systems, KAIS (2004)
Breunig, M.M., Kriegel, H.-P., Ng, R.T., Sander, J.: LOF: Identifying Density-Based Local Outliers. In: Proceedings of the ACM International Conference on the Management of Data (SIGMOD 2000), Dallas, TX, pp. 93–104 (2000)
Brinkhoff, T., Kriegel, H.P., Seeger, B.: Efficient Processing of Spatial Joins Using R-trees. In: Proceedings of the ACM International Conference on Management of Data (SIGMOD 1993), Washington, D.C, May 1993, pp. 237–246 (1993)
Corral, A., Manolopoulos, Y., Theodoridis, Y., Vassilakopoulos, M.: Closest Pair Queries in Spatial Databases. In: Proceedings of the ACM International Conference on the Management of Data (SIGMOD 2000), Dallas, TX (2000)
Corral, A., Manolopoulos, Y., Theodoridis, Y., Vassilakopoulos, M.: Algorithms for Processing K-Closest-Pair Queries in Spatial Databases. Data and Knowledge Engineering (DKE) 49(1), 67–104 (2004)
Eppstein, D.: Fast Hierarchical Clustering and Other Applications of Dynamic Closest Pairs. Journal of Experimental Algorithmics 5(1), 1–23 (2000)
Hjaltason, G.R., Samet, H.: Incremental Distance Join Algorithms for Spatial Databases. In: Proceedings of ACM SIGMOD Conference, pp. 237–248 (1998)
Karypis, G., Han, E.-H., Kumar, V.: Chameleon: Hierarchical Clustering Using Dynamic Modeling. Computer 32(8), 68–75 (1999)
Mishra, P., Eich, M.H.: Join Processing in Relational Databases. ACM Computing Surveys 24(1) (1992)
Nanopoulos, A., Theodoridis, Y., Manolopoulos, Y.: C2P: Clustering Based on Closest Pairs. In: Proceedings of the 27th International Conference on Very Large Databases (VLDB 2001), Roma, Italy (2001)
Shan, J., Zhang, D., Salzberg, B.: On Spatial-Range Closest-Pair Query. In: Proceedings of the 8th International Symposium on Spatial and Temporal Databases (SSTD 2003), Santorini, Greece, pp. 252–269 (2003)
Shim, K., Srikant, R., Agrawal, R.: High-Dimensional Similarity Joins. IEEE Transactions on Knowledge and Data Engineering (TKDE) 14(1), 156–171 (2002)
Shin, H., Moon, B., Lee, S.: Adaptive Multi-Stage Distance Join Processing. In: Proceedings of the ACM SIGMOD Conference, pp. 343–354 (2000)
Shou, Y., Mamoulis, N., Cao, H., Papadias, D., Cheung, D.W.: Evaluation of Iceberg Distance Joins. In: Proceedings of the 8th International Symposium on Spatial and Temporal Databases (SSTD 2003), Santorini, Greece, pp. 270–278 (2003)
Tao, Y., Papadias, D.: Time-Parameterized Queries in Spatio-Temporal Databases. In: Proceedings of the ACM International Conference on the Management of Data (SIGMOD 2002), pp. 334–345 (2002)
TIGER/Line Files, 1994 Technical Documentation / prepared by the Bureau of the Census, Washington, DC (1994)
Xia, C., Lu, H., Ooi, B.C., Hu, J.: GORDER: An Efficient Method for KNN Processing. In: Proceedings of the 30th International Conference on Very Large Data Bases (VLDB 2004), Toronto, Canada, pp. 756–767 (2004)
Zhang, J., Mamoulis, N., Papadias, D., Tao, Y.: All-Nearest-Neighbors Queries in Spatial Databases. In: Proceedings of the 16th International Conference on Scientific and Statistical Databases (SSDBM 2004), Santorini, Greece, pp. 297–306 (2004)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Papadopoulos, A.N., Nanopoulos, A., Manolopoulos, Y. (2005). Closest Pair Queries with Spatial Constraints. In: Bozanis, P., Houstis, E.N. (eds) Advances in Informatics. PCI 2005. Lecture Notes in Computer Science, vol 3746. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11573036_1
Download citation
DOI: https://doi.org/10.1007/11573036_1
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-29673-7
Online ISBN: 978-3-540-32091-3
eBook Packages: Computer ScienceComputer Science (R0)