Skip to main content

Bulk Insertion for R-Tree by Seeded Clustering

  • Conference paper
Database and Expert Systems Applications (DEXA 2003)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2736))

Included in the following conference series:

Abstract

In many scientific and commercial applications such as Earth Observation System (EOSDIS) and mobile phone services tracking a large number of clients, it is a daunting task to archive and index ever increasing volume of complex data that are continuously added to databases. To efficiently manage multidimensional data in scientific and data warehousing environments, R-tree based index structures have been widely used. In this paper, we propose a scalable technique called Seeded Clustering that allows us to maintain R-tree indexes by bulk insertion while keeping pace with high data arrival rates. Our approach uses a seed tree, which is copied from the top k levels of a target R-tree, to classify input data objects into clusters. We then build an R-tree for each of the clusters and insert the input R-trees into the target R-tree in bulk one at a time. We present detailed algorithms for the seeded clustering and bulk insertion as well as the results from our extensive experimental study. The experimental results show that the bulk insertion by seeded clustering outperforms the previously known methods in terms of insertion cost and the quality of target R-trees measured by their query performance.

This work was sponsored in part by the BK 21 Project from the Government of Korea. It was also sponsored in part by National Science Foundation CAREER Award (IIS-9876037), Grant No. IIS-0100436, and Research Infrastructure program EIA-0080123. The authors assume all responsibility for the contents of the paper.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Arge, L., Hinrichs, K.H., Vahrenhold, J., Vitter, J.S.: Efficient Bulk Operations on Dynamic R-Trees. Algorithmica 33(1), 104–128 (2002)

    Article  MATH  MathSciNet  Google Scholar 

  2. Beckmann, N., Kriegel, H.-P., Schneider, R., Seeger, B.: The R*-tree: an efficient and robust access method for points and rectangles. In: Proceedings of the 1990 ACM SIGMOD international conference on Management of data, pp. 322–331 (1990)

    Google Scholar 

  3. Chen, L., Choubey, R., Rundensteiner, E.A.: Bulk-insertions into Rtrees using the small-tree-large-tree approach. In: Proceedings of the sixth ACM international symposium on Advances in geographic information systems, pp. 161–162 (1998)

    Google Scholar 

  4. Choubey, R., Chen, L., Rundersteiner, E.A.: GBI: A Generalized R-tree Bulk-Insertion Strategy. In: Advances in Spatial Databases, pp. 91–108 (1997)

    Google Scholar 

  5. Guttman, A.: R-Trees: A dynamic index structure for spatial searching. In: Proceedings of the 1984 ACM-SIGMOD Conference, pp. 47–57 (June 1984)

    Google Scholar 

  6. Kamel, I., Khalil, M., Kouramajian, V.: Bulk insertion in dynamic R-trees. In: Proceedings of the 4th International Symposium on Spatial Data Handling (SDH 1996), pp. 31–42 (1996)

    Google Scholar 

  7. Kamel, I., Faloutsos, C.: On packing R-trees. In: Proceedings of the second international conference on Information and knowledge management, pp. 490–499 (1993)

    Google Scholar 

  8. Leutenegger, S.T., Edgington, J.M., Lopez, M.A.: STR: A Simple and Efficient Algorithm for R-Tree Packing. In: Proceedings of the IEEE Data Engineering, pp. 497–506 (1997)

    Google Scholar 

  9. TIGER/Line Files, Technical Documentation, U.S. Bureau of Census, Washington DC (2000), accessible via http://www.census.gov/geo/www/tiger/tigerua/uatgr2k.html

  10. TPC-H, Transaction Processing Performance Council, accessible via http://www.tpc.org/tpch/

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2003 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Lee, T., Moon, B., Lee, S. (2003). Bulk Insertion for R-Tree by Seeded Clustering. In: Mařík, V., Retschitzegger, W., Štěpánková, O. (eds) Database and Expert Systems Applications. DEXA 2003. Lecture Notes in Computer Science, vol 2736. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-45227-0_14

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-45227-0_14

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-40806-2

  • Online ISBN: 978-3-540-45227-0

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics