Skip to main content
Log in

External segment trees

  • Published:
Algorithmica Aims and scope Submit manuscript

Abstract

The segment tree is a well-known internal data structure with numerous applications in computational geometry. It allows the dynamical maintenance of a set of intervals such that the intervals enclosing a query point can be found efficiently (point enclosure search).

In this paper we transfer the underlying principle of the segment tree in a nontrivial way to secondary storage and arrive at the EST-an external file structure with the same functionality and the following properties: (1) Point enclosure searches are very efficient—only very few pages are accessed that are not filled to more than 50% with result intervals. (2) A page filling of 50% is guaranteed—on the average it will be around 70%. Although the segment tree represents, in the worst case, each interval by a logarithmic number offragments, in practical cases fragmentation remains low and the storage requirements about linear. (3) The EST is balanced and the update algorithms are efficient. (4) Unlike many other file structures for spatial objects the EST has no problems with an arbitrarydensity, that is, an arbitrarily large number of intervals covering any point of the line.

Furthermore, the EST can be used as a file structureconstructor in the following sense: Let there be a file structureX supporting searches for objects with propertyx and suppose it is necessary to maintain a collection of objects with associated (e.g., time) intervals. Then an EST-X structure that supports searches for objects with propertyx present at timet can be built. This suggests using the EST as a building block in the implementation of temporal database systems. Other applications include the one-dimensional indexing of collections of spatial objects in two or more dimensions.

More generally, this paper shows techniques for mapping internal tree structures with node lists (other examples: range tree, interval tree) to secondary memory. In this context an intriguing theoretical problem, thecover-balancing problem, is solved: Given a tree whose nodes have associatedweights partitioned into subtrees whose weights must lie in a certain range, maintain this partition under weight changes at arbitrary nodes. This is in contrast to classical balancing problems where updates occur only at the leaves.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. Ahn, I., Towards an Implementation of Database Management Systems with Temporal Support.Proceedings of the International Conference on Data Engineering, Los Angeles, CA, 1986, pp. 374–381.

  2. Allen, J. F., Maintaining Knowledge about Temporal Intervals.Comm. ACM 26 (1983), 832–843.

    Google Scholar 

  3. Bentley, J. L., Multidimensional Binary Search Trees Used for Associative Searching.Comm. ACM 18 (1975), 509–517.

    Google Scholar 

  4. Bentley, J. L., Solutions to Klee's Rectangle Problems. Department of Computer Science, Carnegie Mellon University, Unpublished manuscript, 1977.

  5. Bentley, J. L., Multidimensional Bineary Search Trees in Database Application.IEEE Trans. Software Engrg. 5 (1979), 333–340.

    Google Scholar 

  6. Bentley, J. L., Decomposable Searching Problems.Inform. Process. Lett. 8 (1979), 244–351.

    Google Scholar 

  7. Blankenagel, G., Intervall-Indexstukturen und externe Algorithmen für Nicht-Standard-Datenbanksysteme. Dissertation, FernUniversität Hagen, April 1991. Also available as:Intervall-Indexstrukturen in Datenbanksystemen, Informatik-Fachberichte, vol. 312, Springer-Verlag, Berlin, 1992.

  8. Blankenagel, G., and R. H. Güting, XP-Trees: External Priority Search Trees. Informatik-Report 92, FernUniversität Hagen, 1990.

  9. Burkhard, W., Interpolation-Based Index Maintenance.BIT 23 (1983), 274–294.

    Google Scholar 

  10. Edelsbrunner, H., Dynamic Rectangle Intersection Searching. Report F47, Institut für Informationsverarbeitung, Technische Universität Graz, 1980.

  11. Edelsbrunner, H., and H. A. Maurer, On the Intersection of Orthogonal Objects.Inform. Process. Lett. 13 (1981), 177–181.

    Google Scholar 

  12. Faloutsos, C, T. Sellis, and N. Roussopoulos, Analysis of Object Oriented Spatial Access Methods.Proceedings of the ACM SIGMOD Conference, 1987, pp. 426–439.

  13. Finkel, R. A., and J. L. Bentley, Quad Trees: A Data Structure for Retrieval on Composite Keys.Acta Inform. 4 (1974), 1–9.

    Google Scholar 

  14. Frank, A., Applications of DBMS to Land Information Systems.Proceedings of the 7th VLDB Conference, Cannes, 1981, pp. 448–453.

  15. Frank, A. U., and R. Barrera, The Fieldtree: A Data Structure for Geographic Information Systems.Proceedings of the Symposium on the Design and Implementation of Large Spatial Databases, Santa Barbara, CA, 1989, pp. 29–44.

  16. Freeston, M. W., A Well-Behaved File Structure for the Storage of Spatial Objects.Proceedings of the Symposium on the Design and Implementation of Large Spatial Databases, Santa Barbara, CA, 1989, pp. 287–300.

  17. Freeston, M. W., Advances in the Design of the BANG-File. In: W. Litwin, H.-J. Schek (eds.),Proceedings of the 3rd International Conference on Foundations of Data Organization and Algorithms, 1989, Lecture Notes in Computer Science, vol. 367. Springer-Verlag, Berlin, pp. 322–338.

    Google Scholar 

  18. Günther, O., and J. Bilmes, The Implementation of the Cell Tree: Design Alternatives and Performance Evaluation.Proceedings of BTW '89 (Datenbanksysteme in Büro, Technik und Wissenschaft), Zürich, March 1989, Informatik-Fachberichte, vol. 204, Springer-Verlag, Berlin, 1989, pp. 246–265.

    Google Scholar 

  19. Günther, O., and E. Wong, The Arc Tree: An Approximation Scheme to Represent Arbitrary Curved Shapes. In: W. Litwin, H.-J. Schek (eds.),Proceedings of the 3rd International Conference on Foundations of Data Organization and Algorithms, 1989, Lecture Notes in Computer Science, vol. 367, Springer-Verlag, Berlin.

    Google Scholar 

  20. Güting, R. H., Dynamic c-Oriented Polygonal Intersection Searching.Inform, and Control 63 (1984), 143–163.

    Google Scholar 

  21. Güting, R. H., Geo-Relational Algebra: A Model and Query Language for Geometric Database Systems. In: J. W. Schmidt, S. Ceri, and M. Missikoff (eds.),Proceedings of the International Conference on Extending Database Technology, Venice, March 1988, pp. 506–527.

  22. Güting, R. H., Gral: An Extensible Relational Database System for Geometric Applications.Proceedings of the 15th International Conference on Very Large Data Bases, 1989, pp. 33–44.

  23. Güting, R. H., and H.-P. Kriegel,Multidimensional B-Tree: An Efficient Dynamic File Structure for Exact Match Queries. Informatik-Fachberichte, vol. 33, Springer-Verlag, Berlin, pp. 375–388. 1980.

    Google Scholar 

  24. Güting, R. H., and H.-P. Kriegel,Dynamic k-Dimensional Multiway Search Under Time-Varying Access Frequencies. Lecture Notes in Computer Science, vol. 104, Springer-Verlag, Berlin, 1981, pp. 135–145.

    Google Scholar 

  25. Guttman, A., R-Trees: A Dynamic Index Structure for Spatial Searching.Proceedings of the ACM SIGMOD Conference, Boston, 1984, pp. 47–57.

  26. Held, G. D., M. R. Stonebraker, and E. Wong, INGRES—A Relational Data Base System.Proceedings of the 1975 National Computer Conference, AFIPS Press, Reston, VA, 1975, pp. 409–416.

    Google Scholar 

  27. Henrich, A., H.-W. Six, and P. Widmayer, The LSD Tree: Spatial Access to Multi-dimensional Point and Non Point Objects.Proceedings of the 15th International Conference on Very Large Data Bases, Amsterdam, 1989, pp. 45–53.

  28. Hinrichs, K. H., The Grid File System: Implementation and Case Studies of Applications. Dissertation No. 7743, ETH, Zurich, 1985.

    Google Scholar 

  29. Hinrichs, K. H., and J. Nievergelt, The Grid File: A Data Structure Designed To Support Proximity Queries on Spatial Objects. In: M. Nagl and J. Perl (eds.),Proceedings of the WG, Trauner, Linz, 1983, pp. 100–113.

    Google Scholar 

  30. Hutflesz, A., H.-W. Six, and P. Widmayer, Globally Order Preserving Multidimensional Linear Hashing.Proceedings of the 4th International Conference on Data Engineering, Los Angeles, CA, 1988, pp. 572–579.

  31. Hutflesz, A., H.-W. Six, and P. Widmayer, The Twin Grid File: A Nearly Space Optimal Index Structure. In: J. W. Schmidt, S. Ceri, and M. Missikoff (eds.),Proceedings of the International Conference on Extending Database Technology, Venice, March 1988, pp. 352–363.

  32. Hutflesz, A, H.-W. Six, and P. Widmayer, The Twin Grid File: Space Optimizing Access Schemes.Proceedings of the ACM SIGMOD Conference, 1988, pp. 183–190.

  33. Kowalski, R., and M. Sergot, A Logic-Based Calculus of Events.New Generation Comput.4 (1986), 67–95.

    Google Scholar 

  34. Kriegel, H.-P., Variants of Multidimensional B-Trees as Dynamic Index Structure for Associative Retrieval in Database Systems.Proceedings of the 8th Conference on Graphtheoretic Concepts in Computer Science, Hanser, München, 1982, pp. 110–128.

    Google Scholar 

  35. Kriegel, H.-P., and B. Seeger, Techniques for Design and Implementation of Efficient Spatial Access Methods.Proceedings of the 14th International Conference on Very Large Data Bases, Los Angeles, CA, 1988, pp. 360–371.

  36. Kriegel, H.-P., and B. Seeger, PLOP-Hashing: A Grid File Without Directory.Proceedings of the 4th International Conference on Data Engineering, Los Angeles, CA, 1988, pp. 369–376.

  37. Kriegel, H.-P., Schiwietz, R. Schneider, and B. Seeger, Performance Comparison of Point and Spatial Access Methods.Proceedings of the Symposium on the Design and Implementation of Large Spatial Databases, Santa Barbara, CA, 1989, pp. 89–114.

  38. Matsuyama, T., L. V. Hao, and M. Nagao, A File Organization for Geographic Information Systems Based on Spatial Proximity.Comput. Vision Graphics Image Process. 26 (1984), 303–318.

    Google Scholar 

  39. McCreight, E. M., Priority Search Trees. Report CSL-81-5, XEROX Research Center, Palo Atto, CA, 1982.

    Google Scholar 

  40. McCreight, E. M., Priority Search Trees.SIAM J. Comput. 14 (1985), 257–276.

    Google Scholar 

  41. McKenzie, E., Bibliography: Temporal Databases.SIGMOD Record 15 (1986), 40–52.

    Google Scholar 

  42. Nievergelt, J., H. Hinterberger, and K. C. Sevcik, The Grid File: An Adaptable, Symmetric Multikey File Structure.ACM Trans Database Systems 9 (1984), 38–71.

    Google Scholar 

  43. Ooi, B. C.,Spatial kd-Tree: A Data Structure for Geographic Database. Informatik-Fachberichte, vol. 136, Springer-Verlag, Berlin 1987, pp. 247–258.

    Google Scholar 

  44. Otoo, E. J., Balanced Multidimensional Extendible Hash Tree.Proceedings of the 5th ACM SIGACT-SIGMOD Symposium on Principles of Database Systems, Cambridge, MA, 1986, pp. 100–113.

  45. Ouksel, M., and P. Scheuermann, Multidimensional B-Trees: Analysis of Dynamic Behavior,BIT 21 (1981), 401–418.

    Google Scholar 

  46. Overmars, M. H., and M. H. M. Smid, Maintaining Range Trees in Secondary Memory.Proceedings of the 5th STACS, Bordeaux, 1988, pp. 38–51.

  47. Rosenberg, J. B., Geographical Data Structures Compared: A Study of Data Structures Supporting Region Queries.IEEE Trans. Comput. Aided Design 4 (1985), 53–67.

    Google Scholar 

  48. Samet, H., Hierarchical Representations of Collections of Small Rectangles.ACM Comput. Surv. 20 (1988), 271–309.

    Google Scholar 

  49. Sellis, T., Roussopoulos, N., and C. Faloutsos, The R+-Tree: A Dynamic Index for Multi-Dimensional Objects.Proceedings of the 13th International Conference on Very Large Data Bases, Brighton, 1987, pp. 507–518.

  50. Six, H.-W., and P. Widmayer, Hintergrundspeicherstrukturen für ausgedehnte Objekte.Proc. GI-Jahrestagung, Informatik-Fachberichte, vol. 126, Springer-Verlag, Berlin, 1986, pp. 538–552.

    Google Scholar 

  51. Six, H.-W., and P. Widmayer, Spatial Searching in Geometric Databases.Proceedings of the 4th International Conference on Data Engineering, Los Angeles, CA, 1988, pp. 496–503.

  52. Smid, M., Dynamic Data Structures on Multiple Storage Media. Dissertation, Universiteit van Amsterdam, 1989.

  53. Snodgrass, R., The Temporal Query Language TQuel.ACM Trans. Database Systems 12 (1987), 247–298.

    Google Scholar 

  54. Snodgrass, R., and I. Ahn, A Taxomy of Time in Databases.Proceedings of ACM-SIGMOD, Austin, 1985, pp. 236–246.

  55. Tamminen, M., Some Aspects of Defining Spatially Referenced Data.Proceedings of FIG XVI Congress, Montreux, 1981.

  56. Tamminen, M., Efficient Spatial Access to a Database.Proceedings of the A CM SIGMOD Conference, Orlando, 1982, pp. 200–206.

  57. Tamminen, M., and R. Sulonen, The EXCELL Method for Efficient Geometric Access to Data.Proceedings of the 19th ACM IEEE Design Automation Conference, Las Vegas, CA, 1982, pp. 345–351.

Download references

Author information

Authors and Affiliations

Authors

Additional information

Communicated by Franco P. Preparata.

This work was supported by the DFG (Deutsche Forschungsgemeinschaft) under Grant Cr 65/2-5.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Blankenagel, G., Güting, R.H. External segment trees. Algorithmica 12, 498–532 (1994). https://doi.org/10.1007/BF01188717

Download citation

  • Received:

  • Revised:

  • Issue Date:

  • DOI: https://doi.org/10.1007/BF01188717

Key words

Navigation