Abstract
The segment tree is a well-known internal data structure with numerous applications in computational geometry. It allows the dynamical maintenance of a set of intervals such that the intervals enclosing a query point can be found efficiently (point enclosure search).
In this paper we transfer the underlying principle of the segment tree in a nontrivial way to secondary storage and arrive at the EST-an external file structure with the same functionality and the following properties: (1) Point enclosure searches are very efficient—only very few pages are accessed that are not filled to more than 50% with result intervals. (2) A page filling of 50% is guaranteed—on the average it will be around 70%. Although the segment tree represents, in the worst case, each interval by a logarithmic number offragments, in practical cases fragmentation remains low and the storage requirements about linear. (3) The EST is balanced and the update algorithms are efficient. (4) Unlike many other file structures for spatial objects the EST has no problems with an arbitrarydensity, that is, an arbitrarily large number of intervals covering any point of the line.
Furthermore, the EST can be used as a file structureconstructor in the following sense: Let there be a file structureX supporting searches for objects with propertyx and suppose it is necessary to maintain a collection of objects with associated (e.g., time) intervals. Then an EST-X structure that supports searches for objects with propertyx present at timet can be built. This suggests using the EST as a building block in the implementation of temporal database systems. Other applications include the one-dimensional indexing of collections of spatial objects in two or more dimensions.
More generally, this paper shows techniques for mapping internal tree structures with node lists (other examples: range tree, interval tree) to secondary memory. In this context an intriguing theoretical problem, thecover-balancing problem, is solved: Given a tree whose nodes have associatedweights partitioned into subtrees whose weights must lie in a certain range, maintain this partition under weight changes at arbitrary nodes. This is in contrast to classical balancing problems where updates occur only at the leaves.
Similar content being viewed by others
References
Ahn, I., Towards an Implementation of Database Management Systems with Temporal Support.Proceedings of the International Conference on Data Engineering, Los Angeles, CA, 1986, pp. 374–381.
Allen, J. F., Maintaining Knowledge about Temporal Intervals.Comm. ACM 26 (1983), 832–843.
Bentley, J. L., Multidimensional Binary Search Trees Used for Associative Searching.Comm. ACM 18 (1975), 509–517.
Bentley, J. L., Solutions to Klee's Rectangle Problems. Department of Computer Science, Carnegie Mellon University, Unpublished manuscript, 1977.
Bentley, J. L., Multidimensional Bineary Search Trees in Database Application.IEEE Trans. Software Engrg. 5 (1979), 333–340.
Bentley, J. L., Decomposable Searching Problems.Inform. Process. Lett. 8 (1979), 244–351.
Blankenagel, G., Intervall-Indexstukturen und externe Algorithmen für Nicht-Standard-Datenbanksysteme. Dissertation, FernUniversität Hagen, April 1991. Also available as:Intervall-Indexstrukturen in Datenbanksystemen, Informatik-Fachberichte, vol. 312, Springer-Verlag, Berlin, 1992.
Blankenagel, G., and R. H. Güting, XP-Trees: External Priority Search Trees. Informatik-Report 92, FernUniversität Hagen, 1990.
Burkhard, W., Interpolation-Based Index Maintenance.BIT 23 (1983), 274–294.
Edelsbrunner, H., Dynamic Rectangle Intersection Searching. Report F47, Institut für Informationsverarbeitung, Technische Universität Graz, 1980.
Edelsbrunner, H., and H. A. Maurer, On the Intersection of Orthogonal Objects.Inform. Process. Lett. 13 (1981), 177–181.
Faloutsos, C, T. Sellis, and N. Roussopoulos, Analysis of Object Oriented Spatial Access Methods.Proceedings of the ACM SIGMOD Conference, 1987, pp. 426–439.
Finkel, R. A., and J. L. Bentley, Quad Trees: A Data Structure for Retrieval on Composite Keys.Acta Inform. 4 (1974), 1–9.
Frank, A., Applications of DBMS to Land Information Systems.Proceedings of the 7th VLDB Conference, Cannes, 1981, pp. 448–453.
Frank, A. U., and R. Barrera, The Fieldtree: A Data Structure for Geographic Information Systems.Proceedings of the Symposium on the Design and Implementation of Large Spatial Databases, Santa Barbara, CA, 1989, pp. 29–44.
Freeston, M. W., A Well-Behaved File Structure for the Storage of Spatial Objects.Proceedings of the Symposium on the Design and Implementation of Large Spatial Databases, Santa Barbara, CA, 1989, pp. 287–300.
Freeston, M. W., Advances in the Design of the BANG-File. In: W. Litwin, H.-J. Schek (eds.),Proceedings of the 3rd International Conference on Foundations of Data Organization and Algorithms, 1989, Lecture Notes in Computer Science, vol. 367. Springer-Verlag, Berlin, pp. 322–338.
Günther, O., and J. Bilmes, The Implementation of the Cell Tree: Design Alternatives and Performance Evaluation.Proceedings of BTW '89 (Datenbanksysteme in Büro, Technik und Wissenschaft), Zürich, March 1989, Informatik-Fachberichte, vol. 204, Springer-Verlag, Berlin, 1989, pp. 246–265.
Günther, O., and E. Wong, The Arc Tree: An Approximation Scheme to Represent Arbitrary Curved Shapes. In: W. Litwin, H.-J. Schek (eds.),Proceedings of the 3rd International Conference on Foundations of Data Organization and Algorithms, 1989, Lecture Notes in Computer Science, vol. 367, Springer-Verlag, Berlin.
Güting, R. H., Dynamic c-Oriented Polygonal Intersection Searching.Inform, and Control 63 (1984), 143–163.
Güting, R. H., Geo-Relational Algebra: A Model and Query Language for Geometric Database Systems. In: J. W. Schmidt, S. Ceri, and M. Missikoff (eds.),Proceedings of the International Conference on Extending Database Technology, Venice, March 1988, pp. 506–527.
Güting, R. H., Gral: An Extensible Relational Database System for Geometric Applications.Proceedings of the 15th International Conference on Very Large Data Bases, 1989, pp. 33–44.
Güting, R. H., and H.-P. Kriegel,Multidimensional B-Tree: An Efficient Dynamic File Structure for Exact Match Queries. Informatik-Fachberichte, vol. 33, Springer-Verlag, Berlin, pp. 375–388. 1980.
Güting, R. H., and H.-P. Kriegel,Dynamic k-Dimensional Multiway Search Under Time-Varying Access Frequencies. Lecture Notes in Computer Science, vol. 104, Springer-Verlag, Berlin, 1981, pp. 135–145.
Guttman, A., R-Trees: A Dynamic Index Structure for Spatial Searching.Proceedings of the ACM SIGMOD Conference, Boston, 1984, pp. 47–57.
Held, G. D., M. R. Stonebraker, and E. Wong, INGRES—A Relational Data Base System.Proceedings of the 1975 National Computer Conference, AFIPS Press, Reston, VA, 1975, pp. 409–416.
Henrich, A., H.-W. Six, and P. Widmayer, The LSD Tree: Spatial Access to Multi-dimensional Point and Non Point Objects.Proceedings of the 15th International Conference on Very Large Data Bases, Amsterdam, 1989, pp. 45–53.
Hinrichs, K. H., The Grid File System: Implementation and Case Studies of Applications. Dissertation No. 7743, ETH, Zurich, 1985.
Hinrichs, K. H., and J. Nievergelt, The Grid File: A Data Structure Designed To Support Proximity Queries on Spatial Objects. In: M. Nagl and J. Perl (eds.),Proceedings of the WG, Trauner, Linz, 1983, pp. 100–113.
Hutflesz, A., H.-W. Six, and P. Widmayer, Globally Order Preserving Multidimensional Linear Hashing.Proceedings of the 4th International Conference on Data Engineering, Los Angeles, CA, 1988, pp. 572–579.
Hutflesz, A., H.-W. Six, and P. Widmayer, The Twin Grid File: A Nearly Space Optimal Index Structure. In: J. W. Schmidt, S. Ceri, and M. Missikoff (eds.),Proceedings of the International Conference on Extending Database Technology, Venice, March 1988, pp. 352–363.
Hutflesz, A, H.-W. Six, and P. Widmayer, The Twin Grid File: Space Optimizing Access Schemes.Proceedings of the ACM SIGMOD Conference, 1988, pp. 183–190.
Kowalski, R., and M. Sergot, A Logic-Based Calculus of Events.New Generation Comput.4 (1986), 67–95.
Kriegel, H.-P., Variants of Multidimensional B-Trees as Dynamic Index Structure for Associative Retrieval in Database Systems.Proceedings of the 8th Conference on Graphtheoretic Concepts in Computer Science, Hanser, München, 1982, pp. 110–128.
Kriegel, H.-P., and B. Seeger, Techniques for Design and Implementation of Efficient Spatial Access Methods.Proceedings of the 14th International Conference on Very Large Data Bases, Los Angeles, CA, 1988, pp. 360–371.
Kriegel, H.-P., and B. Seeger, PLOP-Hashing: A Grid File Without Directory.Proceedings of the 4th International Conference on Data Engineering, Los Angeles, CA, 1988, pp. 369–376.
Kriegel, H.-P., Schiwietz, R. Schneider, and B. Seeger, Performance Comparison of Point and Spatial Access Methods.Proceedings of the Symposium on the Design and Implementation of Large Spatial Databases, Santa Barbara, CA, 1989, pp. 89–114.
Matsuyama, T., L. V. Hao, and M. Nagao, A File Organization for Geographic Information Systems Based on Spatial Proximity.Comput. Vision Graphics Image Process. 26 (1984), 303–318.
McCreight, E. M., Priority Search Trees. Report CSL-81-5, XEROX Research Center, Palo Atto, CA, 1982.
McCreight, E. M., Priority Search Trees.SIAM J. Comput. 14 (1985), 257–276.
McKenzie, E., Bibliography: Temporal Databases.SIGMOD Record 15 (1986), 40–52.
Nievergelt, J., H. Hinterberger, and K. C. Sevcik, The Grid File: An Adaptable, Symmetric Multikey File Structure.ACM Trans Database Systems 9 (1984), 38–71.
Ooi, B. C.,Spatial kd-Tree: A Data Structure for Geographic Database. Informatik-Fachberichte, vol. 136, Springer-Verlag, Berlin 1987, pp. 247–258.
Otoo, E. J., Balanced Multidimensional Extendible Hash Tree.Proceedings of the 5th ACM SIGACT-SIGMOD Symposium on Principles of Database Systems, Cambridge, MA, 1986, pp. 100–113.
Ouksel, M., and P. Scheuermann, Multidimensional B-Trees: Analysis of Dynamic Behavior,BIT 21 (1981), 401–418.
Overmars, M. H., and M. H. M. Smid, Maintaining Range Trees in Secondary Memory.Proceedings of the 5th STACS, Bordeaux, 1988, pp. 38–51.
Rosenberg, J. B., Geographical Data Structures Compared: A Study of Data Structures Supporting Region Queries.IEEE Trans. Comput. Aided Design 4 (1985), 53–67.
Samet, H., Hierarchical Representations of Collections of Small Rectangles.ACM Comput. Surv. 20 (1988), 271–309.
Sellis, T., Roussopoulos, N., and C. Faloutsos, The R+-Tree: A Dynamic Index for Multi-Dimensional Objects.Proceedings of the 13th International Conference on Very Large Data Bases, Brighton, 1987, pp. 507–518.
Six, H.-W., and P. Widmayer, Hintergrundspeicherstrukturen für ausgedehnte Objekte.Proc. GI-Jahrestagung, Informatik-Fachberichte, vol. 126, Springer-Verlag, Berlin, 1986, pp. 538–552.
Six, H.-W., and P. Widmayer, Spatial Searching in Geometric Databases.Proceedings of the 4th International Conference on Data Engineering, Los Angeles, CA, 1988, pp. 496–503.
Smid, M., Dynamic Data Structures on Multiple Storage Media. Dissertation, Universiteit van Amsterdam, 1989.
Snodgrass, R., The Temporal Query Language TQuel.ACM Trans. Database Systems 12 (1987), 247–298.
Snodgrass, R., and I. Ahn, A Taxomy of Time in Databases.Proceedings of ACM-SIGMOD, Austin, 1985, pp. 236–246.
Tamminen, M., Some Aspects of Defining Spatially Referenced Data.Proceedings of FIG XVI Congress, Montreux, 1981.
Tamminen, M., Efficient Spatial Access to a Database.Proceedings of the A CM SIGMOD Conference, Orlando, 1982, pp. 200–206.
Tamminen, M., and R. Sulonen, The EXCELL Method for Efficient Geometric Access to Data.Proceedings of the 19th ACM IEEE Design Automation Conference, Las Vegas, CA, 1982, pp. 345–351.
Author information
Authors and Affiliations
Additional information
Communicated by Franco P. Preparata.
This work was supported by the DFG (Deutsche Forschungsgemeinschaft) under Grant Cr 65/2-5.
Rights and permissions
About this article
Cite this article
Blankenagel, G., Güting, R.H. External segment trees. Algorithmica 12, 498–532 (1994). https://doi.org/10.1007/BF01188717
Received:
Revised:
Issue Date:
DOI: https://doi.org/10.1007/BF01188717