Abstract
This paper examines a partial match retrieval scheme which supports range queries for highly dynamic databases. The scheme relies on order preserving multi-attribute hashing. In general, designing optimal indexes is NP-hard. Greedy algorithms used to determine the optimal indexes for simple partial match queries are not directly applicable because there are a larger number of queries to consider in determining the optimal indexes. In this paper we present heuristic algorithms which provide near-optimal solutions. The optimisation scheme we propose can be used to design other dynamic file structures such as the grid file, BANG file and multilevel grid file to further enhance their retrieval performance taking into consideration the query distribution.
Similar content being viewed by others
References
A. V. Aho and J. D. Ullman:Optimal partial-match retrieval when fields are independently specified. ACM Transactions on Database Systems, 4(2): 168–179, June 1979.
J. L. Bentley:Multidimensional binary search trees used for associative searching. Communications of the ACM, 18(9): 509–517, September 1975.
J. L. Bentley:Multidimensional binary search trees in database applications. IEEE Transactions on Software Engineering, SE-5(4): 333–340, July 1979.
W. A. Burkhard:Interpolation-based index maintenance. BIT, 23: 274–294, 1983.
R. Cichelli:Minimal perfect hash functions made simple. Communications of the ACM, 23,(1): 17–19, January 1980.
R. Fagin, J. Nievergelt, and H. R. Strong:Extendible hashing — a fast access method for dynamic files. ACM Transactions on Database Systems, 4(3): 315–344, September 1979.
M. Freeston:The BANG file: a new kind of grid file. In U. Dayal and I. Traiger, editors, Proceedings of the 1987 ACM SIGMOD International Conference on the Management of Data, pages 260–269, San Francisco, California, USA, May 1987.
Y. Hsiao and A. L. Tharp:Adaptive hashing. Information Systems, 13(1): 111–127, 1988.
D. E. Knuth:Sorting and Searching, volume 3 ofThe Art of Computer Programming. Addison-Wesley, Reading, Massachusetts, USA, 1973.
H. Kriegel:Performance comparison of index structures for multi-key retrieval. In B. Yormark, editor, Proceedings of the 1984 ACM SIGMOD International Conference on the Management of Data, pages 186–196, Boston, MA, USA, June 1984.
H. Kriegel and B. Seeger:Multidimensional order preserving linear hashing with partial expansions. In G. Ausiello and P. Atzeni, editors, Proceedings of the International Conference on Database Theory, pages 203–220, Rome, Italy, September 1986. Also published as Lecture Notes in Computer Science 243 by Springer-Verlag.
P. Å. Larson:Linear hashing with partial expansions. In Proceedings of the Sixth International Conference on Very Large Data Bases, pages 224–232, Montreal, Canada, October 1980.
W. Litwin:Linear hashing: a new tool for file and table addressing. In Proceedings of the Sixth International Conference on Very Large Data Bases, pages 212–223, Montreal, Canada, October 1980.
J. W. Lloyd:Optimal partial-match retrieval. BIT, 20: 406–413, 1980.
J. W. Lloyd and K. Ramamohanarao:Partial-match retrieval for dynamic files. BIT, 22: 150–168, 1982.
S. Moran:On the complexity of designing optimal partial-match retrieval systems. ACM Transactions on Database Systems, 8(4): 543–551, December 1983.
S. Nahar, S. Sahni, and E. Shargowitz:Experiments with simulated annealing. In Proceedings of the 22nd Design Automation Conference, pages 748–752, 1985.
J. Nievergelt, H. Hinterberger, and K. C. Sevcik:The grid file: An adaptable symmetric multikey file structure. ACM Transactions on Database Systems, 9(1): 38–71, March 1984.
J. A. Orenstein:A dynamic hash file for random and sequential access. In M. Schkolnick and C. Thanos, editors, Proceedings of the Ninth International Conference on Very Large Data Bases, pages 132–141, Florence, Italy, November 1983.
K. Ramamohanarao and J. W. Lloyd:Dynamic hashing schemes. The Computer Journal, 25: 478–485, 1982.
K. Ramamohanarao and R. Sacks-Davis:Recursive linear hashing. ACM Transactions on Database Systems, 8(9): 369–391, September 1984.
K. Ramamohanarao, J. Shepherd, and R. Sacks-Davis:Multi-attribute hashing with multiple file copies for high performance partial-match retrieval. BIT, 30: 404–423, 1990.
R. L. Rivest:Partial-match retrieval algorithms. SIAM Journal on Computing, 5(1): 19–50, March 1976.
C. S. Roberts:Partial-match retrieval via the method of superimposed codes. Proceedings of the IEEE, 67(12): 1624–1642, December 1979.
J. T. Robinson:The k-d-B-tree: a search structure for large multidimensional dynamic indexes. In Proceedings of the SIGMOD Conference, pages 10–18, Ann Arbor, MI, USA, April 1981.
T. J. Sager:A polynomial time generator for minimal perfect hash functions. Communications of the ACM, 28(5): 523–532, May 1985.
K.-Y. Whang and R. Krishnamurthy:The multilevel grid file — a dynamic hierarchical multidimensional file structure. In International Symposium on Database Systems for Advanced Applications, pages 449–459, Tokyo, Japan, April 1991.
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Harris, E.P., Ramamohanarao, K. Optimal dynamic multi-attribute hashing for range queries. BIT 33, 561–579 (1993). https://doi.org/10.1007/BF01990535
Received:
Revised:
Issue Date:
DOI: https://doi.org/10.1007/BF01990535