Abstract
Queries on partitioned signature files, namely Quick Filter (QF), can lead to retrieve from disk a large number of blocks, depending on the specific query pattern. In order to reduce the overall retrieval time, we consider multi-block read schedules that, provided contiguous allocation of blocks of the file on disk surface is guaranteed by the storage system, transfer more than one block at a time. We show that, for any signature query and buffer size, there always exists an optimal schedule whose reads all have the same size, and that such a constant size (CS) schedule can be determined in a time logarithmic in the number of blocks to be retrieved. We then provide analytical results for the expected performance of QF using CS schedules and compare QF with other, sequential-based, signature file organizations. Finally, we suggest how our approach can also be of interest for other file organizations based on multi-attribute hashing.
This work has been partially supported by Italian CNR, under contract no. 94.00388.CT12, and by ESPRIT LTR project no. 9141, HERMES (Foundations of High Performance Multimedia Information Management Systems).
Preview
Unable to display preview. Download preview PDF.
References
P. Ciaccia. Optimal multi-block read schedules for partitioned signature files. Technical Report UBLCS-95-13, University of Bologna, Dept. of Computer Science, August 1995.
P. Ciaccia and D. Maio. Access cost estimation for physical database design. Data and Knowledge Engineering, 11(2):125–150, 1993.
P. Ciaccia and P. Zezula. Estimating accesses in partitioned signature file organizations. ACM Transactions on Information Systems, 11(2):133–142, April 1993.
C. Faloutsos. Signature-based text retrieval methods: A survey. Data Engineering, 13(1):25–32, March 1990.
C. Faloutsos and S. Christodoulakis. Description and performance analysis of signature file methods for office filing. ACM Transactions on Office Information Systems, 5(3):237–257, July 1987.
H.V. Jagadish. Linear clustering of objects with multiple attributes. In Proceedings of the 1990 ACM SIGMOD International Conference on Management of Data, pages 332–342, Atlantic City, NJ, May 1990.
Z. Lin and C. Faloutsos. Frame-sliced signature files. IEEE Transactions on Knowledge and Data Engineering, 4(3):281–289, June 1992.
W. Litwin. Linear hashing: a new tool for files and table addressing. In Proceedings of the 6th VLDB International Conference, pages 212–223, Montreal, Canada, August 1980.
D.L. Lee and C.-W. Leng. Partitioned signature files: Design issues and performance evaluation. ACM Transactions on Office Information Systems, 7(2):158–180, April 1989.
P. G. Selinger, M. M. Astrahan, D. D. Chamberlin, R. A. Lorie, and T. G. Price. Access path selection in a relational database system. In Proceedings of the 1979 ACM SIGMOD International Conference on Management of Data, pages 23–34, May 1979.
B. Seeger, P.-A. Larson, and R. McFayden. Reading a set of disk pages. In Proceedings of the 19th VLDB International Conference, pages 592–603, Dublin, Ireland, August 1993.
P. Zezula, F. Rabitti, and P. Tiberio. Dynamic partitioning of signature files. ACM Transactions on Information Systems, 9(4):336–369, October 1991.
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 1996 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Ciaccia, P. (1996). Optimal multi-block read schedules for partitioned signature files. In: Apers, P., Bouzeghoub, M., Gardarin, G. (eds) Advances in Database Technology — EDBT '96. EDBT 1996. Lecture Notes in Computer Science, vol 1057. Springer, Berlin, Heidelberg. https://doi.org/10.1007/BFb0014156
Download citation
DOI: https://doi.org/10.1007/BFb0014156
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-61057-1
Online ISBN: 978-3-540-49943-5
eBook Packages: Springer Book Archive