Abstract
The availability of large main memories and the new technologies for disk drives have modified the models for external sorting and have renewed interest in their study. We investigate the replacement-selection paradigm for external sorting. We focus on the performance of the merge phase when given random files and specially when given nearly sorted files since such files are common in practice. In particular, we demonstrate that, during the merge phase, the floating-buffers technique not only reduces the sorting time by fully overlapping I/O and saving seeks, but also it profits significantly from existing order in the input. We also propose a new algorithm for computing a feasible reading sequence for floating buffers that improves upon previous algorithms.
This work was partially carried out under grants from the Natural Sciences and Engineering Research Council of Canada, the Information Technology Research Centre of Ontario and CONACYT 1707-A9209 in México.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
V. Estivill-Castro and D. Wood. A survey of adaptive sorting algorithms. Computing Surveys, 24: 441–476, 1992.
V. Estivill-Castro and D. Wood. Faster External sorting and nearly sortedness. Research Report LANIA-RT-94-9, LANIA, Xalapa, Mexico, 1994.
E. Horowitz and S. Sahni. Fundamentals of Data Structures. Computer Science Press, Inc., Woodland Hill, CA, 1976.
D.E. Knuth. The Art of Computer Programming, Vol.3: Sorting and Searching. Addison-Wesley Publishing Co., Reading, MA, 1973.
C. Ruemmler and J. Wilkes. An Introduction to Disk Drive Modeling. Computer, 27(2): 17–28, March 1994.
B. Salzberg. Merging sorted runs using large main memory. Acta Informatica, 27: 195–215, 1989.
L.Q. Zheng and P. Larson. Speeding up external mergesort. Comp. Sci. Report CS-92-40, U. of Waterloo. To appear in IEEE T. on Knowledge and Data Engineering.
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 1994 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Estivill-Castro, V., Wood, D. (1994). Foundations for faster external sorting. In: Thiagarajan, P.S. (eds) Foundation of Software Technology and Theoretical Computer Science. FSTTCS 1994. Lecture Notes in Computer Science, vol 880. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-58715-2_142
Download citation
DOI: https://doi.org/10.1007/3-540-58715-2_142
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-58715-6
Online ISBN: 978-3-540-49054-8
eBook Packages: Springer Book Archive