Abstract
Our work is motivated by the problem of managing data on storage devices, typically a set of disks. Such storage servers are used as web servers or multimedia servers, for handling high demand for data. As the system is running, to exhibit good performance, it needs to respond dynamically to changes in demand for different data items. There are known algorithms for mapping demand to a layout. When the demand changes, a new layout can be computed. In this work we study thedata migration problem, which arises when we need to change one layout to another quickly. This problem has been studied earlier where for each disk a new layout has been prescribed. However, to apply these algorithms effectively, we identify another problem that we refer to as the correspondence problem, whose solution has a significant impact on the overall solution for the data migration problem. We study algorithms for the data migration problem in more detail and identify variations of the basic algorithm that seem to improve performance in practice, even though some of the variations have poor worst-case behavior.
Similar content being viewed by others
References
E. Anderson, J. Hall, J. Hartline, M. Hobbes, A. Karlin, J. Saia, R. Swaminathan, and J. Wilkes. An experimental study of data migration algorithms.Workshop on Algorithm Engineering, pp. 145–158. LNCS 2141. Springer, New York, 2001.
C. Berge and J. C. Fournier. A short proof for a generalization of Vizing’s theorem.Journal of Graph Theory, 15(3):333–336 (1991).
J. A. Bondy and U. S. R. Murty.Graph Theory with Applications. Elsevier, New York, 1977.
A. L. Chervenak. Tertiary Storage: An Evaluation of New Applications. Ph.D. Thesis, UC Berkeley, 1994.
C.-F. Chou, L. Golubchik, J. C. S. Lui, and I.-H. Chung. Design of scalable continuous media servers.Multimedia Tools and Applications, 17(2–3):181–212, 2002.
S. Ghandeharizadeh and R. R. Muntz. Design and implementation of scalable continuous media servers.Parallel Computing Journal, 24(1):91–122, 1998.
L. Golubchik, S. Khanna, S. Khuller, R. Thurimella, and A. Zhu. Approximation algorithms for data placement on parallel disks.Proc. of ACM-SIAM SODA, pp. 223–232, 2000.
J. Hall, J. Hartline, A. Karlin, J. Saia, and J. Wilkes. On algorithms for efficient data migration.Proc. of ACM-SIAM SODA, pp. 620–629, 2001.
S. Kashyap and S. Khuller. Algorithms for non-uniform size data placement on parallel disks.Proc. of Conference on Foundations of Software Technology and Theoretical Computer Science (FST&TCS), pp. 265–276. LNCS 2914. Springer, Berlin, 2003.
S. Khuller, Y. Kim, and Y-C. Wan. Algorithms for data migration with cloning.Proc. of 22nd ACM Symposium on Principles of Database Systems (PODS), pp. 27–36, 2003.
H. Shachnai and T. Tamir. Polynomial time approximation schemes for class-constrained packing problems.Proc. of Workshop on Approximation Algorithms, pp. 238–249. LNCS 1913. Springer, Berlin, 2000.
H. Shachnai and T. Tamir. On two class-constrained versions of the multiple knapsack problem.Algorithmica, 29:442–467, 2001.
H. Shachnai and T. Tamir. Approximation schemes for generalized 2-dimensional vector packing with application to data placement.Proc. of Workshop on Approximation Algorithms, pp. 165–177. LNCS 2764. Springer, Berlin, 2003.
D. B. Shmoys and E. Tardos. An approximation algorithm for the generalized assignment problemMathematical Programming, 62:461–474, 1993.
V. G. Vizing. On an estimate of the chromatic class of aρ-graph (Russian).Diskretnaya Analiz., 3:25–30, 1964.
J. Wolf, H. Shachnai, and P. Yu. DASD dancing: a disk load balancing optimization scheme for video-on-demand computer systems.Proc. of ACM SIGMETRICS/Performance Conference, pp. 157–166, 1995.
Author information
Authors and Affiliations
Corresponding author
Additional information
This research was supported by the NSF Awards CCR-0113192 and EIA-0091474 as well as the Okawa Research Award. This work made use of Integrated Media Systems Center Shared Facilities supported by the National Science Foundation under Cooperative Agreement No. EEC-9529152; any opinions, findings and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect those of the National Science Foundation. This work was done while Svetlana Shargorodskaya was at the University of Maryland.
Rights and permissions
About this article
Cite this article
Golubchik, L., Khuller, S., Kim, YA. et al. Data migration on parallel disks: Algorithms and evaluation. Algorithmica 45, 137–158 (2006). https://doi.org/10.1007/s00453-005-1194-6
Received:
Revised:
Issue Date:
DOI: https://doi.org/10.1007/s00453-005-1194-6