Skip to main content
Log in

Data migration on parallel disks: Algorithms and evaluation

  • Published:
Algorithmica Aims and scope Submit manuscript

Abstract

Our work is motivated by the problem of managing data on storage devices, typically a set of disks. Such storage servers are used as web servers or multimedia servers, for handling high demand for data. As the system is running, to exhibit good performance, it needs to respond dynamically to changes in demand for different data items. There are known algorithms for mapping demand to a layout. When the demand changes, a new layout can be computed. In this work we study thedata migration problem, which arises when we need to change one layout to another quickly. This problem has been studied earlier where for each disk a new layout has been prescribed. However, to apply these algorithms effectively, we identify another problem that we refer to as the correspondence problem, whose solution has a significant impact on the overall solution for the data migration problem. We study algorithms for the data migration problem in more detail and identify variations of the basic algorithm that seem to improve performance in practice, even though some of the variations have poor worst-case behavior.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. E. Anderson, J. Hall, J. Hartline, M. Hobbes, A. Karlin, J. Saia, R. Swaminathan, and J. Wilkes. An experimental study of data migration algorithms.Workshop on Algorithm Engineering, pp. 145–158. LNCS 2141. Springer, New York, 2001.

    Chapter  Google Scholar 

  2. C. Berge and J. C. Fournier. A short proof for a generalization of Vizing’s theorem.Journal of Graph Theory, 15(3):333–336 (1991).

    Article  MATH  MathSciNet  Google Scholar 

  3. J. A. Bondy and U. S. R. Murty.Graph Theory with Applications. Elsevier, New York, 1977.

    Google Scholar 

  4. A. L. Chervenak. Tertiary Storage: An Evaluation of New Applications. Ph.D. Thesis, UC Berkeley, 1994.

  5. C.-F. Chou, L. Golubchik, J. C. S. Lui, and I.-H. Chung. Design of scalable continuous media servers.Multimedia Tools and Applications, 17(2–3):181–212, 2002.

    Article  Google Scholar 

  6. S. Ghandeharizadeh and R. R. Muntz. Design and implementation of scalable continuous media servers.Parallel Computing Journal, 24(1):91–122, 1998.

    Article  MATH  Google Scholar 

  7. L. Golubchik, S. Khanna, S. Khuller, R. Thurimella, and A. Zhu. Approximation algorithms for data placement on parallel disks.Proc. of ACM-SIAM SODA, pp. 223–232, 2000.

  8. J. Hall, J. Hartline, A. Karlin, J. Saia, and J. Wilkes. On algorithms for efficient data migration.Proc. of ACM-SIAM SODA, pp. 620–629, 2001.

  9. S. Kashyap and S. Khuller. Algorithms for non-uniform size data placement on parallel disks.Proc. of Conference on Foundations of Software Technology and Theoretical Computer Science (FST&TCS), pp. 265–276. LNCS 2914. Springer, Berlin, 2003.

    Google Scholar 

  10. S. Khuller, Y. Kim, and Y-C. Wan. Algorithms for data migration with cloning.Proc. of 22nd ACM Symposium on Principles of Database Systems (PODS), pp. 27–36, 2003.

  11. H. Shachnai and T. Tamir. Polynomial time approximation schemes for class-constrained packing problems.Proc. of Workshop on Approximation Algorithms, pp. 238–249. LNCS 1913. Springer, Berlin, 2000.

    Google Scholar 

  12. H. Shachnai and T. Tamir. On two class-constrained versions of the multiple knapsack problem.Algorithmica, 29:442–467, 2001.

    Article  MATH  MathSciNet  Google Scholar 

  13. H. Shachnai and T. Tamir. Approximation schemes for generalized 2-dimensional vector packing with application to data placement.Proc. of Workshop on Approximation Algorithms, pp. 165–177. LNCS 2764. Springer, Berlin, 2003.

    Google Scholar 

  14. D. B. Shmoys and E. Tardos. An approximation algorithm for the generalized assignment problemMathematical Programming, 62:461–474, 1993.

    Article  MathSciNet  Google Scholar 

  15. V. G. Vizing. On an estimate of the chromatic class of aρ-graph (Russian).Diskretnaya Analiz., 3:25–30, 1964.

    MathSciNet  Google Scholar 

  16. J. Wolf, H. Shachnai, and P. Yu. DASD dancing: a disk load balancing optimization scheme for video-on-demand computer systems.Proc. of ACM SIGMETRICS/Performance Conference, pp. 157–166, 1995.

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Leana Golubchik.

Additional information

This research was supported by the NSF Awards CCR-0113192 and EIA-0091474 as well as the Okawa Research Award. This work made use of Integrated Media Systems Center Shared Facilities supported by the National Science Foundation under Cooperative Agreement No. EEC-9529152; any opinions, findings and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect those of the National Science Foundation. This work was done while Svetlana Shargorodskaya was at the University of Maryland.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Golubchik, L., Khuller, S., Kim, YA. et al. Data migration on parallel disks: Algorithms and evaluation. Algorithmica 45, 137–158 (2006). https://doi.org/10.1007/s00453-005-1194-6

Download citation

  • Received:

  • Revised:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00453-005-1194-6

Key Words

Navigation