Skip to main content

An On-Line Reorganization Framework for SAN File Systems

  • Conference paper
Advances in Databases and Information Systems (ADBIS 2006)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 4152))

Abstract

While the cost per megabyte of magnetic disk storage is economical, organizations are alarmed by the increasing cost of managing storage. Storage Area Network (SAN) architectures strive to minimize this cost by consolidating storage devices. A SAN is a special-purpose network that interconnects different data storage devices with servers. While there are many definitions for a SAN, there is a general consensus that it provides access at the granularity of a block and is typically used for database applications.

In this study, we focus on SAN switches that include an embedded storage management software in support of virtualization. We describe an On-line Re-organization Environment, ORE, that controls the placement of data to improve the average response time of the system. ORE is designed for a heterogeneous collection of storage devices. Its key novel feature is its use of “time” to quantify the benefit and cost of a migration. It migrates a fragment only when its net benefit exceeds a pre-specified threshold. We describe a taxonomy of techniques for fragment migration and employ a trace driven simulation study to quantify their tradeoff. Our performance results demonstrate a significant improvement in response time (order of magnitude) for those algorithms that employ ORE’s cost/benefit feature. Moreover, a technique that employs bandwidth of all devices intelligently is superior to one that simply migrates data to the fastest devices.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Amiri, K., Gibson, G., Golding, R.: Highly Concurrent Shared Storage. In: Proceedings of the International Conference on Distributed Computing Systems (April 2000)

    Google Scholar 

  2. Amiri, K., Petrou, D., Ganger, G., Gibson, G.: Dynamice Function Placement for Data-Intensive Cluster Ccomputing. In: Proceedings of the USENIX Annual Technical Conference (June 2000)

    Google Scholar 

  3. Anderson, T., Breitbart, Y., Korth, H., Wool, A.: Replication, Consistency, and Practicality: Are These Mutually Exclusive?. In: Proceedings of ACM SIGMOD p. 27 (1998)

    Google Scholar 

  4. Aref, W., Kamel, I., Niranjan, T., Ghandeharizadeh, S.: Disk Scheduling for Displaying and Recording Video in Non-Linear News Editing Systems. In: Proceedings of Multimedia Computing and Networking Conference (1997)

    Google Scholar 

  5. Arpaci-Dusseau, R.H., Anderson, E., Treuhaft, N., Culler, D.E., Hellerstein, J.M., Patterson, D., Yelick, K.: Cluster I/O with River: Making the Fast Case Common. In: Proceedings of the Sixth Workshop on Input/Output in Parallel and Distributed Systems, Atlanta, GA, pp. 10–22. ACM Press, New York (1999)

    Chapter  Google Scholar 

  6. Berson, S., Ghandeharizadeh, S., Muntz, R., Ju, X.: Staggered Striping in Multimedia Information Systems. In: Proceedings of ACM SIGMOD, pp. 79–90 (1994)

    Google Scholar 

  7. Chen, P.M., Patterson, D.A.: Maximizing Performance in a Striped Disk Array. In: Proc. 17th Annual Int’l Symp. on Computer Architecture, ACM SIGARCH Computer Architecture News, p. 322 (1990)

    Google Scholar 

  8. Copeland, G., Alexander, W., Boughter, E., Keller, T.: Data placement in Bubba. In: Proceedings of ACM SIGMOD, pp. 99–108 (1988)

    Google Scholar 

  9. Dan, A., Sitaram, D.: An Online Video Placement Policy Based on Bandwidth to Space Ratio (BSR). In: Proceedings of ACM SIGMOD, pp. 376–385 (1995)

    Google Scholar 

  10. DeWitt, D., Gray, J.: Parallel Database Systems: The Future of High Performance Database Systems. Communications of the ACM 35(6), 85–98 (1992)

    Article  Google Scholar 

  11. Drapeau, A.L., Shirrif, K.W., Hartman, J.H., Miller, E.L., Seshan, S., Katz, R.H., Lutz, K., Patterson, D.A., Lee, E.K., Chen, P.H., Gibson, G.A.: RAID-II: A high-bandwidth network file server. In: Proceedings of the 21st Annual International Symposium on Computer Architecture, pp. 234–244 (1994)

    Google Scholar 

  12. Ghandeharizadeh, S., Ierardi, D., Kim, D.: Placement of Data in Multi Zone Disk Drives. In: Proceedings of the Second International Baltic Workshop on Databases and Information Systems (1996)

    Google Scholar 

  13. Ghandeharizadeh, S., Ierardi, D., Zimmermann, R.: An Algorithm for Disk Space Management to Minimize Seeks. The Computer Journal 57, 75–81 (1996)

    MATH  Google Scholar 

  14. Ghandeharizadeh, S., Ierardi, D., Zimmermann, R.: Management of Space in Hierarchical Storage Systems. In: Arbib, M., Grethe, J. (eds.) A Guide to Neuroinformatics. Academic Press, London (2001)

    Google Scholar 

  15. Ghandeharizadeh, S., Kim, S., Shi, W., Zimmermann, R.: On minimizing startup latency in scalable continuous media servers. Multimedia Computing and Networking (February 1997)

    Google Scholar 

  16. Gibson, G.: Redundant Disk Arrays: Reliable, Parallel Secondary Storage (1991)

    Google Scholar 

  17. Gibson, G.A., Patterson, D.A.: Designing Disk Arrays for High Data Reliability. Journal of Parallel and Distributed Computing 17(1–2), 4–27 (1993)

    Article  Google Scholar 

  18. Golubchik, L., Muntz, R.R.: Fault Tolerance Issues in Data Declustering for Parallel Database Systems. Data Engineering Bulletin 17(3), 14–28 (1994)

    Google Scholar 

  19. Gray, J., Graefe, G.: The 5 Minute Rule, Ten Years Later. In: SIGMOD Record, vol. 26 (1997)

    Google Scholar 

  20. Gray, J., Helland, P., O’Neil, P., Shasha, D.: The Dangers of Replication and a Solution. In: Proceedings of ACM SIGMOD, pp. 173–182 (1996)

    Google Scholar 

  21. Gray, J., Horst, B., Walker, M.: Parity Striping of Disk Arrays: Low Cost Reliable Storage with Acceptable Throughput. In: Proceedings of the VLDB Conference, pp. 152–162 (September 1990)

    Google Scholar 

  22. Gray, J., Reuter, A.: Transaction Processing: Concepts and Techniques. Morgan Kaufmann, San Francisco (1992)

    Google Scholar 

  23. Hsiao, H., DeWitt, D.: Chained Declustering: A New Availability Strategy for Multiprocessor Database Machines. In: Proceedings of 6th International Data Engineering Conference, pp. 456–465 (1990)

    Google Scholar 

  24. Lakhamraju, M.K., Rastogi, R., Seshadri, S., Sudarshan, S.: On-Line Reorganization in Object Databases. In: Proceedings of ACM SIGMOD, pp. 58–69 (2000)

    Google Scholar 

  25. Lee, E.K., Thekkath, C.A.: Petal: Distributed Virtual Disks. In: Proceedings of the Seventh International Conference on Architectural Support for Programming Languages and Operating Systems, Cambridge, MA, pp. 84–92 (1996)

    Google Scholar 

  26. Lee, M.L., Kitsuregawa, M., Ooi, B.C., Tan, K., Mondal, A.: Towards Self-tuning Data Placement in Parallel Database Systems. In: Proceedings of ACM SIGMOD, pp. 225–236 (2000)

    Google Scholar 

  27. Petrou, D., Amiri, K., Ganger, G., Gibson, G.: Easing the Management of Data-Parallel Systems via Adaptation. In: Proceedings of the 9th ACM SIGOPS European Workshop (September 2000)

    Google Scholar 

  28. Scheuermann, P., Weikum, G., Zabback, P.: “Disk Cooling” in Parallel Disk Systems. Data Engineering Bulletin 17(3), 29–40 (1994)

    Google Scholar 

  29. Scheuermann, P., Weikum, G., Zabbak, P.: Data Partitioning and Load Balancing in Parallel Disk Systems. VLDB Journal 7(1) (1998)

    Google Scholar 

  30. Siewiorek, D.P., Swarz, R.S.: The Theory and Practice of Reliable System Design. Digital Press (1982)

    Google Scholar 

  31. Veitch, A., Riedel, E., Towers, S., Wilkes, J.: Towards Global Storage Management and Data Placement. Technical Report HPL-SSP-2001-1, Hewlett Packard Laboratories (March 2001)

    Google Scholar 

  32. Vingralek, R., Breitbart, Y., Weikum, G.: Snowball: Scalable Storage on Networks of Workstations with Balanced Load. Distributed and Parallel Databases 6(2), 117–156 (1998)

    Article  Google Scholar 

  33. Weikum, G., Hasse, C., Moenkeberg, A., Zabback, P.: The COMFORT Automatic Tuning Project, Invited Project Review. Information Systems 19(5), 381–432 (1994)

    Article  Google Scholar 

  34. Wiesmann, M., Pedone, F., Schiper, A., Kemme, B., Alonso, G.: Understanding Replication in Databases and Distributed Systems. In: Proceedings of 20th International Conference on Distributed Computing Systems (ICDCS 2000), Taipei, Taiwan, R.O.C., pp. 264–274. IEEE Computer Society Press, Los Alamitos (2000)

    Google Scholar 

  35. Wilkes, J., Golding, R., Staelin, C., Sullivan, T.: The HP AutoRAID Hierarchical Storage System. In: Proceedings of the Fifteenth ACM Symposium on Operating Systems Principles, Copper Mountain, CO, pp. 96–108. ACM Press, New York (1995)

    Chapter  Google Scholar 

  36. Wu, K., Yu, P., Chung, J., Teng, J.: A Performance Study of Workfile Disk Management for Concurrent Mergesorts in a Multiprocessor Database System. In: Proceedings of the VLDB Conference, September 1995, pp. 100–110 (1995)

    Google Scholar 

  37. Yu, X., Gum, B., Chen, Y., Wang, R., Li, K., Krishnamurthy, A., Anderson, T.: Trading Capacity for Performance in a Disk Array. In: Symposium on Operating Systems Design and Implementation (October 2000)

    Google Scholar 

  38. Zimmermann, R., Ghandeharizadeh, S.: HERA: Heterogeneous Extension of RAID. In: Proceedings of the International Conference on Parallel and Distributed Processing Techniques and Applications (PDPTA 2000) (June 2000)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2006 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Ghandeharizadeh, S., Gao, S., Gahagan, C., Krauss, R. (2006). An On-Line Reorganization Framework for SAN File Systems. In: Manolopoulos, Y., Pokorný, J., Sellis, T.K. (eds) Advances in Databases and Information Systems. ADBIS 2006. Lecture Notes in Computer Science, vol 4152. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11827252_30

Download citation

  • DOI: https://doi.org/10.1007/11827252_30

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-37899-0

  • Online ISBN: 978-3-540-37900-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics