Beyond mirroring: multi-version disk array with improved performance and energy efficiency

Journal of Zhejiang University SCIENCE C

Abstract

Performance and power consumption are two important design objectives for data centers consisting of thousands or tens of thousands of disks (or disk arrays). To address both objectives, in this study we propose a multi-version disk array (MDA). The main idea of MDA is to let I/O workload characteristics guide the replication strategy: MDA replicates multiple versions of popular data blocks and offloads write data to the free space of the reserved version region, thus achieving high performance in burst periods and low power consumption in idle periods. Our prototype implementation of MDA and its performance evaluation show that MDA outperforms traditional RAID10 by up to 34.4% and 42.3% in terms of average response time for online transaction processing (OLTP) application I/O and search engine I/O, respectively. Moreover, the energy efficiency of MDA exceeds that of RAID10 by up to 48.7% and 36.4%, respectively, for the same two workloads.
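The abstract does not spell out how popular blocks are identified, so the following is only a minimal sketch of one plausible approach: count reads per block in a trace and treat the most frequently read fraction as candidates for multi-version replication. The function name `pick_hot_blocks`, the `(op, block)` trace format, and the `top_fraction` parameter are all hypothetical and are not taken from the paper.

```python
from collections import Counter


def pick_hot_blocks(trace, top_fraction=0.1):
    """Return the most frequently read block IDs in an I/O trace.

    `trace` is an iterable of (op, block_id) pairs, where op is "R" or "W".
    This is an illustrative popularity heuristic, not the MDA algorithm itself.
    """
    # Only reads count toward popularity; writes are ignored here.
    reads = Counter(block for op, block in trace if op == "R")
    if not reads:
        return []
    # Keep at least one block so a tiny trace still yields a candidate.
    k = max(1, int(len(reads) * top_fraction))
    return [block for block, _ in reads.most_common(k)]


if __name__ == "__main__":
    sample = [("R", 1), ("R", 1), ("R", 2), ("W", 3), ("R", 1), ("R", 4)]
    print(pick_hot_blocks(sample, top_fraction=0.5))
```

In a design like the one the abstract outlines, a list of this kind would feed the replication step, while writes would be redirected separately to the reserved region; both of those steps are outside this sketch.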



Author information

Corresponding author

Correspondence to Su-zhen Wu.

Additional information

Project supported by the National Natural Science Foundation of China (No. 61100033), the US National Science Foundation (Nos. NSF-CNS-1016609 and NSF-IIS-0916859), and the Changjiang Innovative Group of Education of China (No. IRT0725).

About this article

Cite this article

Mao, B., Wu, Sz. & Feng, D. Beyond mirroring: multi-version disk array with improved performance and energy efficiency. J. Zhejiang Univ. - Sci. C 12, 721–729 (2011). https://doi.org/10.1631/jzus.C1000407

  • DOI: https://doi.org/10.1631/jzus.C1000407
