ABSTRACT
Driven by storage cost, high-areal-density shingled magnetic recording (SMR) drives and data deduplication techniques are becoming popular in data storage services as ways to improve profit per storage unit. However, naively applying deduplication on top of SMR drives may dramatically degrade the runtime performance of these services because of the time-consuming SMR space reclamation process. This work advocates a vertically integrated solution that jointly manages host-managed SMR drives and the deduplication system in order to fundamentally relieve the space reclamation overhead. The proposed design was evaluated with a series of realistic deduplication workloads, with encouraging results.
Improving Runtime Performance of Deduplication System with Host-Managed SMR Storage Drives
2018 55th ACM/ESDA/IEEE Design Automation Conference (DAC)