skip to main content
10.1145/3195970.3196063acmconferencesArticle/Chapter ViewAbstractPublication PagesdacConference Proceedingsconference-collections
research-article

Improving runtime performance of deduplication system with host-managed SMR storage drives

Published:24 June 2018Publication History

ABSTRACT

Due to the cost consideration for data storage, high-areal-density shingled-magnetic-recording (SMR) drives and data deduplication techniques are getting popular in many data storage services for the improvement of profit per storage unit. However, naively applying deduplication techniques upon SMR drives may dramatically downgrade the runtime performance of data storage services, because of the time-consuming SMR space reclamation processes. This work advocates a vertical integration solution by jointly managing the host-managed SMR drives with deduplication system, in order to essentially relieve the time-consuming SMR space reclamation issue. The proposed design was evaluated by a series of realistic deduplication workloads with encouraging results.

References

  1. Dropbox, inc., http://www.dropbox.com/.Google ScholarGoogle Scholar
  2. A. Aghayev, M. Shafaei, and P. Desnoyers. Skylight-a window on shingled disk operation. ACM Trans. Storage, 11(4):16:1--16:28, Oct. 2015. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. A. Aghayev, T. Ts'o, G. Gibson, and P. Desnoyers. Evolving ext4 for shingled disks. In Proceedings of the 15th Usenix Conference on File and Storage Technologies, FAST'17, pages 105--119. USENIX Association, 2017. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. B. Calder and et al. Windows azure storage: A highly available cloud storage service with strong consistency. In Proceedings of the Twenty-Third ACM Symposium on Operating Systems Principles, SOSP '11, pages 143--157. ACM, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. S. Greaves, Y. Kanai, and H. Muraoka. Shingled recording for 2-3 tbit/in2. IEEE Transactions on Magnetics, 45(10):3823--3829, Oct 2009.Google ScholarGoogle ScholarCross RefCross Ref
  6. D. Kim, S. Song, and B.-Y. Choi. Data Deduplication for Data Optimization for Storage and Network Systems. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Q. M. Le, K. Sathyanarayana Raju, A. Amer, and J. Holliday. Workload impact on shingled write disks: All-writes can be alright. In 2011 IEEE 19th Annual International Symposium on Modelling, Analysis, and Simulation of Computer and Telecommunication Systems, pages 444--446, July 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. D. Meister, J. Kaiser, A. Brinkmann, T. Cortes, M. Kuhn, and J. Kunkel. A study on data deduplication in hpc storage systems. In High Performance Computing, Networking, Storage and Analysis (SC), 2012 International Conference for, pages 1--11, Nov 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. D. T. Meyer and W. J. Bolosky. A study of practical deduplication. In Proceedings of the 9th USENIX Conference on File and Stroage Technologies, FAST'11, pages 1--1, Berkeley, CA, USA, 2011. USENIX Association. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. A. Muthitacharoen, B. Chen, and D. Mazières. A low-bandwidth network file system. In Proceedings of the Eighteenth ACM Symposium on Operating Systems Principles, SOSP '01, pages 174--187. ACM, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. R. Pitchumani, J. Hughes, and E. L. Miller. Smrdb: key-value data store for shingled magnetic recording disks. In SYSTOR, 2015. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. K. Srinivasan, T. Bisson, G. Goodson, and K. Voruganti. idedup: Latency-aware, inline data deduplication for primary storage. In Proceedings of the 10th USENIX Conference on File and Storage Technologies, FAST'12, pages 24--24. USENIX Association, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. C. W. Tsao, Y. H. Chang, M. C. Yang, and P. C. Huang. Efficient victim block selection for flash storage devices. IEEE Transactions on Computers, 64(12):3444--3460, Dec 2015. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. R. Wood, M. Williams, A. Kavcic, and J. Miles. The feasibility of magnetic recording at 10 terabits per square inch on conventional media. IEEE Transactions on Magnetics, 45(2):917--923, Feb 2009.Google ScholarGoogle ScholarCross RefCross Ref
  15. F. Wu, Z. Fan, M. C. Yang, B. Zhang, X. Ge, and D. H. C. Du. Performance evaluation of host aware shingled magnetic recording (ha-smr) drives. IEEE Transactions on Computers, 66(11):1932--1945, Nov 2017.Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. F. Wu, M.-C. Yang, Z. Fan, B. Zhang, X. Ge, and D. H. Du. Evaluating host aware SMR drives. In 8th USENIX Workshop on Hot Topics in Storage and File Systems (HotStorage 16). USENIX Association, 2016. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. B. Zhu, K. Li, and H. Patterson. Avoiding the disk bottleneck in the data domain deduplication file system. In Proceedings of the 6th USENIX Conference on File and Storage Technologies, FAST'08, USENIX Association, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Improving runtime performance of deduplication system with host-managed SMR storage drives

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      DAC '18: Proceedings of the 55th Annual Design Automation Conference
      June 2018
      1089 pages
      ISBN:9781450357005
      DOI:10.1145/3195970

      Copyright © 2018 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 24 June 2018

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      Overall Acceptance Rate1,770of5,499submissions,32%

      Upcoming Conference

      DAC '24
      61st ACM/IEEE Design Automation Conference
      June 23 - 27, 2024
      San Francisco , CA , USA

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader