ABSTRACT
Solid-state drives (SSDs) necessitate garbage collection (GC) to erase data blocks and reclaim the space of invalidated data, and GC inevitably introduces additional writes due to data relocation. The performance of GC, which is quantified by cleaning cost or write amplification, is critical to the overall performance of SSDs. However, characterizing GC performance is complicated by the general implementations of GC algorithms and the complex data locality characteristics of real-world workloads. This paper presents a general analytical study to characterize the performance impact of data locality on a general family of GC algorithms. We develop probabilistic models to address two fundamental issues: (1) What is the impact of data locality on the performance of locality-oblivious GC? (2) How can data locality be leveraged to improve the performance in locality-aware GC? We further conduct extensive trace-driven simulations on real-world workloads to validate the findings of our models.
- N. Agrawal, V. Prabhakaran, T. Wobber, J. D. Davis, M. Manasse, and R. Panigrahy. Design Tradeoffs for SSD Performance. In Proc. of USENIX ATC, Jun 2008. Google ScholarDigital Library
- A. Birrell, M. Isard, C. Thacker, and T. Wobber. A Design for High-performance Flash Disks. ACM SIGOPS Oper. Syst. Rev., 41(2):88--93, Apr 2007. Google ScholarDigital Library
- W. Bux and I. Iliadis. Performance of Greedy Garbage Collection in Flash-based Solid-state Drives. Performance Evaluation, Nov 2010. Google ScholarDigital Library
- F. Chen, D. A. Koufaty, and X. Zhang. Understanding Intrinsic Characteristics and System Implications of Flash Memory based Solid State Drives. In Proc. of ACM SIGMETRICS, Jun 2009. Google ScholarDigital Library
- F. Chen, T. Luo, and X. Zhang. CAFTL: A Content-aware Flash Translation Layer Enhancing the Lifespan of Flash Memory Based Solid State Drives. In Proceedings of USENIX, FAST, 2011. Google ScholarDigital Library
- T.-S. Chung, D.-J. Park, S. Park, D.-H. Lee, S.-W. Lee, and H.-J. Song. System Software For Flash Memory: A Survey. In Proc. of Int. Conf. on Embedded and Ubiquitous Computing, Aug 2006. Google ScholarDigital Library
- P. Desnoyers. Analytic Modeling of SSD Write Performance. In Proceedings of SYSTOR, Jun 2012. Google ScholarDigital Library
- E. Gal and S. Toledo. Algorithms and Data Structures for Flash Memories. ACM Computing Surveys, 37(2):138--163, Jun 2005. Google ScholarDigital Library
- A. Gupta, Y. Kim, and B. Urgaonkar. DFTL: A Flash Translation Layer Employing Demand-based Selective Caching of Page-level Address Mappings. In Proc. of ACM ASPLOS, Mar 2009. Google ScholarDigital Library
- A. Gupta, R. Pisolkar, B. Urgaonkar, and A. Sivasubramaniam. Leveraging Value Locality in Optimizing NAND Flash-based SSDs. In Proc. of USENI FAST, 2011. Google ScholarDigital Library
- J.-W. Hsieh, T.-W. Kuo, and L.-P. Chang. Efficient Identification of Hot Data for Flash Memory Storage Systems. ACM TOS, Feb 2006. Google ScholarDigital Library
- X.-Y. Hu, E. Eleftheriou, R. Haas, I. Iliadis, and R. Pletka. Write Amplification Analysis in Flash-based Solid State Drives. In Proc. of SYSTOR, May 2009. Google ScholarDigital Library
- M. Jung and M. Kandemir. Revisiting Widely Held SSD Expectations and Rethinking System-level Implications. In Proc. of ACM SIGMETRICS, Jun 2013. Google ScholarDigital Library
- H.-S. Lee, H.-S. Yun, and D.-H. Lee. HFTL: Hybrid Flash Translation Layer based on Hot Data Identification for Flash Memory. IEEE Trans. on Consumer Electronics, 55(4):2005--2011, 2009. Google ScholarDigital Library
- S.-W. Lee, D.-J. Park, T.-S. Chung, D.-H. Lee, S. Park, and H.-J. Song. A Log Buffer-based Flash Translation Layer Using Fully-associative Sector Translation. ACM TECS, 6(3), Jul 2007. Google ScholarDigital Library
- Y. Li, P. P. C. Lee, and J. C. S. Lui. Stochastic Analysis on RAID Reliability for Solid-State Drives. In Proc. of IEEE SRDS, 2013. Google ScholarDigital Library
- Y. Li, P. P. C. Lee, and J. C. S. Lui. Stochastic Modeling of Large-Scale Solid-State Storage Systems: Analysis, Design Tradeoffs and Optimization. In Proc. of ACM SIGMETRICS, 2013. Google ScholarDigital Library
- Y. Lu, J. Shu, and W. Zheng. Extending the Lifetime of Flash-based Storage through Reducing Write Amplification from File Systems. In Proc. of USENIX FAST, 2013. Google ScholarDigital Library
- Micron Technology. Bad Block Management in NAND Flash Memory. Technical Note, TN-29-59, 2011.Google Scholar
- D. Narayanan, A. Donnelly, and A. Rowstron. Write off-loading: Practical power management for enterprise storage. ACM TOS, 4(3):10:1--10:23, Nov 2008. Google ScholarDigital Library
- C. Park, W. Cheon, J. Kang, K. Roh, W. Cho, and J.-S. Kim. A Reconfigurable FTL (Flash Translation Layer) Architecture for NAND Flash-based Applications. ACM TECS, 7(4):38:1--38:23, Aug 2008. Google ScholarDigital Library
- Z. Qin, Y. Wang, D. Liu, and Z. Shao. Demand-based Block-level Address Mapping in Large-scale NAND Flash Storage Systems. In Proc. of IEEE/ACM/IFIP CODES+ISSS, Oct 2010. Google ScholarDigital Library
- M. Rosenblum and J. K. Ousterhout. The Design and Implementation of a Log-structured File System. ACM Trans. Comput. Syst., 10(1):26--52, Feb 1992. Google ScholarDigital Library
- A. Soga, C. Sun, and K. Takeuchi. NAND Flash Aware Data Management System for High-speed SSDs by Garbage Collection Overhead Suppression. In IEEE 6th International Memory Workshop (IMW), May 2014.Google ScholarCross Ref
- Storage Performance Council. http://traces.cs.umass.edu/index.php/Storage/Storage, 2002.Google Scholar
- B. Van Houdt. A Mean Field Model for a Class of Garbage Collection Algorithms in Flash-based Solid State Drives. In Proc. of ACM SIGMETRICS, Jun 2013. Google ScholarDigital Library
- B. Van Houdt. Performance of Garbage Collection Algorithms for Flash-based Solid State Drives with Hot/cold Data. Performance Evaluation, 70(10):692--703, Sep 2013. Google ScholarDigital Library
- A. Verma, R. Koller, L. Useche, and R. Rangaswami. SRCMap: Energy Proportional Storage using Dynamic Consolidation. In Proc. of USENIX FAST, Feb 2010. Google ScholarDigital Library
- Y. Yang and J. Zhu. Analytical Modeling of Garbage Collection Algorithms in Hotness-aware Flash-based Solid State Drives. In Proc. of IEEE MSST, June 2014.Google ScholarCross Ref
Index Terms
- Impact of Data Locality on Garbage Collection in SSDs: A General Analytical Study
Recommendations
Profile-guided proactive garbage collection for locality optimization
PLDI '06: Proceedings of the 27th ACM SIGPLAN Conference on Programming Language Design and ImplementationMany applications written in garbage collected languages have large dynamic working sets and poor data locality. We present a new system for continuously improving program data locality at run time with low overhead. Our system proactively reorganizes ...
Profile-guided proactive garbage collection for locality optimization
Proceedings of the 2006 PLDI ConferenceMany applications written in garbage collected languages have large dynamic working sets and poor data locality. We present a new system for continuously improving program data locality at run time with low overhead. Our system proactively reorganizes ...
Age-based garbage collection
OOPSLA '99: Proceedings of the 14th ACM SIGPLAN conference on Object-oriented programming, systems, languages, and applicationsModern generational garbage collectors look for garbage among the young objects, because they have high mortality; however, these objects include the very youngest objects, which clearly are still live. We introduce new garbage collection algorithms, ...
Comments