ABSTRACT
Video streaming on the Internet is popular, and the need to store and stream video content using CDNs continues to rise thanks to services such as Hulu and Netflix. Adaptive HTTP streaming over the deployed CDN infrastructure has become the de facto standard for meeting the increasing demand for video streaming on the Internet. The storage architecture used for storing and streaming the video content is the focus of this study. Hard disks have long been the standard storage medium for enterprise-class storage servers. More recently, multi-tiered storage servers that incorporate SSDs, such as Sun's ZFS and Facebook's flashcache, offer an alternative to disk-based storage servers for enterprise applications. Both of these systems use the SSD as a cache between the DRAM and the hard disk. The thesis of our work is that the current state of the art in multi-tiered storage systems, architected for general-purpose enterprise workloads, does not cater to the unique needs of adaptive HTTP streaming. We present FlashStream, a multi-tiered storage architecture that addresses these needs. Like ZFS and flashcache, it incorporates SSDs as a cache between the DRAM and the hard disk. The key architectural elements of FlashStream include an optimal write granularity that overcomes the write amplification effect of flash memory SSDs, and a QoS-sensitive caching strategy that monitors the activity of the flash memory SSDs to ensure that video streaming performance is not hampered by the caching activity. We have implemented FlashStream and experimentally compared it with ZFS and flashcache for adaptive HTTP streaming workloads. We show that FlashStream outperforms both systems on the same hardware configuration. Specifically, it is better by a factor of two than its nearest competitor, ZFS. In addition, we have compared FlashStream with a traditional two-level storage architecture (DRAM + HDDs) and have shown that, for the same investment cost, FlashStream provides 33% better performance and 94% better energy efficiency.
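The two architectural elements named in the abstract can be illustrated with a minimal sketch. It is not FlashStream's implementation: the class `QosAwareSsdFeeder`, its methods, the 4096-byte block size, and the idle test based on in-flight reads are all hypothetical choices made for illustration. The sketch only shows the general idea of deferring cache-fill writes while the SSD is serving streaming reads, and of issuing writes in whole multiples of a fixed granularity.

```python
from collections import deque


class QosAwareSsdFeeder:
    """Hypothetical sketch: defer cache-fill writes while the SSD serves reads."""

    def __init__(self, block_size, busy_threshold=0):
        self.block_size = block_size          # assumed write granularity, in bytes
        self.busy_threshold = busy_threshold  # in-flight reads tolerated before writes pause
        self.inflight_reads = 0               # activity monitor: reads currently being served
        self.pending = deque()                # cache-fill candidates waiting for an idle SSD
        self.written = []                     # stands in for the SSD: (segment_id, bytes written)

    def read_started(self):
        self.inflight_reads += 1

    def read_finished(self):
        self.inflight_reads = max(0, self.inflight_reads - 1)

    def enqueue(self, segment_id, nbytes):
        """Queue a video segment as a candidate for the SSD cache."""
        self.pending.append((segment_id, nbytes))

    def padded_size(self, nbytes):
        """Round a write up to a whole number of blocks."""
        blocks = -(-nbytes // self.block_size)  # ceiling division
        return blocks * self.block_size

    def feed_once(self):
        """Write one pending segment if the SSD is idle; report whether a write happened."""
        if self.inflight_reads > self.busy_threshold or not self.pending:
            return False
        segment_id, nbytes = self.pending.popleft()
        self.written.append((segment_id, self.padded_size(nbytes)))
        return True


feeder = QosAwareSsdFeeder(block_size=4096)
feeder.enqueue("seg-1", 5000)
feeder.read_started()
deferred = feeder.feed_once()   # SSD busy with a read, so the write is deferred
feeder.read_finished()
admitted = feeder.feed_once()   # SSD idle, so the write goes through
```

In this sketch the cache fill never competes with a read in progress, which is one simple way to keep caching activity from degrading streaming throughput. A real system would measure device activity rather than count calls.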