Abstract
Computational scientific applications tend to be very data I/O intensive, producing a large amount of data as the execution result. In this research, we propose a new storage system using next-generation non-volatile memory that is suitable for exa-scale computing systems. This storage system is called the Cloud Computing Burst System (CCBS) and is composed of a unified table management module, data scoring module, and CCBS storage. In particular, CCBS operates as a workload enlightened storage system using its own data scoring module. The CCBS storage architecture consists of PCM/NAND Flash arrays and a data migration engine. CCBS storage cannot only provide a scaling out feature, but also improve the overall performance of the storage system. In addition, by using new non-volatile memory array, many benefits, such as low energy consumption, density scaling, and high performance, can be achieved. We demonstrate the effectiveness of our proposed system by simulating the storage system using scientific benchmarking tool. Our data scoring algorithm can provide 7% more hit rate than other methods for CCBS. In addition, our proposed system has improved storage system speed by 1.64 times, compared with only NAND Flash conventional model.
Similar content being viewed by others
References
Mesnier M, Ganger G, Riedel E (2003) Storage area networking—object-based storage. IEEE Commun Mag 41(8):84–90
Fitzpatrick B (2004) Distributed caching with memcached. Linux J 2004(124):5
Beaver D, Kumar S, Li H, Sobel J, Vajgel P (2010) Finding a needle in Haystack: facebook’s photo storage. In: 9th USENIX Symposium on Operating Systems Design and Implementation. Vancouver, pp 1–8
Li Y, Guo L, Guo Y (2013) An efficient and performance-aware big data storage system. In: Ivanov II, van Sinderen M, Leymann F, Shan T (eds) Cloud computing and services science. CLOSER 2012. Communications in computer and information science, vol 367. Springer, Cham, pp 102–106
Xie W, Zhou J, Reyes M, Noble J, Chen Y (2015) Two-mode data distribution scheme for heterogeneous storage in data centers. In: IEEE International Conference on Big Data (Big Data)
Xu Y, Frachtenberg E, Jiang S, Paleczny M (2014) Characterizing facebook’s memcached workload. IEEE Internet Comput 18(2):41–49
Nishtala R, Fugal H, Grimm S, Kwiatkowski M, Lee H, Li HC, Stafford D (2013) Scaling memcache at facebook. In: NSDI, vol 13. pp 385–398
Toor S, Toebbicke R, Resines M, Holmgren S (2012) Investigating an open source cloud storage infrastructure for CERN-specific data analysis. In: IEEE Seventh International Conference on Networking, Architecture, and Storage
Gray J, Liu D, NietoSantisteban M, Szalay A, DeWitt D, Heber G (2005) Scientific data management in the coming decade. ACM SIGMOD Rec 34(4):34–41
Carns P, Harms K, Allcock W, Bacon C, Lang S, Latham R, Ross R (2011) Understanding and improving computational science storage access through continuous characterization. TOS 7(3):1–26
Liu N, Cope J, Carns P, Carothers C, Ross R, Grider G, Crume A, Maltzahn C (2012) On the role of burst buffers in leadership-class storage systems. In: IEEE 28th Symposium on Mass Storage Systems and Technologies (MSST)
Xu T, Sato K, Matsuoka S (2016) CloudBB: scalable I/O accelerator for shared cloud storage. In: IEEE 22nd International Conference on Parallel and Distributed Systems (ICPADS). IEEE
Rackspace (2012) Storage systems overview, Rackspace. https://blog.rackspace.com/storage-systems-overview/. Accessed 2 Feb 2017
Amazon (2006) Amazon API Document. http://docs.aws.amazon.com/AmazonS3/latest/API/RESTObjectPUT.html. Accessed 5 Mar 2017
Ali N et al (2009) Scalable I/O forwarding framework for high-performance computing systems. In: IEEE International Conference on Cluster Computing and Workshops, CLUSTER’09. IEEE
Qureshi MK, Srinivasan V, Revers JA (2009) Scalable high performance main memory system using phase-change memory technology. In: The 36th Annual International Symposium on Computer Architecture (ISCA09), pp 24–33
Lee DH et al (2015) A new memory-disk integrated system with HW optimizer. ACM Trans Archit Code Optim (TACO) 12(2):11
Kryder M, Kim C (2010) After hard drives what comes next? In: The IEEE Transactions on Magnetics, pp 3406–3413
Zheng Q et al (2012) Cosbench: a benchmark tool for cloud object storage services. In: IEEE 5th International Conference on Cloud Computing (CLOUD). IEEE, pp 998–999
Yang JJ, Strukov DB, Stewart DR (2013) Memristive devices for computing. Nature Nanotechnol 8(1):13–24
Sparck Jones K (1972) A statistical interpretation of term specificity and its application in retrieval. J Doc 28(1):11–21
Arnold J (2014) Openstack swift: using, administering, and developing for swift object storage O’Reilly Media, Inc., Sebastopol, CA, USA
Acknowledgements
This research was supported by Basic Science Research Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Science, ICT and Future Planning (2015R1A2A2A01007668).
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Youn, YS., Yoon, SK. & Kim, SD. Cloud computing burst system (CCBS): for exa-scale computing system. J Supercomput 73, 4020–4041 (2017). https://doi.org/10.1007/s11227-017-1998-6
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11227-017-1998-6