skip to main content
research-article

Batch-file Operations to Optimize Massive Files Accessing: Analysis, Design, and Application

Authors Info & Claims
Published:16 July 2020Publication History
Skip Abstract Section

Abstract

Existing local file systems, designed to support a typical single-file access mode only, can lead to poor performance when accessing a batch of files, especially small files. This single-file mode essentially serializes accesses to batched files one by one, resulting in a large number of non-sequential, random, and often dependent I/Os between file data and metadata at the storage ends. Such access mode can further worsen the efficiency and performance of applications accessing massive files, such as data migration. We first experimentally analyze the root cause of such inefficiency in batch-file accesses. Then, we propose a novel batch-file access approach, referred to as BFO for its set of optimized Batch-File Operations, by developing novel BFOr and BFOw operations for fundamental read and write processes, respectively, using a two-phase access for metadata and data jointly. The BFO offers dedicated interfaces for batch-file accesses and additional processes integrated into existing file systems without modifying their structures and procedures. In addition, based on BFOr and BFOw, we also propose the novel batch-file migration BFOm to accelerate the data migration for massive small files. We implement a BFO prototype on ext4, one of the most popular file systems. Our evaluation results show that the batch-file read and write performances of BFO are consistently higher than those of the traditional approaches regardless of access patterns, data layouts, and storage media, under synthetic and real-world file sets. BFO improves the read performance by up to 22.4× and 1.8× with HDD and SSD, respectively, and it boosts the write performance by up to 111.4× and 2.9× with HDD and SSD, respectively. BFO also demonstrates consistent performance advantages for data migration in both local and remote situations.

References

  1. Vasily Tarasov and George Amvrosiadis. 2018. Filebench. Retrieved from http://sourceforge.net/projects/filebench/.Google ScholarGoogle Scholar
  2. Skyvia.com. 2018. Skyvia. Retrieved from https://skyvia.com/data-integration/synchronization.Google ScholarGoogle Scholar
  3. Alibaba. 2018. TFS Project. Retrieved from http://code.taobao.org/p/tfs/src/.Google ScholarGoogle Scholar
  4. William E. Allcock, John Bresnahan, Rajkumar Kettimuthu, and Michael Link. 2005. The globus striped GridFTP framework and server. In Proceedings of the ACM/IEEE Supercomputing Conference (SC’05). 54.Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Michael P. Andersen and David E. Culler. 2016. BTrDB: Optimizing storage system design for timeseries processing. In Proceedings of the USENIX Conference on File and Storage Technologies. 39--52.Google ScholarGoogle Scholar
  6. Apache. 2018. Hadoop. Retrieved from http://hadoop.apache.org/.Google ScholarGoogle Scholar
  7. Jens Axboe. 2018. Blktrace. Retrieved from https://git.kernel.org/pub/scm/linux/kernel/git/axboe.Google ScholarGoogle Scholar
  8. Doug Beaver, Sanjeev Kumar, Harry C. Li, Jason Sobel, and Peter Vajgel. 2010. Finding a needle in haystack: Facebook’s photo storage. In Proceedings of the 9th USENIX Symposium on Operating Systems Design and Implementation (OSDI’10). 47--60.Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Binfer. 2018. High-speed File Transfer Software. Retrieved from https://www.binfer.com/high-speed-file-transfer-software/.Google ScholarGoogle Scholar
  10. Vijay Chidambaram, Tushar Sharma, Andrea C. Arpaci-Dusseau, and Remzi H. Arpaci-Dusseau. 2012. Consistency without ordering. In Proceedings of the USENIX Conference on File and Storage Technologies. 9.Google ScholarGoogle Scholar
  11. Xiaoning Ding, Song Jiang, Feng Chen, Kei Davis, and Xiaodong Zhang. 2007. DiskSeen: Exploiting disk layout and access history to enhance I/O prefetch. In Proceedings of theUSENIX Annual Technical Conference. USENIX, 261--274.Google ScholarGoogle Scholar
  12. John Esmet, Michael A. Bender, Martin Farach-Colton, and Bradley C. Kuszmaul. 2012. The TokuFS streaming file system. In Proceedings of the 4th USENIX Workshop on Hot Topics in Storage and File Systems (HotStorage’12).Google ScholarGoogle Scholar
  13. Songling Fu, Ligang He, Chenlin Huang, Xiangke Liao, and Kenli Li. 2015. Performance optimization for managing massive numbers of small files in distributed file systems. IEEE Trans. Parallel Distrib. Syst. 26, 12 (2015), 3433--3448.Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. GNU. 2018. Linux scp. Retrieved from http://www.gnu.org/software/coreutils/coreutils.html.Google ScholarGoogle Scholar
  15. GNU. 2018. Linux tar. Retrieved from http://www.gnu.org/software/coreutils/coreutils.html.Google ScholarGoogle Scholar
  16. Yunhong Gu and Robert L. Grossman. 2007. UDT: UDP-based data transfer for high-speed wide area networks. Comput. Netw. 51, 7 (2007), 1777--1799.Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Sangtae Ha, Injong Rhee, and Lisong Xu. 2008. CUBIC: A new TCP-friendly high-speed TCP variant. Operat. Syst. Rev. 42, 5 (2008), 64--74.Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Andrew Hanushevsky. 2018. BBCP. Retrieved from http://www.slac.stanford.edu/ abh/bbcp/.Google ScholarGoogle Scholar
  19. William Jannen, Jun Yuan, Yang Zhan, Amogh Akshintala, John Esmet, Yizheng Jiao, Ankur Mittal, Prashant Pandey, Phaneendra Reddy, Leif Walsh, Michael A. Bender, Martin Farach-Colton, Rob Johnson, Bradley C. Kuszmaul, and Donald E. Porter. 2015. BetrFS: A right-optimized write-optimized file system. In Proceedings of the USENIX Conference on File and Storage Technologies. 301--315.Google ScholarGoogle Scholar
  20. Yongsoo Joo, Sangsoo Park, and Hyokyung Bahn. 2017. Exploiting I/O reordering and I/O interleaving to improve application launch performance. Trans. Stor. 13, 1 (2017), 8:1–8:17.Google ScholarGoogle Scholar
  21. Tom Kelly. 2003. Scalable TCP: Improving performance in highspeed wide area networks. Comput. Commun. Rev. 33, 2 (2003), 83--91.Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Youngjae Kim, Scott Atchley, Geoffroy Vallée, and Galen M. Shipman. 2015. LADS: Optimizing data transfers using layout-aware data scheduling. In Proceedings of the USENIX Conference on File and Storage Technologies. 67--80.Google ScholarGoogle Scholar
  23. Changman Lee, Dongho Sim, Joo Young Hwang, and Sangyeun Cho. 2015. F2FS: A new file system for flash storage. In Proceedings of the USENIX Conference on File and Storage Technologies. 273--286.Google ScholarGoogle Scholar
  24. Tan Li, Yufei Ren, Dantong Yu, and Shudong Jin. 2017. RAMSYS: Resource-aware asynchronous data transfer with multicore SYStems. IEEE Trans. Parallel Distrib. Syst. 28, 5 (2017), 1430--1444.Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. Jie Liang, Yinlong Xu, Yongkun Li, and Yubiao Pan. 2017. ISM- An intra-stripe data migration approach for RAID-5 scaling. In Proceedings of the International Conference on Networking, Architecture, and Storage (NAS’17). 1--10.Google ScholarGoogle ScholarCross RefCross Ref
  26. LinuxKernel. 2018. CFQ. Retrieved from https://www.kernel.org/doc/Documentation/block/cfq-iosched.txt.Google ScholarGoogle Scholar
  27. LinuxKernel. 2018. Deadline. Retrieved from https://www.kernel.org/doc/Documentation/block/deadline-iosched.txt.Google ScholarGoogle Scholar
  28. Yue Liu, Songlin Hu, Tilmann Rabl, Wantao Liu, Hans-Arno Jacobsen, Kaifeng Wu, Jian Chen, and Jintao Li. 2014. DGFIndex for smart grid: Enhancing hive with a cost-effective multidimensional range index. Proc. VLDB Endow. 7, 13 (2014), 1496--1507.Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. Lanyue Lu, Thanumalayan Sankaranarayana Pillai, Andrea C. Arpaci-Dusseau, and Remzi H. Arpaci-Dusseau. 2016. WiscKey: Separating keys from values in SSD-conscious storage. In Proceedings of the 14th USENIX Conference on File and Storage Technologies (FAST’16). 133--148.Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. Marshall Kirk Mckusick and T. J. Kowalski. 2007. Fsck—The UNIX file system check program. Retrieved from https://dl.acm.org/doi/10.5555/107172.107210.Google ScholarGoogle Scholar
  31. Netapp. 2018. Cloud Sync. Retrieved from https://cloud.netapp.com/cloud-sync.Google ScholarGoogle Scholar
  32. Nexor. 2018. Secure and Efficient Manual Release of Files Across Networks. Retrieved from https://www.nexor.com/case-studies/files-transfer-secure-networks/.Google ScholarGoogle Scholar
  33. Thanumalayan Sankaranarayana Pillai, Ramnatthan Alagappan, Lanyue Lu, Vijay Chidambaram, Andrea C. Arpaci-Dusseau, and Remzi H. Arpaci-Dusseau. 2017. Application crash consistency and performance with CCFS. In Proceedings of the 15th USENIX Conference on File and Storage Technologies (FAST’17). 181--196.Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. Kai Ren and Garth A. Gibson. 2013. TABLEFS: Enhancing metadata efficiency in the local file system. In Proceedings of the USENIX Annual Technical Conference. 145--156.Google ScholarGoogle ScholarDigital LibraryDigital Library
  35. Ohad Rodeh, Josef Bacik, and Chris Mason. 2013. BTRFS: The Linux B-tree filesystem. ACM Trans. Storage 9, 3 (2013), 1--32.Google ScholarGoogle ScholarDigital LibraryDigital Library
  36. Bradley W. Settlemyer, Jonathan D. Dobson, Stephen W. Hodson, Jeffery A. Kuehn, Stephen W. Poole, and Thomas Ruwart. 2011. A technique for moving large data sets over high-performance long distance networks. In Proceedings of the IEEE Conference on Mass Storage Systems and Technologies (MSST’11). 1--6.Google ScholarGoogle ScholarDigital LibraryDigital Library
  37. Philip Shilane, Mark Huang, Grant Wallace, and Windsor Hsu. 2012. WAN optimized replication of backup datasets using stream-informed delta compression. In Proceedings of the USENIX Conference on File and Storage Technologies. 5.Google ScholarGoogle ScholarDigital LibraryDigital Library
  38. Adam Sweeney, Doug Doucette, Wei Hu, Curtis Anderson, Mike Nishimoto, and Geoff Peck. 1996. Scalability in the XFS file system. In Proceedings of the USENIX Annual Technical Conference. 1--14.Google ScholarGoogle Scholar
  39. Textfiles.com. 2018. TextFiles. Retrieved from http://www.textfiles.com/bbs/.Google ScholarGoogle Scholar
  40. Andrew Tridgell. 2018. Rsync. Retrieved from https://rsync.samba.org/.Google ScholarGoogle Scholar
  41. Stephen C. Tweedie. 2000. EXT3, journaling filesystem. In Proceedings of the Ottowa Linux Symposium.Google ScholarGoogle Scholar
  42. Wenrui Yan, Jie Yao, Qiang Cao, Changsheng Xie, and Hong Jiang. 2017. ROS: A rack-based optical storage system with inline accessibility for long-term data preservation. In Proceedings of the 12th European Conference on Computer Systems (EuroSys’17). 161--174.Google ScholarGoogle ScholarDigital LibraryDigital Library
  43. Wangdong Yang, Kenli Li, and Keqin Li. 2019. A pipeline computing method of SpTV for three-order tensors on CPU and GPU. Trans. Knowl. Discov. Data 13, 6 (2019), 63:1–63:27.Google ScholarGoogle Scholar
  44. Weikuan Yu, Jeffrey S. Vetter, Shane Canon, and Song Jiang. 2007. Exploiting lustre file joining for effective collective IO. In Proceedings of the 7th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGRID’07). 267--274.Google ScholarGoogle ScholarDigital LibraryDigital Library
  45. Jun Yuan, Yang Zhan, William Jannen, Prashant Pandey, Amogh Akshintala, Kanchan Chandnani, Pooja Deo, Zardosht Kasheff, Leif Walsh, Michael A. Bender, Martin Farach-Colton, Rob Johnson, Bradley C. Kuszmaul, and Donald E. Porter. 2016. Optimizing every operation in a write-optimized file system. In Proceedings of the USENIX Conference on File and Storage Technologies. 1--14.Google ScholarGoogle Scholar
  46. Haoyu Zhang, Brian Cho, Ergin Seyfe, Avery Ching, and Michael J. Freedman. 2018. Riffle: Optimized shuffle service for large-scale data analytics. In Proceedings of the 13th EuroSys Conference (EuroSys’18). 43:1–43:15.Google ScholarGoogle Scholar
  47. Shuanglong Zhang, Helen Catanese, and An-I Andy Wang. 2016. The composite-file file system: Decoupling the one-to-one mapping of files and metadata for better performance. In Proceedings of the USENIX Conference on File and Storage Technologies. 15--22.Google ScholarGoogle Scholar

Index Terms

  1. Batch-file Operations to Optimize Massive Files Accessing: Analysis, Design, and Application

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in

      Full Access

      • Published in

        cover image ACM Transactions on Storage
        ACM Transactions on Storage  Volume 16, Issue 3
        August 2020
        150 pages
        ISSN:1553-3077
        EISSN:1553-3093
        DOI:10.1145/3410885
        • Editor:
        • Sam H. Noh
        Issue’s Table of Contents

        Copyright © 2020 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 16 July 2020
        • Online AM: 7 May 2020
        • Accepted: 1 April 2020
        • Revised: 1 March 2020
        • Received: 1 September 2019
        Published in tos Volume 16, Issue 3

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article
        • Research
        • Refereed

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      HTML Format

      View this article in HTML Format .

      View HTML Format