ABSTRACT
There are millions of files and multi-terabytes of data transferred to and from the University of Tennessee's National Institute for Computational Sciences each month. New capabilities available with GridFTP version 5.2.2 include additional transfer log information previously unavailable in prior versions implemented within XSEDE. The transfer log data now available includes identification of source and destination endpoints which unlocks a wealth of information that can be used to detail GridFTP activities across the Internet. This information can be used for a wide variety of reports of interest to individual XSEDE Service Providers and to XSEDE Operations. In this paper, we discuss the new capabilities available for transfer logs in GridFTP 5.2.2, our initial attempt to organize, analyze, and report on this file transfer data for NICS, and its applicability to XSEDE Service Providers. Analysis of this new information can provide insight into effective and efficient utilization of GridFTP resources including identification of potential areas of GridFTP file transfer improvement (e.g., network and server tuning) and potential predictive analysis to improve efficiency.
- Extreme Science and Engineering Discovery Environment (XSEDE). https://www.xsede.org/, 2011.Google Scholar
- I. Foster. Globus toolkit version 4: Software for service-oriented systems. In Network and parallel computing, pages 2--13. Springer, 2005. Google ScholarDigital Library
- R Core Team. R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna, Austria, 2013.Google Scholar
Index Terms
- Descriptive Data Analysis of File Transfer Data
Recommendations
Implementation of a dynamic adjustment strategy for parallel file transfer in co-allocation data grids
Co-allocation architecture was developed to enable parallel transferring of files from multiple replicas stored in the different servers. Several co-allocation strategies have been coupled and used to exploit the different transfer rates among various ...
The File Mover: high-performance data transfer for the grid: Research Articles
The exploration in many scientific disciplines (e.g. High-Energy Physics, Climate Modeling, and Life Sciences) involves the production and the analysis of massive data collections, whose archival, retrieval, and analysis require the coordinated usage of ...
An Anticipative Recursively Adjusting Mechanism for parallel file transfer in data grids
Data Grids enable the sharing, selection, and connection of a wide variety of geographically distributed computational and storage resources for content needed by large-scale data-intensive applications such as high-energy physics, bioinformatics, and ...
Comments