Abstract:
Experts have warned that processing of genetic data will soon exceed the computing needs of Twitter and YouTube. This is due to the drop of the costs for sequencing DNA o...Show MoreMetadata
Abstract:
Experts have warned that processing of genetic data will soon exceed the computing needs of Twitter and YouTube. This is due to the drop of the costs for sequencing DNA of any living creature and its huge impact in many application areas. Designing suitable network architectures for distributing such data is therefore of paramount importance. Management of genomic data sets is a typical big data problem, characterized not only by a huge volume, but also by the large size of each genomic file. Since it is unthinkable that any professional who needs to process genomes can own the infrastructure for massive genome analysis, a cloud-based access to genomic services is envisaged. This will have a significant impact on the underlying networks, which could become the system bottleneck. In this paper, we propose Genome Centric Networking (GCN), a novel network function virtualization framework for cloud-based genomic data management, designed with the aim of limiting the exchanged traffic by using distributed caching. The key element of GCN is a novel signaling protocol, which allows both discovering network resources and managing caches. We evaluated GCN on a real testbed. GCN allows halving the exchanged traffic and reducing the transfer time of genomic datasets significantly.
Published in: 2017 IEEE Conference on Network Softwarization (NetSoft)
Date of Conference: 03-07 July 2017
Date Added to IEEE Xplore: 14 August 2017
ISBN Information: