Abstract
More and more data are stored in cloud storage, which brings two major challenges. First, the modified files in the cloud should be quickly synchronized to ensure data consistency, e.g., delta synchronization (sync) achieves efficient cloud sync by synchronizing only the updated part of the file. Second, the huge data in the cloud needs to be deduplicated and encrypted, e.g., Message-Locked Encryption (MLE) implements data deduplication by encrypting the content among different users. However, when combined, a few updates in the content can cause large sync traffic amplification for both keys and ciphertext in the MLE-based cloud storage, significantly degrading the cloud sync efficiency. A feature-based encryption sync scheme, FeatureSync, is proposed to address the delta amplification problem. However, with further improvement of the network bandwidth, the performance of FeatureSync stagnates. In our preliminary experimental evaluations, we find that the bottleneck of the computational overhead in the high-bandwidth network environments is the main bottleneck in FeatureSync. In this article, we propose an enhanced feature-based encryption sync scheme FASTSync to optimize the performance of FeatureSync in high-bandwidth network environments. The performance evaluations on a lightweight prototype implementation of FASTSync show that FASTSync reduces the cloud sync time by 70.3% and the encryption time by 37.3%, on average, compared with FeatureSync.
- [1] . 2022. Retrieved from https://www.icloud.com/.Google Scholar
- [2] . 2013. Message-locked encryption and secure deduplication. In Proceedings of the 32nd Annual International Conference on the Theory and Applications of Cryptographic Techniques (EUROCRYPT’13).Google ScholarCross Ref
- [3] . 2006. Improving duplicate elimination in storage systems. ACM Trans. Storage 2, 4 (2006), 424–448.Google ScholarDigital Library
- [4] . 2022. Retrieved from https://www.boxcryptor.com/en/.Google Scholar
- [5] . 1997. On the resemblance and containment of documents. In Proceedings of the Compression and Complexity of SEQUENCES.Google ScholarDigital Library
- [6] . 2022. Retrieved from https://tldp.org/HOWTO/Traffic-Control-HOWTO/intro.html.Google Scholar
- [7] . 2015. QuickSync: Improving synchronization efficiency for mobile cloud storage services. In Proceedings of the 21st Annual International Conference on Mobile Computing and Networking (MobiCom’15).Google ScholarDigital Library
- [8] . 2002. Reclaiming space from duplicate files in a serverless distributed file system. In Proceedings of the 22nd International Conference on Distributed Computing Systems (ICDCS’02).Google ScholarCross Ref
- [9] . 2003. Application-specific delta-encoding via resemblance detection. In Proceedings of the USENIX Annual Technical Conference (ATC’03).Google Scholar
- [10] . 2022. Retrieved from: http://www.dropboxchina.com/.Google Scholar
- [11] . 2015. Retrieved from: https://www.sookasa.com/resources/encrypted-cloud-storage/.Google Scholar
- [12] . 2022. Retrieved from: https://github.com/Quicksandlyh/FastCDC-c/.Google Scholar
- [13] . 2022. Retrieved from: https://www.seafile.com/en/home.Google Scholar
- [14] . 2022. Retrieved from: https://github.com/.Google Scholar
- [15] . 2022. Retrieved from: https://drive.google.com/.Google Scholar
- [16] . 2022. Retrieved from: https://github.com/tidwall/hashmap.c/.Google Scholar
- [17] . 2020. Dsync: A lightweight delta synchronization approach for cloud storage services. In Proceedings of the 36th Symposium on Mass Storage Systems and Technologies (MSST’20).Google Scholar
- [18] . 2013. DupLESS: Server-aided encryption for deduplicated storage. In Proceedings of the 22th USENIX Security Symposium (Security’13).Google Scholar
- [19] . 2022. Retrieved from: https://www.kernel.org/.Google Scholar
- [20] . 2004. Redundancy elimination within large collections of files. In Proceedings of the USENIX Annual Technical Conference (ATC’04).Google ScholarDigital Library
- [21] . 2019. Metadedup: Deduplicating metadata in encrypted deduplication via indirection. In Proceedings of the 35th Symposium on Mass Storage Systems and Technologies (MSST’19).Google ScholarCross Ref
- [22] . 2017. Information leakage in encrypted deduplication via frequency analysis. In Proceedings of the 47th Annual IEEE/IFIP International Conference on Dependable Systems and Networks (DSN’17).Google ScholarCross Ref
- [23] . 2020. Balancing storage efficiency and data confidentiality with tunable encrypted deduplication. In Proceedings of the 15th EuroSys Conference (EuroSys’20).Google ScholarDigital Library
- [24] . 2013. Efficient batched synchronization in Dropbox-like cloud storage services. In Proceedings of the 14th International Middleware Conference (Middleware’13).Google ScholarCross Ref
- [25] . 2000. File System Support for Delta Compression. Master’s thesis. Department of Electrical Engineering and Computer Science, University of California at Berkeley.Google Scholar
- [26] . 2016. Leveraging data deduplication to improve the performance of primary storage systems in the cloud. IEEE Trans. Comput. 65, 6 (2016), 1775–1788.Google ScholarDigital Library
- [27] . 2016. Exploiting workload characteristics and service diversity to improve the availability of cloud storage systems. IEEE Trans. Parallel Distrib. Syst. 27, 7 (2016), 2010–2021.Google ScholarDigital Library
- [28] . 2022. Retrieved from: https://drive.google.com/.Google Scholar
- [29] . 2019. Secure distributed adaptive bin packing algorithm for cloud storage. Fut. Gen. Comput. Syst. 90 (2019), 307–316.Google ScholarCross Ref
- [30] . 2001. A low-bandwidth network file system. In Proceedings of the 18th ACM Symposium on Operating System Principles (SOSP’01).Google ScholarDigital Library
- [31] . 2022. Retrieved from: https://github.com/openssl/openssl.Google Scholar
- [32] . 2019. The design and implementation of a rekeying-aware encrypted deduplication storage system. ACM Trans. Storage 13, 1 (2017), 1–30.Google ScholarDigital Library
- [33] . 2012. WAN-optimized replication of backup datasets using stream-informed delta compression. ACM Trans. Storage 8, 4 (2012), 1–26.Google ScholarDigital Library
- [34] and Nasir Memon. 2002. Algorithms for delta compression and remote file synchronization. In Handbook of Lossless Compression, S. Sayood (Ed.). Academic Press.Google Scholar
- [35] . 2019. PandaSync: Network and workload aware hybrid cloud sync optimization. In Proceedings of the 39th IEEE International Conference on Distributed Computing Systems (ICDCS’19).Google ScholarCross Ref
- [36] . 2019. PFP: Improving the reliability of deduplication-based storage systems with per-file parity. IEEE Trans. Parallel Distrib. Syst. 30, 9 (2019), 2117–2129.Google ScholarDigital Library
- [37] . 2021. When delta sync meets message-locked encryption: A feature-based delta sync scheme for encrypted cloud storage. In Proceedings of the 41st IEEE International Conference on Distributed Computing Systems (ICDCS’21). 337–347.Google ScholarCross Ref
- [38] . 2016. A comprehensive study of the past, present, and future of data deduplication. Proc. IEEE 104 (2016), 1681–1710.Google ScholarCross Ref
- [39] . 2014. Ddelta: A deduplication-inspired fast delta compression approach. Perform. Eval. 79 (2014), 258–272.Google ScholarCross Ref
- [40] . 2016. FastCDC: A fast and efficient content-defined chunking approach for data deduplication. In Proceedings of the USENIX Annual Technical Conference (USENIX ATC’16), and (Eds.). 101–114.Google Scholar
- [41] . 2018. Towards web-based delta synchronization for cloud storage services. In Proceedings of the 16th USENIX Conference on File and Storage Technologies (FAST’18).Google Scholar
- [42] . 2017. DeltaCFS: Boosting delta sync for cloud storage services by learning from NFS. In Proceedings of the 37th IEEE International Conference on Distributed Computing Systems (ICDCS’17).Google ScholarCross Ref
- [43] . 2015. SecDep: A user-aware efficient fine-grained secure deduplication scheme with multi-level key management. In Proceedings of the IEEE 31st Symposium on Mass Storage Systems and Technologies (MSST’15).Google ScholarCross Ref
Index Terms
- FASTSync: A FAST Delta Sync Scheme for Encrypted Cloud Storage in High-bandwidth Network Environments
Recommendations
Encrypted Data Deduplication in Cloud Storage
ASIAJCIS '15: Proceedings of the 2015 10th Asia Joint Conference on Information SecurityCloud storage is a remote storage service, where users can upload and download their data anytime and anywhere. However, it raises issues regarding privacy and data confidentiality because all the data are stored in the cloud storage. This is a subject ...
Efficient update of encrypted files for cloud storage
UCC '15: Proceedings of the 8th International Conference on Utility and Cloud ComputingWe propose a method for file encryption based on AES-CTR suitable for cloud storage. Our method allows efficient updates of encrypted files by minimizing the amount of data that need to be re-encrypted. It achieves significantly better performance than ...
Comments