Abstract
Cloud computing has become an effective solution for various services on the internet. Additionally, a cloud environment acts as a default storage location for application users. Large cloud storage service providers receive terabytes of data per second with an enormous amount of duplicated content. The duplicate copies can be eliminated using the deduplication technique. The proposed research work detects redundant audio content of the existing files in a cloud environment. Additionally, this study investigates the cloud computing environment which consists of numerous audio files (waveform audio file format). The proposed work detects redundant content and identifies only a part of the existing audio file, which refines the duplicated content over the space. This can be accomplished using the refined super subset identification algorithm, which processes a waveform audio file format content as numerical data and efficiently detects the repeated contents in an elastic cloud computing environment. The results demonstrate the accuracy of detecting duplicated files present in various files. The visual representation of the results proves the accuracy and exhibits that the quality of the audio content was not compromised. Finally, the method is effectively validated in a real-time environment.




Similar content being viewed by others
Explore related subjects
Discover the latest articles and news from researchers in related subjects, suggested using machine learning.References
Venkatesh K, Narasimhan D, Bala Krishnan R (2019) Accelerated service oriented architecture implementation of deduplication principle. Int J Innov Technol Explor Eng (IJITEE) 8(11). ISSN: 2278–3075. Sept 2019
Chen L, Qiu M, Song J et al (2018) E2FS: an elastic storage system for cloud computing. J Supercomput 74:1045–1060. https://doi.org/10.1007/s11227-016-1827-3
Sebaa A, Tari A (2019) Query optimization in cloud environments: challenges, taxonomy, and techniques. J Supercomput 75:5420–5450. https://doi.org/10.1007/s11227-019-02806-9
Wu T, Pan J, Lin C (2014) Improving accessing efficiency of cloud storage using deduplication and feedback schemes. IEEE Journal 8:208–218
Li J, Chen X, Huang X, Tang S, Xiang Y, Member MMH (2015) IEEE and Abdulhameed Alelaiwi Member, IEEE, secure distributed deduplication systems with improved reliability. IEEE Trans Comput 64(2):3569–3579
Ko YW, Kim S-J, Kim J, Kim E-J, So JM (2015) Energy efficient metadata management for cloud storage system. Int J Distrib Sensor Netw. Article ID 626575
Giannakopoulos T (2015) pyaudio analysis: an open-source python library for audio signal analysis, p 0144610. Published: ember 11 Dec 2015
Dia OA, Farkas C (2015) Risk aware query replacement approach for secure databases performance management. IEEE Trans Depend Secure Comput 12(2). March/April 2015.
Li J, Li YK, Chen X, Lee PPC, Lou W (2015) A hybrid cloud approach for secure authorized deduplication. IEEE Trans Parall Distrib Syst 26(5). May 2015.
Wang J, Chen X, Huang X, You I () Senior Member, IEEE, and Yang Xiang, Senior Member, IEEE verifiable auditing for outsourced database in cloud computing. IEEE Trans Comput 64(11). Nov 2015
Bellare M, Keelveedhi S (2015) Interactive message-locked encryption and secure deduplication. In: Cryptography P-K (ed) Berlin. Springer, Germany, pp 516–538
Sumedha A, Telkar SAM, Shaikh MZ (2016) Secured and efficient cloud storage data deduplication system. IJARCCE5(1). January 2016.
Sellami R, Bhiri S, Defude B () Member, IEEE, supporting multi data stores applications in cloud environments. IEEE Trans Serv Comput 9(1). January/February 2016
Xia Z, Wang X, Sun X, Wang Q (2016) A secure and dynamic multi-keyword ranked search scheme over encrypted cloud data. IEEE Trans Parall Distrib Syst 27(2). February 2016
Tudoran R, Costan A, Antoniu G () OverFlow: multi-site aware big data management for scientific workflows on clouds. IEEE Trans Cloud Comput 4(1). January−March 2016
Fu M, Feng D, Hua Y, He X, Chen Z, Liu J, Xia W, Huang F, Liu Q (2016) Reducing fragmentation for in-line deduplication backup storage via exploiting backup history and cache knowledge. IEEE Trans Parall Distrib Syst 27(3). March 2016
Su K-W, Leu J-S, Yu M-C, Wu Y-T, Lee E-C, Song T (2016) Design and implementation of various file deduplication schemes on storage devices. Springer Science Business Media New York
Luo X, Zhou H, Yu LA, Xue L, Xie Y (2016) Characterizing mobile ∗-box applications. Comput Netw 103: 228–239
Bahdanau D, Chorowski J, Serdyuk D, Brakel P, Bengio Y (2016) End-to-end attention-based large vocabulary speech recognition. IEEE Int Conf Acoustics Speech Signal Process ICASSP
Ranjitha S, Sudhakar P, Seetharaman K (2016) A novel and efficient deduplication system for HDFS.Int Conf Intell Comput Converg, pp 498–505
Kumar PM, Lokesh S, Varatharajan R, Babu GC, Parthasarathy P (2018) Cloud and IoT based disease prediction and diagnosis system for healthcare using Fuzzy neural classifier. Futur Gener Comput Syst 86:527–534
Rashid F, Miri A, Woungang I (2016) Secure image deduplication through image compression. J Inf Security Appl 27–28:54–64
Liu J, Chai Y, Yan C, Wang X () A delayed container organization approach to improve restore speed for deduplication systems. IEEE Trans Parall Distrib Syst 27(9). Sept 2016
Jin H, Jiang H, Zhou K (2016) Dynamic and public auditing with fair arbitration for cloud data. IEEE Trans Cloud Compu
Rongmao C, Yi M, Yang G, Guo F, Wang X (2016) Dual-server public-key encryption with keyword search for secure cloud storage. IEEE Trans Inf Foren Security 11(4). April 2016
Panchatcharam P, Vivekanandan S (2019) Internet of things (IOT) in healthcare–smart health and surveillance, architectures, security analysis and data transfer: a review. Int J Softw Innov IJSI 7(2):21–40
Wan C, Zhang J, Pei B, Chen C (2016) Efficient privacy-preserving third-party auditing for ambient intelligence systems. J Ambient Intell Humanized Computing 7(1):21–27
Ryan NS (2017) Widodoa, Hyotaek Limb, Mohammed Atiquzzamanc, SDM: Smart deduplication for mobile cloud storage. Futur Gener Comput Syst 70:64–73
Hamid HAA, Rahman SMM, Hossain MS, Almogren A, Alamri A (2017) A security model for preserving the privacy of medical big data in a healthcare cloud using a fog computing facility with pairing-based cryptography. 7 Nov 2017.
Shinde D, Dangi A (2017) Implementation on secured hybrid cloud storage service provider with deduplication. Int J Emerg Technol Adv Eng 7(6). June 2017
Daniel E, Vasanthi NA (2017) LDAP: a lightweight deduplication and auditing protocol for secure data storage in cloud environment. Springer Science Business Media, LLC, part of Springer Nature
Li C, Wang C, Luo Y (2020) An efficient scheduling optimization strategy for improving consistency maintenance in edge cloud environment. J Supercomput 76:6941–6968. https://doi.org/10.1007/s11227-019-03133-9
Chunlin L, Jianhang T, Youlong L (2018) Multi-queue scheduling of heterogeneous jobs in hybrid geo-distributed cloud environment. J Supercomput 74:5263–5292. https://doi.org/10.1007/s11227-018-2420-8
Bir P, Karatangi SV, Rai A (2020) Design and implementation of an elastic processor with hyperthreading technology and virtualization for elastic server models. J Supercomput 76:7394–7415. https://doi.org/10.1007/s11227-020-03174-5
Acknowledgements
The study was supported by FIST grant received from the Department of Science and Technology, Government of India (Reference No. SR/FST/MSI-107/2015(C)).
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Venkatesh, K., Narasimhan, D. Revealing the novel precise subset identification and deduplication of audio substance over the shared public environment. J Supercomput 78, 11856–11872 (2022). https://doi.org/10.1007/s11227-022-04317-6
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11227-022-04317-6
Keywords
Profiles
- D. Narasimhan View author profile