Skip to main content
Log in

Revealing the novel precise subset identification and deduplication of audio substance over the shared public environment

  • Published:
The Journal of Supercomputing Aims and scope Submit manuscript

Abstract

Cloud computing has become an effective solution for various services on the internet. Additionally, a cloud environment acts as a default storage location for application users. Large cloud storage service providers receive terabytes of data per second with an enormous amount of duplicated content. The duplicate copies can be eliminated using the deduplication technique. The proposed research work detects redundant audio content of the existing files in a cloud environment. Additionally, this study investigates the cloud computing environment which consists of numerous audio files (waveform audio file format). The proposed work detects redundant content and identifies only a part of the existing audio file, which refines the duplicated content over the space. This can be accomplished using the refined super subset identification algorithm, which processes a waveform audio file format content as numerical data and efficiently detects the repeated contents in an elastic cloud computing environment. The results demonstrate the accuracy of detecting duplicated files present in various files. The visual representation of the results proves the accuracy and exhibits that the quality of the audio content was not compromised. Finally, the method is effectively validated in a real-time environment.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4

Similar content being viewed by others

References

  1. Venkatesh K, Narasimhan D, Bala Krishnan R (2019) Accelerated service oriented architecture implementation of deduplication principle. Int J Innov Technol Explor Eng (IJITEE) 8(11). ISSN: 2278–3075. Sept 2019

  2. Chen L, Qiu M, Song J et al (2018) E2FS: an elastic storage system for cloud computing. J Supercomput 74:1045–1060. https://doi.org/10.1007/s11227-016-1827-3

    Article  Google Scholar 

  3. Sebaa A, Tari A (2019) Query optimization in cloud environments: challenges, taxonomy, and techniques. J Supercomput 75:5420–5450. https://doi.org/10.1007/s11227-019-02806-9

    Article  Google Scholar 

  4. Wu T, Pan J, Lin C (2014) Improving accessing efficiency of cloud storage using deduplication and feedback schemes. IEEE Journal 8:208–218

    Google Scholar 

  5. Li J, Chen X, Huang X, Tang S, Xiang Y, Member MMH (2015) IEEE and Abdulhameed Alelaiwi Member, IEEE, secure distributed deduplication systems with improved reliability. IEEE Trans Comput 64(2):3569–3579

    Article  MathSciNet  Google Scholar 

  6. Ko YW, Kim S-J, Kim J, Kim E-J, So JM (2015) Energy efficient metadata management for cloud storage system. Int J Distrib Sensor Netw. Article ID 626575

  7. Giannakopoulos T (2015) pyaudio analysis: an open-source python library for audio signal analysis, p 0144610. Published: ember 11 Dec 2015

  8. Dia OA, Farkas C (2015) Risk aware query replacement approach for secure databases performance management. IEEE Trans Depend Secure Comput 12(2). March/April 2015.

  9. Li J, Li YK, Chen X, Lee PPC, Lou W (2015) A hybrid cloud approach for secure authorized deduplication. IEEE Trans Parall Distrib Syst 26(5). May 2015.

  10. Wang J, Chen X, Huang X, You I () Senior Member, IEEE, and Yang Xiang, Senior Member, IEEE verifiable auditing for outsourced database in cloud computing. IEEE Trans Comput 64(11). Nov 2015

  11. Bellare M, Keelveedhi S (2015) Interactive message-locked encryption and secure deduplication. In: Cryptography P-K (ed) Berlin. Springer, Germany, pp 516–538

    Google Scholar 

  12. Sumedha A, Telkar SAM, Shaikh MZ (2016) Secured and efficient cloud storage data deduplication system. IJARCCE5(1). January 2016.

  13. Sellami R, Bhiri S, Defude B () Member, IEEE, supporting multi data stores applications in cloud environments. IEEE Trans Serv Comput 9(1). January/February 2016

  14. Xia Z, Wang X, Sun X, Wang Q (2016) A secure and dynamic multi-keyword ranked search scheme over encrypted cloud data. IEEE Trans Parall Distrib Syst 27(2). February 2016

  15. Tudoran R, Costan A, Antoniu G () OverFlow: multi-site aware big data management for scientific workflows on clouds. IEEE Trans Cloud Comput 4(1). January−March 2016

  16. Fu M, Feng D, Hua Y, He X, Chen Z, Liu J, Xia W, Huang F, Liu Q (2016) Reducing fragmentation for in-line deduplication backup storage via exploiting backup history and cache knowledge. IEEE Trans Parall Distrib Syst 27(3). March 2016

  17. Su K-W, Leu J-S, Yu M-C, Wu Y-T, Lee E-C, Song T (2016) Design and implementation of various file deduplication schemes on storage devices. Springer Science Business Media New York

  18. Luo X, Zhou H, Yu LA, Xue L, Xie Y (2016) Characterizing mobile ∗-box applications. Comput Netw 103: 228–239

  19. Bahdanau D, Chorowski J, Serdyuk D, Brakel P, Bengio Y (2016) End-to-end attention-based large vocabulary speech recognition. IEEE Int Conf Acoustics Speech Signal Process ICASSP

  20. Ranjitha S, Sudhakar P, Seetharaman K (2016) A novel and efficient deduplication system for HDFS.Int Conf Intell Comput Converg, pp 498–505

  21. Kumar PM, Lokesh S, Varatharajan R, Babu GC, Parthasarathy P (2018) Cloud and IoT based disease prediction and diagnosis system for healthcare using Fuzzy neural classifier. Futur Gener Comput Syst 86:527–534

    Article  Google Scholar 

  22. Rashid F, Miri A, Woungang I (2016) Secure image deduplication through image compression. J Inf Security Appl 27–28:54–64

    Google Scholar 

  23. Liu J, Chai Y, Yan C, Wang X () A delayed container organization approach to improve restore speed for deduplication systems. IEEE Trans Parall Distrib Syst 27(9). Sept 2016

  24. Jin H, Jiang H, Zhou K (2016) Dynamic and public auditing with fair arbitration for cloud data. IEEE Trans Cloud Compu

  25. Rongmao C, Yi M, Yang G, Guo F, Wang X (2016) Dual-server public-key encryption with keyword search for secure cloud storage. IEEE Trans Inf Foren Security 11(4). April 2016

  26. Panchatcharam P, Vivekanandan S (2019) Internet of things (IOT) in healthcare–smart health and surveillance, architectures, security analysis and data transfer: a review. Int J Softw Innov IJSI 7(2):21–40

    Article  Google Scholar 

  27. Wan C, Zhang J, Pei B, Chen C (2016) Efficient privacy-preserving third-party auditing for ambient intelligence systems. J Ambient Intell Humanized Computing 7(1):21–27

    Article  Google Scholar 

  28. Ryan NS (2017) Widodoa, Hyotaek Limb, Mohammed Atiquzzamanc, SDM: Smart deduplication for mobile cloud storage. Futur Gener Comput Syst 70:64–73

    Article  Google Scholar 

  29. Hamid HAA, Rahman SMM, Hossain MS, Almogren A, Alamri A (2017) A security model for preserving the privacy of medical big data in a healthcare cloud using a fog computing facility with pairing-based cryptography. 7 Nov 2017.

  30. Shinde D, Dangi A (2017) Implementation on secured hybrid cloud storage service provider with deduplication. Int J Emerg Technol Adv Eng 7(6). June 2017

  31. Daniel E, Vasanthi NA (2017) LDAP: a lightweight deduplication and auditing protocol for secure data storage in cloud environment. Springer Science Business Media, LLC, part of Springer Nature

  32. Li C, Wang C, Luo Y (2020) An efficient scheduling optimization strategy for improving consistency maintenance in edge cloud environment. J Supercomput 76:6941–6968. https://doi.org/10.1007/s11227-019-03133-9

    Article  Google Scholar 

  33. Chunlin L, Jianhang T, Youlong L (2018) Multi-queue scheduling of heterogeneous jobs in hybrid geo-distributed cloud environment. J Supercomput 74:5263–5292. https://doi.org/10.1007/s11227-018-2420-8

    Article  Google Scholar 

  34. Bir P, Karatangi SV, Rai A (2020) Design and implementation of an elastic processor with hyperthreading technology and virtualization for elastic server models. J Supercomput 76:7394–7415. https://doi.org/10.1007/s11227-020-03174-5

    Article  Google Scholar 

Download references

Acknowledgements

The study was supported by FIST grant received from the Department of Science and Technology, Government of India (Reference No. SR/FST/MSI-107/2015(C)).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to D. Narasimhan.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Venkatesh, K., Narasimhan, D. Revealing the novel precise subset identification and deduplication of audio substance over the shared public environment. J Supercomput 78, 11856–11872 (2022). https://doi.org/10.1007/s11227-022-04317-6

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11227-022-04317-6

Keywords

Navigation