Abstract
Music infringement has emerged as a significant concern and the copyright protection of digital music is becoming more important. Most music copyright protection methods use images as watermark information. Hence, the original watermark needs to be embedded in the music multi-times to achieve copyright protection. It cannot guarantee the accuracy of the embedded watermark. And it may also degrade the quality of the watermarked music. In order to solve that issue, a blind music watermarking is proposed to protect the copyright of music. Comparing with the existing music copyright protection methods, the proposed method generates a watermark from the speech of a copyright owner that can ensure the uniqueness of the watermark. When generating the watermark, the multi-band power distribution of speech recording is calculated first. Subsequently, the discrete cosine transformation is applied to the sub-graphs of the multi-band power distribution, and the watermark is generated using the resulting Discrete Cosine Transformation (DCT) coefficients. After that, the watermark is embedded into the coefficients of Discrete Wavelet Transformation (DWT) using quantization norms. The experiments demonstrate that the Signal to Noise Ratio (SNR) values are higher than 58db between the original music and watermarked music. Meanwhile, the Bit Error Rate (BER) and Normalized Correlation (NC) values are zeros and ones, respectively. In conclusion, the evaluation results show that the proposed method can achieve excellent imperceptibility and robustness. Additionally, it can also effectively resist conventional processing operations such as noise addition, filtering, and so on.







Similar content being viewed by others
Data Availability
Data sharing not applicable to this article as no datasets were generated or analyzed during the current study.
References
Lin J, Qin J, Lyu S et al (2021) Lattice-Based Minimum-Distortion Data Hiding. IEEE Commun Lett 25(9):2839–2843
Mosleh M, Setayeshi S, Barekatain B (2021) Transparent and Robust Audio Watermarking Using Synergy LU Decomposition and the Fibonacci Sequence in GBT-DCT-DWT Transforms. Electronic and Cyber Defense 9(1):101–113
Hu HT, Lin SJ (2017) Hsu LY (2017) Effective Blind Speech Watermarking via Adaptive Mean Modulation and Package Synchronization in DWT Domain. Eurasip J Audio Speech Music Process 1:1–14
Saadi S, Merrad A, Benziane A (2019) Novel Secured Scheme for Blind Audio/Speech Norm-space Watermarking by Arnold Algorithm. Signal Process 154:74–86
Ko HJ, Huang CT, Horng G et al (2020) Robust and Blind Image Watermarking in DCT Domain using Inter-block Coefficient Correlation. Inf Sci 517:128–147
Mosleh M, Setayeshi S, Barekatain B et al (2021) A Novel Audio Watermarking Scheme based on Fuzzy Inference System in DCT Domain. Multimed Tools Appl 80(13):20423–20447
Mosleh M, Setayeshi S, Barekatain B (2021) Transparent and Robust Audio Watermarking Using Synergy LU Decomposition and the Fibonacci Sequence in GBT-DCT-DWT Transforms. Electronic and Cyber Defense 9(1):101–113
Turnip TN, Prasetyo TA, Banjarnahor NA et al (2021) Application of Double Pictures to Audio Watermarking using Discrete Cosine Transform (DCT) and Singular Value Decomposition (SVD) Methods. J Appl Technol Inf Indo 1(2):7–13
Ogura M, Sugiura Y, Shimamura T (2017) SVD based Audio Watermarking using Angle-quantization. 2017 International conference on electrical, computer and communication engineering (ECCE), pp 119–122
Maiti C, Dhara BC (2022) A Blind Audio Watermarking based on Singular Value Decomposition and Quantization. Int J Speech Technol 25(3):759–771
Zong T, Zhao J, Xiang Y et al (2022) Desynchronization-attack-resilient Audio Watermarking Mechanism for Stereo Signals using the Linear Correlation Between Channels. World Wide Web 25(1):357–379
Pourhashemi SM, Mosleh M, Erfani Y (2021) A Novel Audio Watermarking Scheme using Ensemble-based Watermark Detector and Discrete Wavelet Transform. Neural Comput Appl 33(11):6161–6181
Narla VL, Gulivindala S, Chanamallu SR et al (2021) BCH Encoded Robust and Blind Audio Watermarking with Tamper Detection using Hash. Multimed Tools Appl 80(21):32925–32945
Abodena O (2021) Robust and High-capacity Audio Watermarking based on Chirp Z-transform. 2021 29th Signal processing and communications applications conference (SIU), pp 1–4
Budiman G, Suksmono AB, Danudirdjo D (2020) Wavelet-Based Hybrid Audio Watermarking Using Statistical Mean Manipulation and Spread Spectrum. 2020 27th International conference on telecommunications (ICT), pp 1–5
Yuan X, Li M (2020) Gram-Schmidt Orthogonalization-Based Audio Multiple Watermarking Scheme. Circuits, Syst Signal Process 39(8):3958–3977
Maiti C, Dhara BC (2022) A Blind Audio Watermarking based on Singular Value Decomposition and Auantization. Int J Speech Technol 25(3):759–771
Suresh G, Narla VL, Gangwar DP, et al (2022) False- Positive-Free SVD Based Audio Watermarking with Integer Wavelet Transform, Circuits, Systems, and Signal Processing, pp 1–26
Yamni M, Karmouni H, Sayyouri M et al (2022) Robust Audio Watermarking Scheme based on Fractional Charlier Moment Transform and Dual Tree Complex Wavelet Transform. Expert Syst Appl 203:117325
Wu Q, Ding R, Wei J (2022) Audio Watermarking Algorithm with a Synchronization Mechanism Based on Spectrum Distribution. Secur Commun Netw 2022:2617107
Li J, Xiang S (2022) Audio-lossless Robust Watermarking Against Desynchronization Attacks. Signal Process 198:108561
Yamni M, Karmouni H, Sayyouri M et al (2022) Efficient Watermarking Algorithm for Digital Audio/Speech Signal. Digit Signal Process 120:103251
Kim C, Stern RM (2010) Feature Extraction for Robust Speech Recognition based on Maximizing the Sharpness of the Power Distribution and on Power Flooring. 2010 IEEE international conference on acoustics, speech and signal processing, pp 4574–4577
Hu HT, Chou HH, Lee TT (2021) Robust Blind Speech Watermarking via FFT-based Perceptual Vector Norm Modulation with Frame Self-Synchronization. IEEE Access 9:9916–9925
He J, Liu Z, Lin K et al (2023) A novel audio watermarking algorithm robust against recapturing attacks. Multimed Tools Appl 82(12):18599–18616
Acknowledgements
This work is supported by the National Natural Science Foundation of China(61902085), the Guizhou Provincial Science and Technology Plan Projects Qian Ke He Jichu - (ZK[2021]YiBan312, ZK[2021]YiBan311).
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflicts of interest
The authors declare that they have no conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Qian, Q., Song, M., Zhou, S. et al. A music watermarking method based on the multi-band power distribution of copyright owner’s speech. Multimed Tools Appl 83, 67627–67642 (2024). https://doi.org/10.1007/s11042-024-18269-x
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-024-18269-x