Abstract
This paper proposes an on-line audio watermarking system for broadcast monitoring. The designed watermark (WM) encoder is a nonlinear data adaptive system that performs perceptual embedding. It allows working at very low watermark-to-signal ratio (WSR) levels thus preserves the inaudibility. The developed decoder adopts wavelet de-noising for blind watermark extraction and it is capable of watermark decoding while establishing the synchronization between the transmitter and the receiver. Unlike the published watermarking schemes, it is shown that the introduced WM embedding scheme minimizes false alarm ratio by adaptively controlling the WSR. Furthermore it integrates synchronization and WM extraction into one processing step resulting in an on-line decoding scheme suitable to audio broadcast monitoring. The proposed system is robust to Digital-to-Analog (D/A), Analog-to-Digital (A/D) conversions, compression and noise as well as the attenuations that arise from FM broadcasting and acoustic transmission. Performance under Stirmak attacks is also reported. It is shown that error free transmission of 24 bps is achieved at around WSR = −32 dB when the PSNR is around 50 dB. Granularity of the system is 0.6 s thus it is capable of tracking short audio clips i.e. commercials.
Similar content being viewed by others
References
Cox IJ, Miller ML, Bloom JA (2000) “Watermarking applications and their properties,” in Proc. of International Conference on Information Technology’2000, USA, 2000, pp 6–10
Depovere G, Kalker T, Haitsma J, Maes M, de Strycker L, Termont P, Vandewege J, Langell A, Alm C, Norman P, P’Reilly G, Howes B, Vaanholt H, Hintzen R, Donnely P, Hudson A (1999) “The VIVA project: digital watermarking for broadcast monitoring,” in Proc. of International Conference on Image Processing, vol.2, Kobe, Japan, pp.202–205
Donoho DL, Johnstone IM (1994) “Threshold selection for wavelet shrinkage of noisy data”. in Proc. of 16th Annual Conf. of the IEEE Engineering in Medicine and Biology Society 24a–25a
Furht B, Kirovski D (2005) “Multimedia Security Handbook”, CRC, Boca Raton, Florida
Gomes L de CT, Cano P, Gómez E, Bonnet M, Battle E (2003) audio watermarking and fingerprinting: for which applications? J New Music Res 32(1):65–81
Gunsel B, Kirbiz S (2006) “Perceptual audio watermarking by learning in wavelet domain”. in Proc. of International Conference on Pattern Recognition (ICPR 2006), Hong Kong
Gunsel B, Ulker Y, Kirbiz S (2006) “A statistical framework for audio watermark detection and decoding”. in Proc. of Multimedia Content Representation Classification and Security (MRCS 2006), Istanbul Turkey, 11–13 September
Haitsma J, Kalker T (2002) “A highly robust audio fingerprinting system,” in Proc. of the 3rd International Symposium on Music Information Retrieval”, Oct, pp. 144–148
Hernandez JJG, Miyatake MN, Meana HP (2006) “Realtime audio watermarking system prototype”, in Proc. of 8th IEEE International Symposium on Multimedia(ISM’06), San Diego, USA, December
Kirbiz S, Yaslan Y, Gunsel B (2005) “Robust audio watermark decoding by nonlinear classification”. in Proc. of 13th EUSIPCO, Turkey, 4–8 September
Kirovski Malvar DHS (2003) Spread-spectrum watermarking of audio signals. IEEE Trans Signal Process 51:1020–1033 (April)
Liu J, He X (2005) “A review study on digital watermarking”, in Proc. of First International Conference on Information and Communication Technologies (ICICT 2005), Cairo Egypt, 5–6 December
Löytynoja M, Cvejic N, Keskinarkaus A, Lähetkangas E, Seppänen T (2006) “Mobile commerce from watermarked broadcast audio” in Proc. of IEEE International Conference on Consumer Electronics, Las Vegas, NV
Malvar HS, Florencio DF (2003) Improved spread spectrum: a new modulation technique for robust watermarking. IEEE Trans Signal Process 51(4):898–905
Nakamura T, Tachibana R, Kobayashi S (2002) “Automatic music monitoring and boundary detection for broadcast using audio watermarking,” in Proc. of Security and Watermarking of Multimedia Contents IV, SPIE vol.4675, USA, pp.170–180
Painter T, Spanias A (2000) Perceptual coding of digital audio. Proc IEEE 88(4):451–515 April
Park CM, Thapa D, Wang GN (2007) Speech Authentication System Using Digital Watermarking and Pattern Recovery. Pattern Recognition Letters 28(8):931–938
Steinebach M, Lang A, Dittmann J, Petitcolas FAP (2002) “Stirmark benchmark: audio watermarking attacks based on lossy compression”. in Proc. of SPIE Security Watermarking Multimedia 4675:79–90, San Jose, CA, Jan
Swanson M, Zhu B, Tewfik A, Boney L (1998) “Robust Audio Watermarking Using Perceptual Masking”. Signal Processing 66(3):337–355
Tachibana R (2003) “Audio watermarking for live performance,” in Proc. of Security and Watermarking of Multimedia Contents V, SPIE vol.5020, USA, pp.32–43
Yaslan Y, Gunsel B (2004) “An integrated decoding framework for audio watermark extraction,” in Proc. of International Conference on Pattern Recognition (ICPR 2004), UK:879–882
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Yaslan, Y., Gunsel, B. An integrated on-line audio watermark decoding scheme for broadcast monitoring. Multimed Tools Appl 40, 1–21 (2008). https://doi.org/10.1007/s11042-007-0182-z
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-007-0182-z