Your browser does not support JavaScript!
http://iet.metastore.ingenta.com
1887

Robust audio hashing based on discrete-wavelet-transform and non-negative matrix factorisation

Robust audio hashing based on discrete-wavelet-transform and non-negative matrix factorisation

For access to this article, please select a purchase option:

Buy article PDF
£12.50
(plus tax if applicable)
Buy Knowledge Pack
10 articles for £75.00
(plus taxes if applicable)

IET members benefit from discounts to all IET publications and free access to E&T Magazine. If you are an IET member, log in to your account and the discounts will automatically be applied.

Learn more about IET membership 

Recommend Title Publication to library

You must fill out fields marked with: *

Librarian details
Name:*
Email:*
Your details
Name:*
Email:*
Department:*
Why are you recommending this title?
Select reason:
 
 
 
 
 
IET Communications — Recommend this title to your library

Thank you

Your recommendation has been sent to your librarian.

Robust audio hashing defines a feature vector that characterises the audio signal, independent of content preserving manipulations, such as MP3 compression, amplitude boosting/cutting, hiss reduction, and so on. It provides a tool for fast and reliable identification of content in audio communications. In this study, the authors propose a new audio hashing based on discrete wavelet transform and non-negative matrix factorisation (NMF). The desirable property of NMF for hashing algorithm is its non-negative constraints, which result in bases that capture local feature of the audio, thereby significantly reducing misclassification. In addition, to ensure perceptual robustness, NMF is performed on the coarse wavelet coefficients, which are a low-pass approximation of the audio and not easy to change by content preserving manipulations. Experimental results over a large database reveal that the proposed scheme is more robust and provides much stronger discrimination than the conventional energy spectrum-based hashing algorithm, and that the proposed scheme can be applied in broadcast monitoring, successfully.

References

    1. 1)
      • M. Park , H. Kim , S.H. Yang . Frequency-temporal filtering for a robust audio fingerprinting scheme in real-noise environments. ETRI J. , 4 , 509 - 512
    2. 2)
      • Jiao, Y., Yang, B., Li, M., Niu, X.: `MDCT-based perceptual hashing for compressed audio content identification', Proc. IEEE Ninth Workshop on Multimedia Signal Processing, 2007, p. 381–384.
    3. 3)
      • J.S. Seo , J. Haitsma , T. Kalker , C.D. Yoo . A robust image fingerprinting system using the Radon transform. Signal Process.: Image Commun. , 4 , 325 - 339
    4. 4)
      • D.D. Lee , H.S. Seung . Algorithms for non-negative matrix factorization. Adv. Neural Inf. Process. Syst. , 556 - 562
    5. 5)
      • Chen, N., Wan, W.G.: `Speech hashing algorithm based on short-time stability', Proc. 19th Int. Conf. Artificial Neural Networks, 2009, 2009.
    6. 6)
      • D. Fragoulis , G. Rousopoulos , T. Panagopoulos , C. Alexiou , C. Papaodysseus . On the automated recognition of seriously distorted musical recordings. IEEE Trans. Signal Process. , 4 , 898 - 908
    7. 7)
      • H. O zer , B. Sankur , N. Memon , E. Anarim . Perceptual audio hashing functions. EURASIP J. Appl. Signal Process. , 1 , 1780 - 1793
    8. 8)
      • Sethi, I., Kulesh, V., Petrushin, V.: `Indexing and retrieval of music via gaussian mixture models', Proc. Third Int. Workshop on Content Based Multimedia Indexing, 2003, 2003.
    9. 9)
      • V. Monga , M.K. Mhcak . Robust and secure image hashing via non-negative matrix factorizations. IEEE Trans. Inf. Forensics Sec. , 376 - 390
    10. 10)
      • Mapelli, F., Lancini, R.: `Audio hashing technique for automatic song identification', Proc. 2003 Int. Conf. Information Technology: Research and Education, 2003, p. 84–88.
    11. 11)
      • Seo, J.S., Jin, M., Lee, S., Jang, D., Lee, S., Yoo, C.D.: `Audio fingerprinting based on normalized spectral subband centroids', Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing, 2005, 2005.
    12. 12)
    13. 13)
      • A.J. Menezes , P.C. Van Oorschot , S.A. Vanstone . (1997) Handbook of applied cryptography.
    14. 14)
      • Jiao, Y., Li, M., Li, Q., Niu, X.: `Key-dependent compressed domain audio hashing', Proc. Eighth Int. Conf. Intelligent Systems Design and Applications, 2008, 3.
    15. 15)
      • Jiao, Y., Li, Q., Niu, X.: `Compressed domain perceptual hashing for MELP coded speech', Proc. Int. Conf. Intelligent Information Hiding and Multimedia Signal Processing, 2008, p. 410–413.
    16. 16)
    17. 17)
      • Haitsma, J., Kalker, T.: `A highly robust audio fingerprinting system', Proc. ISMIR, 2002, 2002.
http://iet.metastore.ingenta.com/content/journals/10.1049/iet-com.2009.0749
Loading

Related content

content/journals/10.1049/iet-com.2009.0749
pub_keyword,iet_inspecKeyword,pub_concept
6
6
Loading
This is a required field
Please enter a valid email address