Abstract
In this paper, a novel framework based on trace norm minimization for audio event detection is proposed. In the framework, both the feature extraction and pattern classifier are made by solving corresponding convex optimization problem with trace norm regularization or under trace norm constraint. For feature extraction, robust principle component analysis (robust PCA) via minimizing a combination of the nuclear norm and the ℓ1-norm is used to extract matrix representation features which is robust to outliers and gross corruption for audio segments. These matrix representation features are fed to a linear classifier where the weight matrix and bias are learned by solving similar trace norm regularized problems. Experiments on real data sets indicate that this novel framework is effective and noise robust.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Lu, L.: Content analysis for audio classification and segmentation. IEEE Trans. Speech and Audio Processing 10, 504–516 (2002)
Cui, R., Lu, L., Zhung, H.J., Cai, L.H.: Highlight sound effects detection in audio stream. In: Proceedings of IEEE International Conference on Multimedia and Expo, pp. 37–40 (2003)
Pradeep, K.A., Namunu, C.M., Mohan, S.K.: Audio based event detection for multimedia surveillance. In: Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (2006)
Zhuang, X., Zhou, X., Huang, T.S., Hasegawa-Johnson, M.: Feature analysis and selection for acoustic event detection. In: Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 17–20 (2008)
Fazel, M., Hindi, H., Boyd, S.P.: A rank minimization heuristic with application to minimum order system approximation. In: Proceedings of the American Control Conference, pp. 4734–4739 (2001)
Srebro, N., Rennie, J.D.M., Jaakkola, T.S.: Maximum-margin matrix factorization. In: Proceedings of Advances in Neural Information Processing Systems, pp. 1329–1336 (2005)
Argyriou, A., Evgeniou, T., Pontil, M.: Convex multi-task feature learning. Machine Learning 73(3), 243–272 (2008)
Wright, J., Ganesh, A., Rao, S., Peng, Y., Ma, Y.: Robust principal component analysis: Exact recovery of corrupted low-rank matrices via convex optimization. In: Proceedings of Advances in Neural Information Processing Systems (2009)
Lin, Z., Chen, M., Wu, L., Ma, Y.: The augmented lagrange multiplier method for exact recovery of corrupted low-rank matrices. UIUC Technical Report (2009)
Tomioka, R., Aihara, K.: Classifying matrices with a spectral regularization. In: 24th International Conference on Machine Learning, pp. 895–902 (2007)
Toh, K., Yun, S.: An accelerated proximal gradient algorithm for nuclear norm regularized least squares problems. Pacific J. Optim. 6, 615–640 (2010)
Ji, S., Ye, J.: An accelerated gradient method for trace norm minimization. In: 26th International Conference on Machine Learning, pp. 457–464 (2009)
Liu, Y.J., Sun, D., Toh, K.C.: An implementable proximal point algorithmic framework for nuclear norm minimization. Mathematical Programming, 1–38 (2009)
Jolliffe, I.T.: Principal Component Analysis. Springer Series in Statistics. Springer, Berlin (1986)
Davis, S., Mermelstein, P.: Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences. IEEE Transactions on Acoustics, Speech and Signal Processing 28(4), 357–366 (1980)
Youku, http://www.youku.com
Bickel, P., Ritov, Y., Tsybakov, A.: Simultaneous analysis of Lasso and Dantzig selector. Annals of Statistics 37(4), 1705–1732 (2009)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Shi, Z., Han, J., Zheng, T. (2011). A Novel Framework Based on Trace Norm Minimization for Audio Event Detection. In: Lu, BL., Zhang, L., Kwok, J. (eds) Neural Information Processing. ICONIP 2011. Lecture Notes in Computer Science, vol 7063. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-24958-7_75
Download citation
DOI: https://doi.org/10.1007/978-3-642-24958-7_75
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-24957-0
Online ISBN: 978-3-642-24958-7
eBook Packages: Computer ScienceComputer Science (R0)