Abstract
We investigate the use of independent component analysis (ICA) for speech feature extraction in digits speech recognition systems. We observe that this may be true for recognition tasks based on geometrical learning with little training data. In contrast to image processing, phase information is not essential for digits speech recognition. We therefore propose a scheme that removes the phase sensitivity by using an analytical description of the ICA-adapted basis functions via the Hilbert transform. Furthermore, since the basis functions are not shift invariant, we extend the method to include a frequency-based ICA stage that removes redundant time-shift information. The digits speech recognition results show promising accuracy: in the experiments, the method based on ICA and geometrical learning outperforms HMM across different numbers of training samples.
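The phase-removal idea can be illustrated with a minimal sketch. It is not the authors' implementation, and it rests on several assumptions: a windowed sinusoid (the hypothetical helper `windowed_sinusoid`) stands in for a learned ICA basis function, the analytic signal is computed with a naive DFT, and the phase-insensitive feature is taken to be the magnitude of the projection onto the analytic basis function. All function names here are illustrative.

```python
import cmath
import math


def analytic_signal(x):
    """Return x + j*H{x} (H = Hilbert transform) using a naive O(n^2) DFT."""
    n = len(x)
    spectrum = [
        sum(x[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
        for k in range(n)
    ]
    # Keep DC (and Nyquist when n is even), double the positive
    # frequencies, and zero the negative frequencies.
    gain = [0.0] * n
    gain[0] = 1.0
    if n % 2 == 0:
        gain[n // 2] = 1.0
        upper = n // 2
    else:
        upper = (n + 1) // 2
    for k in range(1, upper):
        gain[k] = 2.0
    return [
        sum(gain[k] * spectrum[k] * cmath.exp(2j * math.pi * k * t / n)
            for k in range(n)) / n
        for t in range(n)
    ]


def windowed_sinusoid(n, cycles, phase):
    """Hypothetical stand-in for an ICA-adapted basis function."""
    centre = (n - 1) / 2.0
    width = n / 6.0
    return [
        math.exp(-0.5 * ((t - centre) / width) ** 2)
        * math.cos(2 * math.pi * cycles * t / n + phase)
        for t in range(n)
    ]


def linear_response(signal, basis):
    """Ordinary (phase-sensitive) projection onto a real basis function."""
    return sum(s * b for s, b in zip(signal, basis))


def phase_invariant_response(signal, basis):
    """Magnitude of the projection onto the analytic basis function
    (an assumed choice of phase-insensitive feature)."""
    analytic_basis = analytic_signal(basis)
    return abs(sum(s * z.conjugate() for s, z in zip(signal, analytic_basis)))


if __name__ == "__main__":
    n = 64
    basis = windowed_sinusoid(n, cycles=8, phase=0.0)
    for phase in (0.0, math.pi / 4, math.pi / 2):
        signal = windowed_sinusoid(n, cycles=8, phase=phase)
        print(phase,
              linear_response(signal, basis),
              phase_invariant_response(signal, basis))
```

In this sketch the linear response changes as the input is phase shifted, while the magnitude response stays nearly constant, which is the property the abstract refers to as removing phase sensitivity.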
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Cao, W., Pan, X., Wang, S., Hu, J. (2005). Digits Speech Recognition Based on Geometrical Learning. In: Li, X., Wang, S., Dong, Z.Y. (eds) Advanced Data Mining and Applications. ADMA 2005. Lecture Notes in Computer Science, vol 3584. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11527503_50
DOI: https://doi.org/10.1007/11527503_50
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-27894-8
Online ISBN: 978-3-540-31877-4
eBook Packages: Computer Science, Computer Science (R0)