Abstract
In this paper a layered architecture to spot and characterize vowel segments in running speech is presented. The detection process is based on neuromorphic principles, as is the use of Hebbian units in layers to implement lateral inhibition, band probability estimation and mutual exclusion. Results are presented showing how the association between the acoustic set of patterns and the phonologic set of symbols may be created. Possible applications of this methodology are to be found in speech event spotting, in the study of pathological voice and in speaker biometric characterization, among others.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Acero, A.: New Machine Learning Approaches to Speech Recognition. In: FALA 2010, Vigo, Spain, November 10-12 (2010); ISBN: 978-84-8158-510-0
Barbour, D.L., Wang, X.: Temporal Coherence Sensitivity in Auditory Cortex. J. Neurophysiol. 88, 2684–2699 (2002)
Gómez, P., Ferrández, J.M., Rodellar, V., Fernández, R.: Time-frequency Representations in Speech Perception. Neurocomputing 72, 820–830 (2009)
Gómez, P., Ferrández, J.M., Rodellar, V., Alvarez, A., Mazaira, L.M., Olalla, R., Muñoz, C.: Neuromorphic detection of speech dynamics. Neurocomputing 74(8), 1191–1202 (2011)
Greenberg, S., Ainsworth, W.H.: Speech processing in the auditory system: an overview. In: Greenberg, W.A.S. (ed.) Speech Processing in the Auditory System, pp. 1–62. Springer, New York (2004)
Hebb, D.O.: The Organization of Behavior. Wiley, New York (1949)
Huang, X., Acero, A., Hon, H.W.: Spoken Language Processing. Prentice-Hall, Upper Saddle River (2001)
Jahne, B.: Digital Image Processing. Springer, Berlin (2005)
Kohonen, T.: Self-Organizing Maps. Springer, Heidelberg (1997)
Munkong, R., Juang, B.H.: Auditory Perception and Cognition. IEEE Signal Proc. Magazine, 98–117 (May 2008)
O’Shaughnessy, D.: Speech Communication. Human and Machine. Addison-Wesley, Reading (2000)
Palmer, A., Shamma, S.: Physiological Representation of Speech. In: Greenberg, S., Ainsworth, W., Popper, A. (eds.), pp. 163–230. Springer, New York (2004)
Rose, P., Kinoshita, Y., Alderman, T.: Realistic Extrinsic Forensic Speaker Discrimination with the Diphthong /aI/. In: Proc. 11th Austr. Int. Conf. on Speech Sci. and Tech., pp. 329–334 (December 2006)
Shamma, S.: Physiological foundations of temporal integration in the perception of speech. J. Phonetics 31, 495–501 (2003)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Gómez-Vilda, P. et al. (2011). Neuromorphic Detection of Vowel Representation Spaces. In: Ferrández, J.M., Álvarez Sánchez, J.R., de la Paz, F., Toledo, F.J. (eds) New Challenges on Bioinspired Applications. IWINAC 2011. Lecture Notes in Computer Science, vol 6687. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-21326-7_1
Download citation
DOI: https://doi.org/10.1007/978-3-642-21326-7_1
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-21325-0
Online ISBN: 978-3-642-21326-7
eBook Packages: Computer ScienceComputer Science (R0)