Abstract
This paper deals with outlier modeling within a very special framework: a segment-based speech recognizer. The recognizer is built on a neural net that, besides classifying speech segments, has to identify outliers as well. One possibility is to artificially generate outlier samples, but this is tedious, error-prone and significantly increases the training time. This study examines the alternative of applying a replicator neural net for this task, originally proposed for outlier modeling in data mining. Our findings show that with a replicator net the recognizer is capable of a very similar performance, but this time without the need for a large amount of outlier data.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Bourlard, H.A., Morgan, N.: Connectionist Speech Recognition – A Hybrid Approach. Kluwer Academic Publishers, Dordrecht (1994)
Clarkson, P., Moreno, P.J.: Onthe Use of SupportVector Machines for Phonetic Classification. In: Proceedings of ICASSP 1999, pp. 585–588 (1999)
Glass, J.R.: A Probabilistic Framework for Feature-Based Speech Recognition. In: Proceedings of ICSLP 1996, pp. 2277–2280 (1996)
Hawkins, S., He, H.X., Williams, G.J., Baxter, R.A.: Outlier Detection Using Replicator Neural Networks. In: Kambayashi, Y., Winiwarter, W., Arikawa, M. (eds.) DaWaK 2002. LNCS, vol. 2454, pp. 170–180. Springer, Heidelberg (2002)
Hecht-Nielsen, R.: Replicator Neural Networks for Universal Optimal Source Coding. Science 269, 1860–1863 (1995)
Huang, X.D., Acero, A., Hon, H.-W.: Spoken Language Processing. Prentice-Hall, Englewood Cliffs (2001)
Kocsor, A., Tóth, L., Kuba Jr., A., Kovács, K., Jelasity, M., Gyimóthy, T., Csirik, J.: A Comparative Study of Several Feature Space Transformation and Learning Methods for Phoneme Classification. International Journal of Speech Technology 3(3/4), 263–276 (2000)
Ostendorf, M., Digalakis, V., Kimball, O.A.: From HMMs to Segment Models: A Unified View of Stochastic Modeling for Speech Recognition. IEEE Trans. ASSP 4, 360–378 (1996)
Tóth, L., Kocsor, A., Kovács, K.: A Discriminative Segmental Speech Model and its Application to Hungarian Number Recognition. In: Sojka, P., Kopeček, I., Pala, K. (eds.) TSD 2000. LNCS (LNAI), vol. 1902, pp. 307–313. Springer, Heidelberg (2000)
Verhasselt, J., Illina, I., Martens, J.P., Gong, Y., Haton, J.P.: Assessing the Importance of the Segmentation Probability in Segment-Based Speech Recognition. Speech Communication 24(1), 51–72 (1998)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Tóth, L., Gosztolya, G. (2004). Replicator Neural Networks for Outlier Modeling in Segmental Speech Recognition. In: Yin, FL., Wang, J., Guo, C. (eds) Advances in Neural Networks – ISNN 2004. ISNN 2004. Lecture Notes in Computer Science, vol 3173. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-28647-9_164
Download citation
DOI: https://doi.org/10.1007/978-3-540-28647-9_164
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-22841-7
Online ISBN: 978-3-540-28647-9
eBook Packages: Springer Book Archive