Abstract
This study investigates the use of a fuzzy logic control (FLC) mechanism in speaker adaptation (SA), specifically for the reliable determination of HMM acoustic parameters. To enhance speaker adaptation performance, the FLC mechanism is incorporated into the MAP estimate of HMM parameters for Bayesian-based adaptation and into the MLLR estimate for transformation-based adaptation. A speech recognition system that uses an FLC-supported adaptation scheme can still maintain satisfactory recognition performance even in an ordinary case.
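The abstract does not give the update formulas, so the following is only a minimal sketch of how an FLC could regulate a MAP mean estimate. The MAP mean update is the standard form for a single Gaussian; everything about the fuzzy part is an assumption made for illustration: the function name `fuzzy_tau`, the two-rule zero-order Takagi-Sugeno rule base, the membership breakpoints `low` and `high`, and the consequent values `tau_scarce` and `tau_ample` are hypothetical and are not taken from the paper.

```python
import numpy as np

def map_mean(prior_mean, frames, posteriors, tau):
    """Standard MAP update of one Gaussian mean.

    tau weights the prior mean against the adaptation data:
    a large tau keeps the estimate close to prior_mean.
    """
    occupancy = posteriors.sum()          # soft count of frames for this Gaussian
    weighted_sum = posteriors @ frames    # posterior-weighted sum of frames
    return (tau * prior_mean + weighted_sum) / (tau + occupancy)

def fuzzy_tau(occupancy, low=5.0, high=50.0, tau_scarce=20.0, tau_ample=2.0):
    """Hypothetical two-rule, zero-order Takagi-Sugeno controller.

    Rule 1: IF adaptation data is scarce THEN tau = tau_scarce
    Rule 2: IF adaptation data is ample  THEN tau = tau_ample
    All breakpoints and consequents are illustrative assumptions.
    """
    # Linear membership in "ample", clipped to [0, 1]; "scarce" is its complement.
    m_ample = float(np.clip((occupancy - low) / (high - low), 0.0, 1.0))
    m_scarce = 1.0 - m_ample
    # Memberships sum to 1, so the weighted average needs no normalisation.
    return m_scarce * tau_scarce + m_ample * tau_ample

# Usage: ten adaptation frames, all assigned to this Gaussian.
prior_mean = np.zeros(2)
frames = np.ones((10, 2))
posteriors = np.ones(10)
tau = fuzzy_tau(posteriors.sum())
adapted_mean = map_mean(prior_mean, frames, posteriors, tau)
```

In this sketch, a small amount of adaptation data yields a large `tau`, so the adapted mean stays near the prior; whether the paper's controller uses the same inputs or direction is not stated in the abstract.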
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Ding, IJ. (2010). FLC-Regulated Speaker Adaptation Mechanisms for Speech Recognition. In: Pan, JS., Chen, SM., Nguyen, N.T. (eds) Computational Collective Intelligence. Technologies and Applications. ICCCI 2010. Lecture Notes in Computer Science, vol 6422. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-16732-4_31
DOI: https://doi.org/10.1007/978-3-642-16732-4_31
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-16731-7
Online ISBN: 978-3-642-16732-4
eBook Packages: Computer Science (R0)