Abstract
In this paper, we propose a voice code verification method for an intelligent surveillance guard robot, wherein a robot prompts for a code (i.e. word or phrase) for verification. In the application scenario, the voice code can be changed every day for security reasoning and the targeting domain is unlimited. Thus, the voice code verification system not only requires the text-prompted and speaker independent verification, but also it should not require an extra trained model as an alternative hypothesis for log-likelihood ratio test because of memory limitation. To resolve these issues, we propose to exploit the sub-word based anti-models for log-likelihood normalization through reusing an acoustic model and competing with voice code model. The anti-model is automatically produced by using the statistical distance of phonemes against a voice code. In addition, a harmonics-based spectral subtraction algorithm is applied for a noisy robust system on an outdoor environment. The performance evaluation is done by using a common Korean database, PBW452DB, which consists of 63,280 utterances of 452 isolated words recorded in silent environment.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Jiang, H., Lee, C.-H.: A new approach to utterance verification based on neighborhood information in model space. IEEE Transactions on Speech and Audio Processing 11(5) (Septmber 2003)
Matsui, T., Furui, S.: Likelihood normalization for speaker verification using a phoneme- and speaker-independent model. Speech Communication 17, 109–116 (1995)
Xiang, B., Berger, T.: Efficient text-independent speaker verification with structural Gaussian mixture models and neural network. IEEE Transactions on Speech and Audio Processing 11(5), 447–456 (2003)
Huang, X., Acero, A., Hon, H.: Spoken Language Processing. Prentice Hall PTR, Englewood Cliffs (2001)
Wessel, F., Schluter, R., Macherey, K., Ney, H.: Confidence measures for large vocabulary continuous speech recognition. IEEE Trans. Speech Audio Processing 9 (March 2001)
Kim, T., Ko, H.: Uttrance Verification Under Distributed Detection and Fusion Framework. In: Eurospeech 2003, pp. 889–892 (September 2003)
Park, H., B.A.,M.A.: Temporal ans spectral Characteristics of Korean Phonation Types. Doctor of philosophy degree thesis, The university of Taxas at Austin (August. 2002)
Hardcastle, W.J., laver, J.: The Handbook of Phonetic Sciences. Blackwell publishers Ltd, Malden (1997)
Beh, J., Ko, H.: A Novel Spectral Subtraction Scheme For Robust Speech Recognition: Spectral Subtraction using Spectral Harmonics of Speech. In: ICME 2003, pp. III 633 – III 636 (July 2003)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Lee, H., Ko, H. (2004). Voice Code Verification Algorithm Using Competing Models for User Entrance Authentication. In: Webb, G.I., Yu, X. (eds) AI 2004: Advances in Artificial Intelligence. AI 2004. Lecture Notes in Computer Science(), vol 3339. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30549-1_53
Download citation
DOI: https://doi.org/10.1007/978-3-540-30549-1_53
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-24059-4
Online ISBN: 978-3-540-30549-1
eBook Packages: Computer ScienceComputer Science (R0)