Abstract
In this paper, we propose a post-processing method based on a duration model to improve the performance of a keyword spotting system. The proposed duration model-based post-processing method is performed after detecting a keyword. To detect the keyword, we first combine a keyword model, a non-keyword model, and a silence model. Using the information on the detected keyword, the proposed post-processing method is then applied to determine whether or not the correct keyword is detected. To this end, we generate the duration model using Gaussian distribution in order to accommodate different duration characteristics of each phoneme. Comparing the performance of the proposed method with those of conventional anti-keyword scoring methods, it is shown that the false acceptance and the false rejection rates are reduced.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Kim, M.J., Lee, J.C.: Non-keyword model for the improvement of vocabulary independent keyword spotting system. In: Proceedings of Acoustical Society of Korea Conference, vol. 25, pp. 319–324 (2006)
Rose, R.C., Paul, D.B.: A hidden Markov model based keyword recognition system. In: Proceedings of ICASSP, pp. 129–132 (1990)
Li, X.Q., King, I.: Gaussian mixture distance for information retrieval. In: Proceedings of International Conference on Neural Networks, pp. 2544–2549 (1999)
Johnson, D.H., Sinanović, S.: Symmetrizing the Kullback–Leibler Distance. Rice University, Houston, TX, Technical Report (2001)
Kim, Y.K., Song, H.J., Kim, H.S.: Performance evaluation of non-keyword modeling for vocabulary-independent keyword spotting. In: Proceedings of International Symposium on Chinese Spoken Language Processing, pp. 420–430 (2006)
ETSI ES 202 050, Speech Processing, Transmission and Quality Aspects (STQ); Distribution Speech Recognition; Advanced Feature Extraction Algorithm (2002)
Kim, B.W., Choi, D.L., Kim, Y.I., Lee, K.H., Lee, Y.J.: Current state and future plans at SiTEC for speech corpora for common use, Malsori, pp. 175–186 (2003)
Kim, S., Oh, S., Jung, H.Y., Jeong, H.B., Kim, J.S.: Common speech database collection. In: Proceedings of Acoustical Society of Korea Conference, pp. 21–24 (2002)
Zavagliakos, D., Schwartz, R., McDonough, J.: Maximum a posteriori adaptation for large scale HMM recognizers. In: Proceedings of ICASSP, pp. 725–728 (1996)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Lee, M.J. et al. (2010). Duration Model-Based Post-processing for the Performance Improvement of a Keyword Spotting System. In: Kim, Th., Vasilakos, T., Sakurai, K., Xiao, Y., Zhao, G., Ślęzak, D. (eds) Communication and Networking. FGCN 2010. Communications in Computer and Information Science, vol 120. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-17604-3_16
Download citation
DOI: https://doi.org/10.1007/978-3-642-17604-3_16
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-17603-6
Online ISBN: 978-3-642-17604-3
eBook Packages: Computer ScienceComputer Science (R0)