Abstract
As an emerging field of speech recognition, dialect identification plays an important role for promoting applications of speech recognition technology. Since the communications among Mainland China, Hong Kong and Taiwan are becoming frequently, it is particularly necessary to identify their dialects. This paper makes contributions to this issue in the following three-folds: 1) we build a speech corpus for main dialects of the three areas; 2) we use the popular GMM based method to extensively evaluate the main dialects between Mainland China and Hong Kong and the ones between Mainland China and Taiwan, and we find the differences between Mainland China Mandarin and Taiwan Mandarin are much smaller than those between Mandarin and Cantonese, resulting in unsatisfactory results in the latter case; 3) we propose an improved method based on the analysis of GMM, namely, maximum KL distance based Gaussian component selection (MKLD-GCS) in order to improve the performance of dialect identification between Mainland China Mandarin and Taiwan Mandarin. Experimental results show that our proposed method obtains better identification performance than related methods.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Gu, M., Ma, Y.: GMM-based Chinese Dialect Identification System. J. Computer Engineering and Applications 3(43), 204–206 (2007)
Gu, M., Xia, Y.: Chinese Dialect Identification using Clustered Support Vector Machine. In: IEEE Int. Conference Neural Networks & Signal Processing, Zhenjiang, China (June 2008)
Zissman, M.A., Gleason, T.P.: Automatic Dialect Identification of Extemporaneous Conversational, Latin American Spanish speech. In: IEEE International Conference on Acoustics, Speech, and Signal Processing, pp. 777–780 (1996)
Wu, Z., Yang, Y.: Models and Methods for Speaker Recognition, pp. 27–28. Tsinghua University Press, Beijing (2009)
Jelinek, F.: Statistical Methods for Speech Recognition. MIT Press, Massachusetts (1999)
Lei, Y., Hansen, J.H.L.: Dialect Classification via Text-Independent Training and Testing for Arabic, Spanish, and Chinese. IEEE Transactions on Audio, Speech, and Language Processing 19(1), 85–96 (2011)
Reiss, R.D.: Approximate Distributions of Order Statistics. Springer, New York (1980)
Cover, T.M., Thomas, J.A.: Elements of Information Theory. John Wiley & Sons, New York (1991)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Wei, D., Zhu, JY., Zheng, WS., Lai, J. (2011). Main Dialect Identification in Mainland China, Hong Kong and Taiwan. In: Sun, Z., Lai, J., Chen, X., Tan, T. (eds) Biometric Recognition. CCBR 2011. Lecture Notes in Computer Science, vol 7098. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-25449-9_24
Download citation
DOI: https://doi.org/10.1007/978-3-642-25449-9_24
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-25448-2
Online ISBN: 978-3-642-25449-9
eBook Packages: Computer ScienceComputer Science (R0)