Skip to main content

Main Dialect Identification in Mainland China, Hong Kong and Taiwan

  • Conference paper
Biometric Recognition (CCBR 2011)

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 7098))

Included in the following conference series:

  • 1619 Accesses

Abstract

As an emerging field of speech recognition, dialect identification plays an important role for promoting applications of speech recognition technology. Since the communications among Mainland China, Hong Kong and Taiwan are becoming frequently, it is particularly necessary to identify their dialects. This paper makes contributions to this issue in the following three-folds: 1) we build a speech corpus for main dialects of the three areas; 2) we use the popular GMM based method to extensively evaluate the main dialects between Mainland China and Hong Kong and the ones between Mainland China and Taiwan, and we find the differences between Mainland China Mandarin and Taiwan Mandarin are much smaller than those between Mandarin and Cantonese, resulting in unsatisfactory results in the latter case; 3) we propose an improved method based on the analysis of GMM, namely, maximum KL distance based Gaussian component selection (MKLD-GCS) in order to improve the performance of dialect identification between Mainland China Mandarin and Taiwan Mandarin. Experimental results show that our proposed method obtains better identification performance than related methods.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Similar content being viewed by others

References

  1. Gu, M., Ma, Y.: GMM-based Chinese Dialect Identification System. J. Computer Engineering and Applications 3(43), 204–206 (2007)

    Google Scholar 

  2. Gu, M., Xia, Y.: Chinese Dialect Identification using Clustered Support Vector Machine. In: IEEE Int. Conference Neural Networks & Signal Processing, Zhenjiang, China (June 2008)

    Google Scholar 

  3. Zissman, M.A., Gleason, T.P.: Automatic Dialect Identification of Extemporaneous Conversational, Latin American Spanish speech. In: IEEE International Conference on Acoustics, Speech, and Signal Processing, pp. 777–780 (1996)

    Google Scholar 

  4. Wu, Z., Yang, Y.: Models and Methods for Speaker Recognition, pp. 27–28. Tsinghua University Press, Beijing (2009)

    Google Scholar 

  5. Jelinek, F.: Statistical Methods for Speech Recognition. MIT Press, Massachusetts (1999)

    Google Scholar 

  6. Lei, Y., Hansen, J.H.L.: Dialect Classification via Text-Independent Training and Testing for Arabic, Spanish, and Chinese. IEEE Transactions on Audio, Speech, and Language Processing 19(1), 85–96 (2011)

    Article  Google Scholar 

  7. Reiss, R.D.: Approximate Distributions of Order Statistics. Springer, New York (1980)

    Google Scholar 

  8. Cover, T.M., Thomas, J.A.: Elements of Information Theory. John Wiley & Sons, New York (1991)

    Book  MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2011 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Wei, D., Zhu, JY., Zheng, WS., Lai, J. (2011). Main Dialect Identification in Mainland China, Hong Kong and Taiwan. In: Sun, Z., Lai, J., Chen, X., Tan, T. (eds) Biometric Recognition. CCBR 2011. Lecture Notes in Computer Science, vol 7098. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-25449-9_24

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-25449-9_24

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-25448-2

  • Online ISBN: 978-3-642-25449-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics