Abstract
In the present work we report recent progress in development of dialect recognition system for the Standard Modern Greek and Cypriot dialect of Greek language. Specifically, we rely on a compound recognition scheme, where the outputs of multiple phone recognizers, trained on different European languages are combined. This allows achieving higher recognition accuracy, when compared to the one of the mainstream phone recognizer. The evaluation results reported here indicate high recognition accuracy - up to 95%, which makes the proposed solution a feasible addition to existing spoken dialogue systems, such as voice banking applications, call routers, voice portals, smart-home environments, e-Government speech oriented services, etc.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Schultz, T., Kirchhoff, K.: Multilingual Speech Processing. Academic Press, Elsevier (2006)
Martin, A.F., Le, A.N.: NIST 2007 Language Recognition Evaluation. In: Odyssey 2008 - The Speaker and Language Recognition Workshop ISCA Tutorial and Research Workshop (2008)
Torres-Carrasquillo, P.A., Gleason, T.P., Reynolds, D.A.: Dialect Identification using Gaussian Mixture Models. In: Odyssey 2004 - The Speaker and Language Recognition Workshop, ISCA Tutorial and Research Workshop, pp. 297–300 (2004)
Tong, R., Ma, B., Li, H., Chng, E.S.: Integrating Acoustic, Prosodic and Phonotactic Features for Spoken Language Identification. In: 2006 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 205–208 (2006)
Campbell, W.M., Singer, E., Torres-Carrasquillo, P.A., Reynolds, D.A.: Language Recognition with Support Vector Machines. In: Odyssey 2004 - The Speaker and Language Recognition Workshop, ISCA Tutorial and Research Workshop, pp. 285–288 (2004)
Braun, J., Levkowitz, H.: Automatic Language Identification with Perceptually Guided Training and Recurrent Neural Networks. In: 5th International Conference on Spoken Language Processing (ICSLP), pp. 3201–3205 (1998)
Ghesquiere, P.J., Compernolle, D.V.: Flemish Accent Identification based on Formant and Duration Features. In: 2002 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 749–752 (2002)
Lin, C.Y., Wang, H.C.: Fusion of Phonotactic and Prosodic Knowledge for Language Identification. In: 9th International Conference on Spoken Language Processing (ICSLP), pp. 425–428 (2006)
Hazen, T., Zue, V.: Segment-based Automatic Language Identification. J. of the Acoustic Society of America 4(101), 2323–2331 (1997)
Farinas, J., Pellegrino, F., Rouas, J.L., Andre-Obrecht, R.: Merging Segmental and Rhythm Features for Automatic Language Identification. In: 2002 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 753–756 (2002)
Huang, R., Hansen, J.: Dialect/Accent Classification via Boosted Word Modeling. In: 2005 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 585–588 (2005)
Campbell, W.M., Richardson, F., Reynolds, D.A.: Language Recognition with Word Lattices and Support Vector Machines. In: 2007 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 425–428 (2007)
Zissman, M.: Comparison of Four Approaches to Automatic Language Identification. J. IEEE Transactions on Speech and Audio Processing 4(1), 31–44 (1996)
Tsai, W.H., Chang, W.W.: Chinese Dialect Identification using an Acoustic-Phonotactic Model. In: 6th European Conference on Speech Communication and Technology (EUROSPEECH), pp. 367–370 (1999)
Koντoσo πoυλoς NΓ.: Δι αλ εκ τo ι κα ι ιδ ιω μα τα τη ς Nεα ς Eλλ ην ικ ης Γρ ηγ oρ η (1994)
Chatzi, I., Fakotakis, N., Kokkinakis, G.: Greek speech database for creation of voice driven teleservices. In: 5th European Conference on Speech Communication and Technology (EUROSPEECH), pp. 1755–1758 (1997)
Kostoulas, T., Georgila, K.: Orientel Cypriot Greek Database. V.2.0 (2007)
Pollak, P., Cernocky, J., Boudy, J., Choukri, K., Van den Heuvel, H., Vicsi, K., Virag, A., Siemund, R., Majewski, W., Staroniewicz, P., Tropf, H.: SpeechDat(E) - Eastern European Telephone Speech Databases. In: XLDB Workshop and Satellite Event to LREC Conference on Very Large Telephone Speech Databases (2000)
Schwarz, P., Matejka, P., Cernocky, J.: Towards Lower Error Rates in Phoneme Recognition. In: Sojka, P., Kopecek, I., Pala, K. (eds.) TSD 2004. LNCS (LNAI), vol. 3206, pp. 465–472. Springer, Heidelberg (2004)
Hoge, H., Draxler, C., Van den Heuvel, H., Johansen, F.T., Sanders, E., Tropf, H.S.: SpeechDat Multilingual Speech Databases for Teleservices: Across the Finish Line. In: 6th European Conference on Speech Communication and Technology (EUROSPEECH), pp. 2699–2702 (1999)
Young, S., Evermann, G., Gales, M., Hain, T., Kershaw, D., Moore, G., Odell, J., Ollason, D., Povey, D., Valtchev, V., Woodland, P.: The HTK Book (for HTK Version 3.3). Cambridge University, Cambridge (2005)
Clarkson, P.R., Rosenfeld, R.: Statistical Language Modeling Using the CMU-Cambridge Toolkit. In: 5th European Conference on Speech Communication and Technology (EUROSPEECH), pp. 2707–2710 (1997)
Martin, A., Doddington, G., Kamm, T., Ordowski, M., Przybocki, M.: The DET curve in assessment of detection task performance. In: 5th European Conference on Speech Communication and Technology (EUROSPEECH), vol. 4, pp. 1895–1898 (1997)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Mporas, I., Ganchev, T., Fakotakis, N. (2008). Phonotactic Recognition of Greek and Cypriot Dialects from Telephone Speech. In: Darzentas, J., Vouros, G.A., Vosinakis, S., Arnellos, A. (eds) Artificial Intelligence: Theories, Models and Applications. SETN 2008. Lecture Notes in Computer Science(), vol 5138. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-87881-0_16
Download citation
DOI: https://doi.org/10.1007/978-3-540-87881-0_16
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-87880-3
Online ISBN: 978-3-540-87881-0
eBook Packages: Computer ScienceComputer Science (R0)