Phonotactic Recognition of Greek and Cypriot Dialects from Telephone Speech

Mporas, Iosif; Ganchev, Todor; Fakotakis, Nikos

doi:10.1007/978-3-540-87881-0_16

Iosif Mporas¹,
Todor Ganchev¹ &
Nikos Fakotakis¹

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 5138))

Included in the following conference series:

Hellenic Conference on Artificial Intelligence

1756 Accesses
2 Citations

Abstract

In the present work we report recent progress in development of dialect recognition system for the Standard Modern Greek and Cypriot dialect of Greek language. Specifically, we rely on a compound recognition scheme, where the outputs of multiple phone recognizers, trained on different European languages are combined. This allows achieving higher recognition accuracy, when compared to the one of the mainstream phone recognizer. The evaluation results reported here indicate high recognition accuracy - up to 95%, which makes the proposed solution a feasible addition to existing spoken dialogue systems, such as voice banking applications, call routers, voice portals, smart-home environments, e-Government speech oriented services, etc.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Schultz, T., Kirchhoff, K.: Multilingual Speech Processing. Academic Press, Elsevier (2006)
Google Scholar
Martin, A.F., Le, A.N.: NIST 2007 Language Recognition Evaluation. In: Odyssey 2008 - The Speaker and Language Recognition Workshop ISCA Tutorial and Research Workshop (2008)
Google Scholar
Torres-Carrasquillo, P.A., Gleason, T.P., Reynolds, D.A.: Dialect Identification using Gaussian Mixture Models. In: Odyssey 2004 - The Speaker and Language Recognition Workshop, ISCA Tutorial and Research Workshop, pp. 297–300 (2004)
Google Scholar
Tong, R., Ma, B., Li, H., Chng, E.S.: Integrating Acoustic, Prosodic and Phonotactic Features for Spoken Language Identification. In: 2006 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 205–208 (2006)
Google Scholar
Campbell, W.M., Singer, E., Torres-Carrasquillo, P.A., Reynolds, D.A.: Language Recognition with Support Vector Machines. In: Odyssey 2004 - The Speaker and Language Recognition Workshop, ISCA Tutorial and Research Workshop, pp. 285–288 (2004)
Google Scholar
Braun, J., Levkowitz, H.: Automatic Language Identification with Perceptually Guided Training and Recurrent Neural Networks. In: 5th International Conference on Spoken Language Processing (ICSLP), pp. 3201–3205 (1998)
Google Scholar
Ghesquiere, P.J., Compernolle, D.V.: Flemish Accent Identification based on Formant and Duration Features. In: 2002 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 749–752 (2002)
Google Scholar
Lin, C.Y., Wang, H.C.: Fusion of Phonotactic and Prosodic Knowledge for Language Identification. In: 9th International Conference on Spoken Language Processing (ICSLP), pp. 425–428 (2006)
Google Scholar
Hazen, T., Zue, V.: Segment-based Automatic Language Identification. J. of the Acoustic Society of America 4(101), 2323–2331 (1997)
Article Google Scholar
Farinas, J., Pellegrino, F., Rouas, J.L., Andre-Obrecht, R.: Merging Segmental and Rhythm Features for Automatic Language Identification. In: 2002 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 753–756 (2002)
Google Scholar
Huang, R., Hansen, J.: Dialect/Accent Classification via Boosted Word Modeling. In: 2005 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 585–588 (2005)
Google Scholar
Campbell, W.M., Richardson, F., Reynolds, D.A.: Language Recognition with Word Lattices and Support Vector Machines. In: 2007 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 425–428 (2007)
Google Scholar
Zissman, M.: Comparison of Four Approaches to Automatic Language Identification. J. IEEE Transactions on Speech and Audio Processing 4(1), 31–44 (1996)
Article Google Scholar
Tsai, W.H., Chang, W.W.: Chinese Dialect Identification using an Acoustic-Phonotactic Model. In: 6th European Conference on Speech Communication and Technology (EUROSPEECH), pp. 367–370 (1999)
Google Scholar
Koντoσo πoυλoς NΓ.: Δι αλ εκ τo ι κα ι ιδ ιω μα τα τη ς Nεα ς Eλλ ην ικ ης Γρ ηγ oρ η (1994)
Google Scholar
Chatzi, I., Fakotakis, N., Kokkinakis, G.: Greek speech database for creation of voice driven teleservices. In: 5th European Conference on Speech Communication and Technology (EUROSPEECH), pp. 1755–1758 (1997)
Google Scholar
Kostoulas, T., Georgila, K.: Orientel Cypriot Greek Database. V.2.0 (2007)
Google Scholar
Pollak, P., Cernocky, J., Boudy, J., Choukri, K., Van den Heuvel, H., Vicsi, K., Virag, A., Siemund, R., Majewski, W., Staroniewicz, P., Tropf, H.: SpeechDat(E) - Eastern European Telephone Speech Databases. In: XLDB Workshop and Satellite Event to LREC Conference on Very Large Telephone Speech Databases (2000)
Google Scholar
Schwarz, P., Matejka, P., Cernocky, J.: Towards Lower Error Rates in Phoneme Recognition. In: Sojka, P., Kopecek, I., Pala, K. (eds.) TSD 2004. LNCS (LNAI), vol. 3206, pp. 465–472. Springer, Heidelberg (2004)
Google Scholar
Hoge, H., Draxler, C., Van den Heuvel, H., Johansen, F.T., Sanders, E., Tropf, H.S.: SpeechDat Multilingual Speech Databases for Teleservices: Across the Finish Line. In: 6th European Conference on Speech Communication and Technology (EUROSPEECH), pp. 2699–2702 (1999)
Google Scholar
Young, S., Evermann, G., Gales, M., Hain, T., Kershaw, D., Moore, G., Odell, J., Ollason, D., Povey, D., Valtchev, V., Woodland, P.: The HTK Book (for HTK Version 3.3). Cambridge University, Cambridge (2005)
Google Scholar
Clarkson, P.R., Rosenfeld, R.: Statistical Language Modeling Using the CMU-Cambridge Toolkit. In: 5th European Conference on Speech Communication and Technology (EUROSPEECH), pp. 2707–2710 (1997)
Google Scholar
Martin, A., Doddington, G., Kamm, T., Ordowski, M., Przybocki, M.: The DET curve in assessment of detection task performance. In: 5th European Conference on Speech Communication and Technology (EUROSPEECH), vol. 4, pp. 1895–1898 (1997)
Google Scholar

Download references

Author information

Authors and Affiliations

Artificial Intelligence Group, Wire Communications Laboratory, Dept. of Electrical and Computer Engineering, University of Patras, 26500, Rion, Greece
Iosif Mporas, Todor Ganchev & Nikos Fakotakis

Authors

Iosif Mporas
View author publications
You can also search for this author in PubMed Google Scholar
Todor Ganchev
View author publications
You can also search for this author in PubMed Google Scholar
Nikos Fakotakis
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

John Darzentas George A. Vouros Spyros Vosinakis Argyris Arnellos

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Mporas, I., Ganchev, T., Fakotakis, N. (2008). Phonotactic Recognition of Greek and Cypriot Dialects from Telephone Speech. In: Darzentas, J., Vouros, G.A., Vosinakis, S., Arnellos, A. (eds) Artificial Intelligence: Theories, Models and Applications. SETN 2008. Lecture Notes in Computer Science(), vol 5138. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-87881-0_16

Download citation

DOI: https://doi.org/10.1007/978-3-540-87881-0_16
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-87880-3
Online ISBN: 978-3-540-87881-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics