Abstract
This paper introduces a large Hungarian spoken language database. This phonetically based, multi-purpose database contains various types of spontaneous and read speech from 333 monolingual speakers (about 50 minutes of speech per speaker). The paper presents the background and motivation for developing the BEA Hungarian database, describes its protocol and transcription procedure, and surveys existing and proposed research that uses it. Owing to its recording protocol and transcription, the database provides challenging material for comparing the segmental structures of speech, including across languages.
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Neuberger, T., Gyarmathy, D., Gráczi, T.E., Horváth, V., Gósy, M., Beke, A. (2014). Development of a Large Spontaneous Speech Database of Agglutinative Hungarian Language. In: Sojka, P., Horák, A., Kopeček, I., Pala, K. (eds) Text, Speech and Dialogue. TSD 2014. Lecture Notes in Computer Science, vol 8655. Springer, Cham. https://doi.org/10.1007/978-3-319-10816-2_51
DOI: https://doi.org/10.1007/978-3-319-10816-2_51
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-10815-5
Online ISBN: 978-3-319-10816-2
eBook Packages: Computer Science, Computer Science (R0)