Abstract
In this paper, we describe the second stage of the study aimed at describing the factors that influence the phonetic reduction of words in Russian speech using machine learning algorithms. We discuss the limitations of the first stage of our study and try to overcome some of them by increasing the dataset and using new algorithms such as random forest, gradient boosting, and perceptron. We used the texts from the Corpus of Russian Speech as the data. The dataset was divided into two separate datasets: one consisted of single words and the other contained multiword units from our corpus. According to the results, for single words the most important features turned out to be the number of syllables and whether the word is an adjective as they were chosen by all algorithms. For the multiword units, the main features were the number of syllables, frequency in Russian spoken texts (in ipm), and token frequency in a given text. In our further research, we are going to expand the dataset and look closer on such features as text type and token frequency in a given text.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
The source code can be found here: https://github.com/dayterr/sp_article_2021.
References
Jurafski, D., Bell, A., Gregory, M., Raymond, W.D.: Probabilistic relations between words: evidence from reduction in lexical production. In: Bybee, J., Hopper, P. (eds.) Frequency and the Emergence of Linguistic Structure, pp. 229–254. John Benjamins, Philadelphia (2001). https://doi.org/10.1075/tsl.45.13jur
Kipyatkova, I.: Improving Russian LVCSR using deep neural networks for acoustic and language modeling. In: Karpov, A., Jokisch, O., Potapova, R. (eds.) SPECOM 2018. LNCS (LNAI), vol. 11096, pp. 291–300. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-99579-3_31
Ernestus, M., Tucker, B.V.: Why we need to investigate casual speech to truly understand language production, processing and mental lexicon. Ment. Lex. 11(3), 375–400 (2016). https://doi.org/10.1075/ml.11.3.03tuc
Dayter, M., Riekhakaynen, E.: Automatic prediction of word form reduction in Russian spontaneous speech. In: Karpov, A., Potapova, R. (eds.) SPECOM 2020. LNCS (LNAI), vol. 12335, pp. 119–127. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-60276-5_12
Ernestus, M.: Voice Assimilation and Segment Reduction in Casual Dutch. A Corpus-Based Study of the Phonology-Phonetics Interface. Landelijke Onderzoekschool Taalwetenschap, Utrecht (2000)
Spilková, H.: Phonetic Reduction in Spontaneous Speech: An Investigation of Native and Non-Native Production. Norwegian University of Science and Technology, Trondheim (2014)
Stoyka, D.A.: Reduced Forms of Russian Speech: Linguistic and Extralinguistic Aspects. PhD thesis, Saint Petersburg (2016). (in Russian)
Lobanov, B.M., Tsyrulnik, L.I.: Modeling of intra-word and inter-word phonetic-acoustic phenomena in the synthesizer of Russian speech by text. In: Ideas and Methods of Experimental Study of Speech: Collection of Articles. Art. in Memory of prof. L.A. Chistovich and prof. V. A. Kozhevnikov, pp. 47–63. St. Petersburg (2008). (in Russian)
Riekhakaynen, E.: Realization of intervocalic consonant clusters in frequency words of the Russian language. Vestnik Sankt-Peterburgskogo Universiteta, Yazyk i Literatura 17(4), 672–690 (2020). https://doi.org/10.21638/spbu09.2020.411. (in Russian)
Schachtenhaufen, R.: Phonetic reductions and linguistic factors. In: New Perspectives on Speech in Action. Proceedings of the 2nd SJUSK Conference on Contemporary Speech Habits, pp. 167–179. Samfundslitteratur, Frederiksberg (2013)
Pharao, N.: Consonant Reduction in Copenhagen Danish: A Study of Linguistic and Extra-linguistic Factors in Phonetic Variation and Change. Det Humanistiske Fakultet, Københavns Universitet, København (2010)
Riekhakaynen, E.: Corpora of Russian spontaneous speech as a tool for modelling natural speech production and recognition. In: 10th Annual Computing and Communication Workshop and Conference, CCWC 2020, January 2020, pp. 406–411. IEEE, Las Vegas (2020). https://doi.org/10.1109/CCWC47524.2020.9031251
Ventsov, A.V., Grudeva, E.V.: A Frequency Dictionary of Russian. CHSU Publishing House, Cherepovets (2008). (in Russian)
Breiman, L.: Random forests. Mach. Learn. 45, 5–32 (2001). https://doi.org/10.1023/A:1010933404324
Geurts, P., Ernst, D., Wehenkel, L.: Extremely randomized trees. Mach. Learn. 63, 3–42 (2006). https://doi.org/10.1007/s10994-006-6226-1
Hastie, T., Tibshirani, R., Friedman, J.: The Elements of Statistical Learning: Data Mining, Inference, and Prediction. Springer, New York (2012). https://doi.org/10.1007/978-0-387-84858-7
Aggarwal, C.C.: Machine Learning for Text. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-73531-3
Manning, C.D., Raghavan, P., Schütze, H.: An Introduction to Information Retrieval. Cambridge University Press, Cambridge (2009)
Alpaydin, E.: Introduction to Machine Learning. MIT Press, Cambridge (2014)
Riekhakaynen, E.: Reduction in spontaneous speech: How to survive. In: Heegart, J., Henrichsen, P.J. (eds.) Copenhagen Studies in Language. 43: New Perspectives on Speech in Action: Proceedings of the 2nd SJUSK Conference on Contemporary Speech Habits, pp. 153–167. Samfundslitteratur, Frederiksberg (2013)
Knyazev, S.A., Pozharitskaya, S.K.: Modern Russian Language: Phonetics, Correct Pronunciation, Writing System, Spelling. Academic Project, Gaudeamus, Moscow (2011). (in Russian)
Riekhakaynen, E.I.: Recognition of Russian Speech: Context + Frequency. St. Petersburg State University, St. Petersburg (2016). (in Russian)
Apushkina, I.E.: Stressed and unstressed words in a spontaneous spoken text. In: Cherepovets Scientific Readings–2009: Proceedings of the All-Russian Conference Dedicated to the Day of the City of Cherepovets (November 2–3, 2009). Part 1. Literature Studies and Linguistics at the Beginning of the 21st Century, pp. 57–60. GOU VPO ChGU, Cherepovets (2010). (in Russian)
Guyon, I., Weston, J., Barnhill, S., Vapnik, V.: Gene selection for cancer classification using support vector machines. Mach. Learn. 46, 389–422 (2002). https://doi.org/10.1023/A:1012487302797
Freund, Y., Schapire, R.E.: Large margin classification using the perceptron algorithm. Mach. Learn. 37, 277–296 (1999). https://doi.org/10.1023/A:1007662407062
Zumel, N., Mount, J.: Practical Data Science with R. Manning Publications, New York (2020)
Sholle, F.: Deep Learning in Python. Piter, St. Petersburg (2018). (in Russian)
Pavlova, A.V., Svetozarova, N.D.: Phrasal Stress in Phonetic, Functional and Semantic Aspects. Flinta, Moscow (2017). (in Russian)
Acknowledgments
The research is supported by the grant #19-012-00629 from the Russian Foundation for Basic Research. We are very grateful to the anonymous peer-reviewers for constructive comments on an earlier version of the article.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Switzerland AG
About this paper
Cite this paper
Dayter, M., Riekhakaynen, E. (2021). What Causes Phonetic Reduction in Russian Speech: New Evidence from Machine Learning Algorithms. In: Karpov, A., Potapova, R. (eds) Speech and Computer. SPECOM 2021. Lecture Notes in Computer Science(), vol 12997. Springer, Cham. https://doi.org/10.1007/978-3-030-87802-3_14
Download citation
DOI: https://doi.org/10.1007/978-3-030-87802-3_14
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-87801-6
Online ISBN: 978-3-030-87802-3
eBook Packages: Computer ScienceComputer Science (R0)