What Causes Phonetic Reduction in Russian Speech: New Evidence from Machine Learning Algorithms

Dayter, Maria; Riekhakaynen, Elena

doi:10.1007/978-3-030-87802-3_14

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 12997))

Included in the following conference series:

International Conference on Speech and Computer

1563 Accesses

Abstract

In this paper, we describe the second stage of the study aimed at describing the factors that influence the phonetic reduction of words in Russian speech using machine learning algorithms. We discuss the limitations of the first stage of our study and try to overcome some of them by increasing the dataset and using new algorithms such as random forest, gradient boosting, and perceptron. We used the texts from the Corpus of Russian Speech as the data. The dataset was divided into two separate datasets: one consisted of single words and the other contained multiword units from our corpus. According to the results, for single words the most important features turned out to be the number of syllables and whether the word is an adjective as they were chosen by all algorithms. For the multiword units, the main features were the number of syllables, frequency in Russian spoken texts (in ipm), and token frequency in a given text. In our further research, we are going to expand the dataset and look closer on such features as text type and token frequency in a given text.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 99.00; Price excludes VAT (USA)

Softcover Book: USD 129.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
The source code can be found here: https://github.com/dayterr/sp_article_2021.

References

Jurafski, D., Bell, A., Gregory, M., Raymond, W.D.: Probabilistic relations between words: evidence from reduction in lexical production. In: Bybee, J., Hopper, P. (eds.) Frequency and the Emergence of Linguistic Structure, pp. 229–254. John Benjamins, Philadelphia (2001). https://doi.org/10.1075/tsl.45.13jur
Chapter Google Scholar
Kipyatkova, I.: Improving Russian LVCSR using deep neural networks for acoustic and language modeling. In: Karpov, A., Jokisch, O., Potapova, R. (eds.) SPECOM 2018. LNCS (LNAI), vol. 11096, pp. 291–300. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-99579-3_31
Chapter Google Scholar
Ernestus, M., Tucker, B.V.: Why we need to investigate casual speech to truly understand language production, processing and mental lexicon. Ment. Lex. 11(3), 375–400 (2016). https://doi.org/10.1075/ml.11.3.03tuc
Article Google Scholar
Dayter, M., Riekhakaynen, E.: Automatic prediction of word form reduction in Russian spontaneous speech. In: Karpov, A., Potapova, R. (eds.) SPECOM 2020. LNCS (LNAI), vol. 12335, pp. 119–127. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-60276-5_12
Chapter Google Scholar
Ernestus, M.: Voice Assimilation and Segment Reduction in Casual Dutch. A Corpus-Based Study of the Phonology-Phonetics Interface. Landelijke Onderzoekschool Taalwetenschap, Utrecht (2000)
Google Scholar
Spilková, H.: Phonetic Reduction in Spontaneous Speech: An Investigation of Native and Non-Native Production. Norwegian University of Science and Technology, Trondheim (2014)
Google Scholar
Stoyka, D.A.: Reduced Forms of Russian Speech: Linguistic and Extralinguistic Aspects. PhD thesis, Saint Petersburg (2016). (in Russian)
Google Scholar
Lobanov, B.M., Tsyrulnik, L.I.: Modeling of intra-word and inter-word phonetic-acoustic phenomena in the synthesizer of Russian speech by text. In: Ideas and Methods of Experimental Study of Speech: Collection of Articles. Art. in Memory of prof. L.A. Chistovich and prof. V. A. Kozhevnikov, pp. 47–63. St. Petersburg (2008). (in Russian)
Google Scholar
Riekhakaynen, E.: Realization of intervocalic consonant clusters in frequency words of the Russian language. Vestnik Sankt-Peterburgskogo Universiteta, Yazyk i Literatura 17(4), 672–690 (2020). https://doi.org/10.21638/spbu09.2020.411. (in Russian)
Article Google Scholar
Schachtenhaufen, R.: Phonetic reductions and linguistic factors. In: New Perspectives on Speech in Action. Proceedings of the 2nd SJUSK Conference on Contemporary Speech Habits, pp. 167–179. Samfundslitteratur, Frederiksberg (2013)
Google Scholar
Pharao, N.: Consonant Reduction in Copenhagen Danish: A Study of Linguistic and Extra-linguistic Factors in Phonetic Variation and Change. Det Humanistiske Fakultet, Københavns Universitet, København (2010)
Google Scholar
Riekhakaynen, E.: Corpora of Russian spontaneous speech as a tool for modelling natural speech production and recognition. In: 10th Annual Computing and Communication Workshop and Conference, CCWC 2020, January 2020, pp. 406–411. IEEE, Las Vegas (2020). https://doi.org/10.1109/CCWC47524.2020.9031251
Ventsov, A.V., Grudeva, E.V.: A Frequency Dictionary of Russian. CHSU Publishing House, Cherepovets (2008). (in Russian)
Google Scholar
Breiman, L.: Random forests. Mach. Learn. 45, 5–32 (2001). https://doi.org/10.1023/A:1010933404324
Article MATH Google Scholar
Geurts, P., Ernst, D., Wehenkel, L.: Extremely randomized trees. Mach. Learn. 63, 3–42 (2006). https://doi.org/10.1007/s10994-006-6226-1
Article MATH Google Scholar
Hastie, T., Tibshirani, R., Friedman, J.: The Elements of Statistical Learning: Data Mining, Inference, and Prediction. Springer, New York (2012). https://doi.org/10.1007/978-0-387-84858-7
Book MATH Google Scholar
Aggarwal, C.C.: Machine Learning for Text. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-73531-3
Book MATH Google Scholar
Manning, C.D., Raghavan, P., Schütze, H.: An Introduction to Information Retrieval. Cambridge University Press, Cambridge (2009)
MATH Google Scholar
Alpaydin, E.: Introduction to Machine Learning. MIT Press, Cambridge (2014)
MATH Google Scholar
Riekhakaynen, E.: Reduction in spontaneous speech: How to survive. In: Heegart, J., Henrichsen, P.J. (eds.) Copenhagen Studies in Language. 43: New Perspectives on Speech in Action: Proceedings of the 2nd SJUSK Conference on Contemporary Speech Habits, pp. 153–167. Samfundslitteratur, Frederiksberg (2013)
Google Scholar
Knyazev, S.A., Pozharitskaya, S.K.: Modern Russian Language: Phonetics, Correct Pronunciation, Writing System, Spelling. Academic Project, Gaudeamus, Moscow (2011). (in Russian)
Google Scholar
Riekhakaynen, E.I.: Recognition of Russian Speech: Context + Frequency. St. Petersburg State University, St. Petersburg (2016). (in Russian)
Google Scholar
Apushkina, I.E.: Stressed and unstressed words in a spontaneous spoken text. In: Cherepovets Scientific Readings–2009: Proceedings of the All-Russian Conference Dedicated to the Day of the City of Cherepovets (November 2–3, 2009). Part 1. Literature Studies and Linguistics at the Beginning of the 21st Century, pp. 57–60. GOU VPO ChGU, Cherepovets (2010). (in Russian)
Google Scholar
Guyon, I., Weston, J., Barnhill, S., Vapnik, V.: Gene selection for cancer classification using support vector machines. Mach. Learn. 46, 389–422 (2002). https://doi.org/10.1023/A:1012487302797
Article MATH Google Scholar
Freund, Y., Schapire, R.E.: Large margin classification using the perceptron algorithm. Mach. Learn. 37, 277–296 (1999). https://doi.org/10.1023/A:1007662407062
Article MATH Google Scholar
Zumel, N., Mount, J.: Practical Data Science with R. Manning Publications, New York (2020)
Google Scholar
Sholle, F.: Deep Learning in Python. Piter, St. Petersburg (2018). (in Russian)
Google Scholar
Pavlova, A.V., Svetozarova, N.D.: Phrasal Stress in Phonetic, Functional and Semantic Aspects. Flinta, Moscow (2017). (in Russian)
Google Scholar

Download references

Acknowledgments

The research is supported by the grant #19-012-00629 from the Russian Foundation for Basic Research. We are very grateful to the anonymous peer-reviewers for constructive comments on an earlier version of the article.

Author information

Authors and Affiliations

Saint-Petersburg State University, Universitetskaya Emb. 7/9, 199034, St. Petersburg, Russia
Maria Dayter & Elena Riekhakaynen

Authors

Maria Dayter
View author publications
You can also search for this author in PubMed Google Scholar
Elena Riekhakaynen
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Elena Riekhakaynen .

Editor information

Editors and Affiliations

St. Petersburg Federal Research Center of the Russian Academy of Sciences, St. Petersburg, Russia
Alexey Karpov
Moscow State Linguistic University, Moscow, Russia
Rodmonga Potapova

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Dayter, M., Riekhakaynen, E. (2021). What Causes Phonetic Reduction in Russian Speech: New Evidence from Machine Learning Algorithms. In: Karpov, A., Potapova, R. (eds) Speech and Computer. SPECOM 2021. Lecture Notes in Computer Science(), vol 12997. Springer, Cham. https://doi.org/10.1007/978-3-030-87802-3_14

Download citation

DOI: https://doi.org/10.1007/978-3-030-87802-3_14
Published: 22 September 2021
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-87801-6
Online ISBN: 978-3-030-87802-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics