Skip to main content

What Causes Phonetic Reduction in Russian Speech: New Evidence from Machine Learning Algorithms

  • Conference paper
  • First Online:
Speech and Computer (SPECOM 2021)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 12997))

Included in the following conference series:

  • 1563 Accesses

Abstract

In this paper, we describe the second stage of the study aimed at describing the factors that influence the phonetic reduction of words in Russian speech using machine learning algorithms. We discuss the limitations of the first stage of our study and try to overcome some of them by increasing the dataset and using new algorithms such as random forest, gradient boosting, and perceptron. We used the texts from the Corpus of Russian Speech as the data. The dataset was divided into two separate datasets: one consisted of single words and the other contained multiword units from our corpus. According to the results, for single words the most important features turned out to be the number of syllables and whether the word is an adjective as they were chosen by all algorithms. For the multiword units, the main features were the number of syllables, frequency in Russian spoken texts (in ipm), and token frequency in a given text. In our further research, we are going to expand the dataset and look closer on such features as text type and token frequency in a given text.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 99.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 129.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    The source code can be found here: https://github.com/dayterr/sp_article_2021.

References

  1. Jurafski, D., Bell, A., Gregory, M., Raymond, W.D.: Probabilistic relations between words: evidence from reduction in lexical production. In: Bybee, J., Hopper, P. (eds.) Frequency and the Emergence of Linguistic Structure, pp. 229–254. John Benjamins, Philadelphia (2001). https://doi.org/10.1075/tsl.45.13jur

    Chapter  Google Scholar 

  2. Kipyatkova, I.: Improving Russian LVCSR using deep neural networks for acoustic and language modeling. In: Karpov, A., Jokisch, O., Potapova, R. (eds.) SPECOM 2018. LNCS (LNAI), vol. 11096, pp. 291–300. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-99579-3_31

    Chapter  Google Scholar 

  3. Ernestus, M., Tucker, B.V.: Why we need to investigate casual speech to truly understand language production, processing and mental lexicon. Ment. Lex. 11(3), 375–400 (2016). https://doi.org/10.1075/ml.11.3.03tuc

    Article  Google Scholar 

  4. Dayter, M., Riekhakaynen, E.: Automatic prediction of word form reduction in Russian spontaneous speech. In: Karpov, A., Potapova, R. (eds.) SPECOM 2020. LNCS (LNAI), vol. 12335, pp. 119–127. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-60276-5_12

    Chapter  Google Scholar 

  5. Ernestus, M.: Voice Assimilation and Segment Reduction in Casual Dutch. A Corpus-Based Study of the Phonology-Phonetics Interface. Landelijke Onderzoekschool Taalwetenschap, Utrecht (2000)

    Google Scholar 

  6. Spilková, H.: Phonetic Reduction in Spontaneous Speech: An Investigation of Native and Non-Native Production. Norwegian University of Science and Technology, Trondheim (2014)

    Google Scholar 

  7. Stoyka, D.A.: Reduced Forms of Russian Speech: Linguistic and Extralinguistic Aspects. PhD thesis, Saint Petersburg (2016). (in Russian)

    Google Scholar 

  8. Lobanov, B.M., Tsyrulnik, L.I.: Modeling of intra-word and inter-word phonetic-acoustic phenomena in the synthesizer of Russian speech by text. In: Ideas and Methods of Experimental Study of Speech: Collection of Articles. Art. in Memory of prof. L.A. Chistovich and prof. V. A. Kozhevnikov, pp. 47–63. St. Petersburg (2008). (in Russian)

    Google Scholar 

  9. Riekhakaynen, E.: Realization of intervocalic consonant clusters in frequency words of the Russian language. Vestnik Sankt-Peterburgskogo Universiteta, Yazyk i Literatura 17(4), 672–690 (2020). https://doi.org/10.21638/spbu09.2020.411. (in Russian)

    Article  Google Scholar 

  10. Schachtenhaufen, R.: Phonetic reductions and linguistic factors. In: New Perspectives on Speech in Action. Proceedings of the 2nd SJUSK Conference on Contemporary Speech Habits, pp. 167–179. Samfundslitteratur, Frederiksberg (2013)

    Google Scholar 

  11. Pharao, N.: Consonant Reduction in Copenhagen Danish: A Study of Linguistic and Extra-linguistic Factors in Phonetic Variation and Change. Det Humanistiske Fakultet, Københavns Universitet, København (2010)

    Google Scholar 

  12. Riekhakaynen, E.: Corpora of Russian spontaneous speech as a tool for modelling natural speech production and recognition. In: 10th Annual Computing and Communication Workshop and Conference, CCWC 2020, January 2020, pp. 406–411. IEEE, Las Vegas (2020). https://doi.org/10.1109/CCWC47524.2020.9031251

  13. Ventsov, A.V., Grudeva, E.V.: A Frequency Dictionary of Russian. CHSU Publishing House, Cherepovets (2008). (in Russian)

    Google Scholar 

  14. Breiman, L.: Random forests. Mach. Learn. 45, 5–32 (2001). https://doi.org/10.1023/A:1010933404324

    Article  MATH  Google Scholar 

  15. Geurts, P., Ernst, D., Wehenkel, L.: Extremely randomized trees. Mach. Learn. 63, 3–42 (2006). https://doi.org/10.1007/s10994-006-6226-1

    Article  MATH  Google Scholar 

  16. Hastie, T., Tibshirani, R., Friedman, J.: The Elements of Statistical Learning: Data Mining, Inference, and Prediction. Springer, New York (2012). https://doi.org/10.1007/978-0-387-84858-7

    Book  MATH  Google Scholar 

  17. Aggarwal, C.C.: Machine Learning for Text. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-73531-3

    Book  MATH  Google Scholar 

  18. Manning, C.D., Raghavan, P., Schütze, H.: An Introduction to Information Retrieval. Cambridge University Press, Cambridge (2009)

    MATH  Google Scholar 

  19. Alpaydin, E.: Introduction to Machine Learning. MIT Press, Cambridge (2014)

    MATH  Google Scholar 

  20. Riekhakaynen, E.: Reduction in spontaneous speech: How to survive. In: Heegart, J., Henrichsen, P.J. (eds.) Copenhagen Studies in Language. 43: New Perspectives on Speech in Action: Proceedings of the 2nd SJUSK Conference on Contemporary Speech Habits, pp. 153–167. Samfundslitteratur, Frederiksberg (2013)

    Google Scholar 

  21. Knyazev, S.A., Pozharitskaya, S.K.: Modern Russian Language: Phonetics, Correct Pronunciation, Writing System, Spelling. Academic Project, Gaudeamus, Moscow (2011). (in Russian)

    Google Scholar 

  22. Riekhakaynen, E.I.: Recognition of Russian Speech: Context + Frequency. St. Petersburg State University, St. Petersburg (2016). (in Russian)

    Google Scholar 

  23. Apushkina, I.E.: Stressed and unstressed words in a spontaneous spoken text. In: Cherepovets Scientific Readings–2009: Proceedings of the All-Russian Conference Dedicated to the Day of the City of Cherepovets (November 2–3, 2009). Part 1. Literature Studies and Linguistics at the Beginning of the 21st Century, pp. 57–60. GOU VPO ChGU, Cherepovets (2010). (in Russian)

    Google Scholar 

  24. Guyon, I., Weston, J., Barnhill, S., Vapnik, V.: Gene selection for cancer classification using support vector machines. Mach. Learn. 46, 389–422 (2002). https://doi.org/10.1023/A:1012487302797

    Article  MATH  Google Scholar 

  25. Freund, Y., Schapire, R.E.: Large margin classification using the perceptron algorithm. Mach. Learn. 37, 277–296 (1999). https://doi.org/10.1023/A:1007662407062

    Article  MATH  Google Scholar 

  26. Zumel, N., Mount, J.: Practical Data Science with R. Manning Publications, New York (2020)

    Google Scholar 

  27. Sholle, F.: Deep Learning in Python. Piter, St. Petersburg (2018). (in Russian)

    Google Scholar 

  28. Pavlova, A.V., Svetozarova, N.D.: Phrasal Stress in Phonetic, Functional and Semantic Aspects. Flinta, Moscow (2017). (in Russian)

    Google Scholar 

Download references

Acknowledgments

The research is supported by the grant #19-012-00629 from the Russian Foundation for Basic Research. We are very grateful to the anonymous peer-reviewers for constructive comments on an earlier version of the article.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Elena Riekhakaynen .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Dayter, M., Riekhakaynen, E. (2021). What Causes Phonetic Reduction in Russian Speech: New Evidence from Machine Learning Algorithms. In: Karpov, A., Potapova, R. (eds) Speech and Computer. SPECOM 2021. Lecture Notes in Computer Science(), vol 12997. Springer, Cham. https://doi.org/10.1007/978-3-030-87802-3_14

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-87802-3_14

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-87801-6

  • Online ISBN: 978-3-030-87802-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics