Abstract
The main aim of the ORD speech corpus is to record Russian spontaneous speech in natural communicative situations. The corpus provides unique linguistic material that supports fundamental research in many scientific fields and helps solve various practical tasks, especially in speech technologies. The paper describes the methodology of creating the ORD corpus and presents its system of annotations.
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Asinovsky, A., Bogdanova, N., Rusakova, M., Ryko, A., Stepanova, S., Sherstinova, T. (2009). The ORD Speech Corpus of Russian Everyday Communication “One Speaker’s Day”: Creation Principles and Annotation. In: Matoušek, V., Mautner, P. (eds) Text, Speech and Dialogue. TSD 2009. Lecture Notes in Computer Science, vol 5729. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-04208-9_36
DOI: https://doi.org/10.1007/978-3-642-04208-9_36
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-04207-2
Online ISBN: 978-3-642-04208-9
eBook Packages: Computer Science (R0)