Abstract
The paper deals with a corpus of the Russian language of the 19th century. The corpus offers the opportunity to accomplish some essential tasks of modern Russian linguistics like getting various linguistic and statistical information, investigating dynamic processes in the vocabulary, analyzing grammatical changes in the lexicon. To make the corpus representative, some special criteria should be determined. The corpus belongs to corpora with morphological annotation, i.e. each word form has a list of morphological features. Additionally, the metadata set includes identifiers of structural division together with external features. The metadata description is based on international recommendations.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Leech, G.: The State of Art in Corpus Linguistics, English Corpus Linguistics. In: Aijmer, K., Altenberg, B. (eds.), Longman, London, pp. 8–29 (1991)
Fillmore, C.J.: ‘Corpus linguistics’ vs. ‘Computer-aided armchair linguistics’. In: Svartvik, J. (ed.) Directions in Corpus Linguistics. Mouton de Gruyter, pp. 35–60 (1992)
Andryushchenko, V. M.: Kontseptsiya i Arkhitektura Mashinnogo Fonda Russkogo Yazyka. Yershov, A. P. (ed.) Moscow (1989)
Konferentsii, D.N.: Korpusnaya Lingvistika i Lingvisticheskiye Bazy Dannykh. Gerd, A.S. (ed.), St. Petersburg (2002)
Sichinava, D.V.: K zadache sozdaniya korpusov russkogo yazyka, Nauchno-tekhnicheskaya informatsiya. Seriya 2(11), 25–31 (2002)
Boguslavskiy, I.M., et al.: Annotirovannyy Korpus Russkikh Tekstov. Trudy Mezhdunarodnogo Seminara po Komp’yuternoy Lingvistike i Yeyo Prilozheniyam “Dialog–2000”, Protvino, Russia (2000)
Plungyan, V.A., Kustova, G.I.: Kratkoye Opisaniye Proyekta “Russkiy Standart” URL: http://rscorpora.narod.ru/zay.html
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2003 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Zakharov, V. (2003). Russian Corpus of the 19th Century. In: Matoušek, V., Mautner, P. (eds) Text, Speech and Dialogue. TSD 2003. Lecture Notes in Computer Science(), vol 2807. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-39398-6_21
Download citation
DOI: https://doi.org/10.1007/978-3-540-39398-6_21
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-20024-6
Online ISBN: 978-3-540-39398-6
eBook Packages: Springer Book Archive