Abstract
Texts in the “Digital Library of Serbian Cultural Heritage of the 18th Century” are encoded as a word-aligned corpus of TEI XML documents in two versions: one using traditional 18th-century orthography, including graphemes that have since disappeared from Serbian, and one using modernized and standardized Serbian spelling, which increases the legibility and searchability of these texts for modern users. The corpus also contains linguistic and semantic annotations that add modern phonetic, morphological, lexical and conceptual equivalents to the largely archaic vocabulary. By applying basic techniques of cross-lingual information retrieval to the historical dimension of a single language, and making provisions for multiple indexing and annotations, our project exposes a notoriously difficult chapter in the development of the Serbian language to a wider audience, without sacrificing the edition’s scholarly potential.
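To make the idea of a word-aligned, two-version corpus concrete, the following is a minimal sketch rather than the project's actual encoding. It assumes that each word is a TEI `<w>` element, that words in the original-orthography version carry an `xml:id`, and that words in the modernized version point back to them through the TEI `corresp` attribute. The sample words, the identifiers `o1` and `o2`, and the helper names `build_index` and `search` are invented for illustration and are not taken from the corpus.

```python
import xml.etree.ElementTree as ET

TEI_NS = "http://www.tei-c.org/ns/1.0"
XML_ID = "{http://www.w3.org/XML/1998/namespace}id"

# Hypothetical sample data: the same two words in 18th-century and
# modernized spelling. Not actual corpus content.
ORIGINAL = (
    '<s xmlns="http://www.tei-c.org/ns/1.0">'
    '<w xml:id="o1">любезни</w> <w xml:id="o2">пріятель</w>'
    "</s>"
)
MODERN = (
    '<s xmlns="http://www.tei-c.org/ns/1.0">'
    '<w corresp="#o1">љубезни</w> <w corresp="#o2">пријатељ</w>'
    "</s>"
)


def words(xml_text):
    """Yield every TEI <w> element found in an XML string."""
    root = ET.fromstring(xml_text)
    return root.iter(f"{{{TEI_NS}}}w")


def build_index(original_xml, modern_xml):
    """Map each lowercased modern spelling to its aligned original words.

    Assumes the modern version links to the original via corresp="#id".
    """
    originals = {w.get(XML_ID): w.text for w in words(original_xml)}
    index = {}
    for w in words(modern_xml):
        target = w.get("corresp", "").lstrip("#")
        if target in originals:
            index.setdefault(w.text.lower(), []).append(
                (target, originals[target])
            )
    return index


def search(index, query):
    """Return (id, original form) pairs for a query in modern spelling."""
    return index.get(query.lower(), [])


if __name__ == "__main__":
    index = build_index(ORIGINAL, MODERN)
    print(search(index, "пријатељ"))
```

Under these assumptions, a reader can type a query in modern spelling and be led to the word as it appears in the original orthography, which is the cross-lingual retrieval idea applied within one language. Further annotation layers (lemma, concept) could be indexed the same way by adding more keys per word.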
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Tasovac, T., Ermolaev, N. (2011). Encoding Diachrony: Digital Editions of Serbian 18th-Century Texts. In: Gradmann, S., Borri, F., Meghini, C., Schuldt, H. (eds) Research and Advanced Technology for Digital Libraries. TPDL 2011. Lecture Notes in Computer Science, vol 6966. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-24469-8_58
DOI: https://doi.org/10.1007/978-3-642-24469-8_58
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-24468-1
Online ISBN: 978-3-642-24469-8
eBook Packages: Computer Science (R0)