Abstract
The paper presents statistical approach to discover semantic relations of political lexica using parallel Bulgarian-Slovak EUROPARL 7 Corpus. It employs statistical properties incorporated in the Sketch Engine software to generate concordances, co-occurrences and collocations. A comparative analysis of semantic structure of political lexica investigating synonymic, attributive and reciprocal semantic relations of most frequent key words from two parallel corpora – for both Bulgarian and Slovak languages is offered. The paper address some issue related to correct terms discovery, their translations and use in political speech. Finally, more general conclusions about semantic properties of political lexica are presented.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Gale, W., Church, K.: A program for aligning sentences in bilingual corpora. Comput. Linguist. 19(1), 5–102 (1993)
Kilgarriff, A., Reddy, S., Pomikalek, J., Avinesh, P.: A corpus factory for many languages. In: Proceedings of the LREC 2010, pp. 904–910 (2010)
Kilgarriff, A., Rundell, M.: Lexical profiling software and its lexicographic applications: a case study. In: Proceedings from EURALEX 2002, pp. 807–811 (2002)
Kilgarriff, A., Rychly, P., Smrz, P., Tugwell, D.: The sketch engine. In: Proceedings from EURALEX 2004, pp. 105–116 (2004)
Koehn, P.: Europarl: A parallel corpus for statistical machine translation. In: Proceedings from MT Summit, pp. 79–86 (2005)
Michelfeit, J.: Parallel corpora in sketch engine. In: Sketch Engine Workshop IV, Tallinn (2013) (presentation)
Ondrejovic, S.: Between purism and glocalism. In: Sociolinguistica Slovaca, vol. 8, pp. 25–32. VEDA (2014)
Stoykova, V., Petkova, E.: Automatic extraction of mathematical terms for precalculus. In: Proceedia Technology, vol. 1, pp. 464–468. Elsevier (2012)
Stoykova, V., Simkova, M., Majchrakova, D., Gajdosova, K.: Detecting time expressions for bulgarian and slovak language from electronic text corpora. Proc. Soc. Behav. Sci. 186, 257–260 (2015). Elsevier
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Stoykova, V. (2016). Using Statistical Search to Discover Semantic Relations of Political Lexica – Evidences from Bulgarian-Slovak EUROPARL 7 Corpus. In: Kotsireas, I., Rump, S., Yap, C. (eds) Mathematical Aspects of Computer and Information Sciences. MACIS 2015. Lecture Notes in Computer Science(), vol 9582. Springer, Cham. https://doi.org/10.1007/978-3-319-32859-1_28
Download citation
DOI: https://doi.org/10.1007/978-3-319-32859-1_28
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-32858-4
Online ISBN: 978-3-319-32859-1
eBook Packages: Computer ScienceComputer Science (R0)