Dispersion of Words in a Language Corpus

Hlaváčová, Jaroslava; Rychlý, Pavel

doi:10.1007/3-540-48239-3_58

Dispersion of Words in a Language Corpus

Jaroslava Hlaváčová³ &
Pavel Rychlý⁴

Conference paper
First Online: 01 January 1999

534 Accesses
2 Citations

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 1692))

Abstract

This paper proposes new measures for dealing with word dispersion in a language corpus - reduced frequency and rarity. Their calculation is described and some results from the Czech National Corpus (CNC) presented. Some previous approaches are briefly mentioned.

This research was supported by the GACR, Grant Nr. 405/96/K214.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Králík, J.: On the dispersion and its computation. Prague Studies in Mathematical Linguistics, Prague, Academia 1978, pp. 149–158.
Google Scholar
Oakes, M.P.: Statistics for Corpus Linguistics. Edinburgh University Press, 1998.
Google Scholar
Rychlý, P.: The Improvement of Common Statistical Measure. Proc. TSD’ 98, Brno 1998, pp. 109–112.
Google Scholar

Download references

Author information

Authors and Affiliations

Faculty of Informatics, Masaryk University, Botanická 68a, 602 00, Brno, Czech Republic
Jaroslava Hlaváčová
Institute of the Czech National Corpus, Faculty of Arts, Charles University, nám. J. Palacha 2, 116 38, Prague 1, Czech Republic
Pavel Rychlý

Authors

Jaroslava Hlaváčová
View author publications
You can also search for this author in PubMed Google Scholar
Pavel Rychlý
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Computer Science and Engineerig, Faculty of Applied Sciences, University of West Bohemia in Plzeň, Universitní 22, 306 14, Pizeň, Czech Republic
Václav Matousek , Pavel Mautner & Jana Ocelíková , &
Department of Programming Systems and Communication, Faculty of Informatics, Masaryk University Brno, Botanická 68a, 602 00, Brno, Czech Republic
Petr Sojka

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Hlaváčová, J., Rychlý, P. (1999). Dispersion of Words in a Language Corpus. In: Matousek, V., Mautner, P., Ocelíková, J., Sojka, P. (eds) Text, Speech and Dialogue. TSD 1999. Lecture Notes in Computer Science(), vol 1692. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-48239-3_58

Download citation

DOI: https://doi.org/10.1007/3-540-48239-3_58
Published: 01 October 1999
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-66494-9
Online ISBN: 978-3-540-48239-0
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics