Universal Coding of Zipf Distributions

Freund, Yoav; Orlitsky, Alon; Santhanam, Prasad; Zhang, Junan

doi:10.1007/978-3-540-45167-9_57

Universal Coding of Zipf Distributions

Yoav Freund⁸,
Alon Orlitsky⁹,
Prasad Santhanam⁹ &
…
Junan Zhang⁹

Conference paper

5274 Accesses
1 Citations

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 2777))

Abstract

Background. One of the best known results in information theory says that a data sequence x ₁,x ₂,...,x _n produced by independent random draws from a fixed distribution P over a discrete domain can be compressed into a binary sequence, or code whose expected length is at most nH(P)+1 bits, where H(P) = − ∑ _i P _i logP _i is the entropy of P. It is also known that this compression is near optimal as nH(P) is the smallest achievable expected number of code bits.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

Krichevsky, R.E., Trofimov, V.K.: The preformance of universal coding. IEEE Transactions on Information Theory 27, 199–207 (1981)
Article MATH MathSciNet Google Scholar
Zipf, G.: Selective studies and the principle of relative frequency in language. Technical report, Harvard university press (1932)
Google Scholar
Li, W.: North Shore LIJ Research Institure, http://linkage.rockefeller.edu/wli/zipf/
Jevtic, N., Orlitsky, A., Santhanam, N.P.: Universal compression of unknown alphabets. In: IEEE Symposium on Information Theory (2002)
Google Scholar
Aberg, J., Shtarkov, Y.M., Smeets, B.J.M.: Multialphabet coding with separate alphabet description. In: Proceedings of compression and complexity of sequences (1997)
Google Scholar
Orlitsky, A., Santhanam, N.P., Zhang, J.: Bounds on compression of unknown alphabets. In: IEEE Symposium on Information Theory (2003)
Google Scholar
Freund, Y.: Predicting a binary sequence almost as well as the optimal biased coin. Information and Computation 182, 73–94 (2003)
Article MATH MathSciNet Google Scholar

Download references

Author information

Authors and Affiliations

Mitsubishi Electric Research Labs, Cambridge, MA
Yoav Freund
ECE Department, UC San Diego, La Jolla, CA, 92093, USA
Alon Orlitsky, Prasad Santhanam & Junan Zhang

Authors

Yoav Freund
View author publications
You can also search for this author in PubMed Google Scholar
Alon Orlitsky
View author publications
You can also search for this author in PubMed Google Scholar
Prasad Santhanam
View author publications
You can also search for this author in PubMed Google Scholar
Junan Zhang
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

MPI for Biological Cybernetics, Spemannstr. 38, 72076, Tübingen, Germany
Bernhard Schölkopf
University of California, Santa Cruz
Manfred K. Warmuth

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Freund, Y., Orlitsky, A., Santhanam, P., Zhang, J. (2003). Universal Coding of Zipf Distributions. In: Schölkopf, B., Warmuth, M.K. (eds) Learning Theory and Kernel Machines. Lecture Notes in Computer Science(), vol 2777. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-45167-9_57

Download citation

DOI: https://doi.org/10.1007/978-3-540-45167-9_57
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-40720-1
Online ISBN: 978-3-540-45167-9
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics