Skip to main content

Universal Types and Simulation of Individual Sequences

  • Conference paper
LATIN 2004: Theoretical Informatics (LATIN 2004)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2976))

Included in the following conference series:

Abstract

We define the universal type class of an individual sequence x \(_{\rm 1}^{n}\), in analogy to the classical notion used in the method of types of information theory. Two sequences of the same length are said to be of the same universal (LZ) type if and only if they yield the same set of phrases in the incremental parsing of Ziv and Lempel (1978). We show that the empirical probability distributions of any finite order k of two sequences of the same universal type converge, in the variational sense, as the sequence length increases. Consequently, the logarithms of the probabilities assigned by any k-th order probability assignment to two sequences of the same universal type converge, for any k. We estimate the size of a universal type class, and show that its behavior parallels that of the conventional counterpart, with the LZ78 code length playing the role of the empirical entropy. We present efficient procedures for enumerating the sequences in a universal type class, and for drawing a sequence from the class with uniform probability. As an application, we consider the problem of universal simulation of individual sequences. A sequence drawn with uniform probability from the universal type class of x \(_{\rm 1}^{n}\) is a good simulation of x \(_{\rm 1}^{n}\) in a well defined mathematical sense.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Csiszár, I., Körner, J.: Information Theory: Coding Theorems for Discrete Memoryless Systems. Academic, New York (1981)

    MATH  Google Scholar 

  2. Cover, T.M., Thomas, J.A.: Elements of Information Theory. John Wiley & Sons, Inc., New York (1991)

    Book  MATH  Google Scholar 

  3. Csiszár, I.: The method of types. IEEE Trans. Inform. Theory IT-44, 2505–2523 (1998)

    Article  MATH  MathSciNet  Google Scholar 

  4. Merhav, N., Weinberger, M.J.: On universal simulation of information sources using training data. Technical Report HPL-2002-263, Hewlett-Packard Laboratories (2002); To appear in IEEE Transactions on Information Theory

    Google Scholar 

  5. Ziv, J., Lempel, A.: Compression of individual sequences via variable-rate coding. IEEE Trans. Inform. Theory IT-24, 530–536 (1978)

    Article  MATH  MathSciNet  Google Scholar 

  6. Rissanen, J.: A universal data compression system. IEEE Trans. Inform. Theory IT-29, 656–664 (1983)

    Article  MATH  MathSciNet  Google Scholar 

  7. Seroussi, G.: Universal types and simulation of individual sequences (in preparation, 2003)

    Google Scholar 

  8. Knuth, D.E.: The Art of Computer Programming. Seminumerical Algorithms, 3rd edn., vol. 2. Addison-Wesley, Reading (1998)

    Google Scholar 

  9. Freiling, C., Jungreis, D., Theberge, F., Zeger, K.: Almost all complete binary prefix codes have a self-synchronizing string. IEEE Trans. Inform. Theory IT-49, 2219–2225 (2003)

    Article  MathSciNet  Google Scholar 

  10. Cover, T.M.: Enumerative source encoding. IEEE Transactions on Information Theory IT-19, 73–77 (1973)

    Article  MATH  MathSciNet  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2004 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Seroussi, G. (2004). Universal Types and Simulation of Individual Sequences. In: Farach-Colton, M. (eds) LATIN 2004: Theoretical Informatics. LATIN 2004. Lecture Notes in Computer Science, vol 2976. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-24698-5_35

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-24698-5_35

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-21258-4

  • Online ISBN: 978-3-540-24698-5

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics