Skip to main content
Log in

Comparison of transform coding techniques for two-dimensional arbitrarily shaped images

  • Published:
Multimedia Systems Aims and scope Submit manuscript

Abstract

Envisioned advanced multimedia video services include arbitrarily shaped (AS) image segments as well as regular rectangular images. Image segments of the TV weather report produced by the chromo-key technique [1] and image segments produced by video analysis and image segmentation [2–4] are typical examples of AS image segments. This paper explores efficient intraframe transform coding techniques for general two-dimensional (2D) AS image segments, treating the traditional rectangular images as a special case. In particular, we focus on the transform coding of the partially defined image blocks along the boundary of the AS image segments. We recognize two different approaches — thebrute force transform coding approach and theshape-adaptive transform coding approach. The former fills the uncovered area with the optimal redundant data such that the resulting transform spectrum is compact. A simple but efficient mirror image extension technique is proposed. Once augmented into full image blocks, these boundary blocks can be processed by traditional block-based transform techniques like the popular discrete cosine transform (DCT). In the second approach, we change either the transform basis or the coefficient calculation process adaptively based on the shape of the AS image segment. We propose an efficientshape-projected problem formulation to reduce the dimension of the problem. Existing coding algorithms, such as the orthogonal transform by Gilge [5] and the iterative coding by Kaup and Aach [6], can be interpreted intuitively. We also propose a new adaptive transform based on the same principle as that used in deriving the DCT from the optimal Karhunen-Loeve transform (KLT). We analyze the tradeoff relationship between compression performance, computational complexity, and codec complexity for different coding schemes. Simulation results show that complicated algorithms (e.g., iterative, adaptive) can improve the quality by 5–10 dB at some computational or hardware cost. Alternatively, the simple mirror image extension technique improves the quality by 3–4 dB without any overheads. The contributions of this paper lie in efficient problem formulations, new transform coding techniques, and numerical tradeoff analyses.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. Sandbank CP (1990) Digital Television, John Wiley, West Sussex

    Google Scholar 

  2. Musmann HG, Hotter M, Ostermann J (1989) Object-oriented analysis-synthesis coding of images. Signal Processing: Image Commun 1:117–138

    Google Scholar 

  3. Hotter M (1990) Object-oriented analysis-synthesis coding based on moving two-dimensional objects. Signal Processing: Image Commun 2:409–428

    Google Scholar 

  4. Jain A (1989) Fundamentals of Digital Image Processing. Prentice-Hall, Englewood Cliffs, NJ

    Google Scholar 

  5. Gilge M, Engelhardt T, Mehlan R (1989) Coding of arbitrarily shaped image segments based on a generalized orthogonal transform. Signal Processing: Image Commun 1:153–180

    Google Scholar 

  6. Kaup A, Aach T (1992) A new approach towards description of arbitrarily shaped image segments. International Workshop on Intelligent Signal Processing and Communication Systems, Taipei, Taiwan, pp 543–553

  7. Chang S-F, Messerschmitt DG (1993) A new approach to decoding and compositing motion-compensated DCT-based images. IEEE International Conference on Acoustics, Speech, and Signal Processing, pp 421–424

  8. Porter T, Duff T (1984) Compositing digital images. Comput Graph 18:253–259

    Google Scholar 

  9. Clarke RJ (1985) Transform coding of images. Academic Press, London

    Google Scholar 

  10. CCITT Recommendation H.261 (1990) Video Codec for Audio-visual Services at px64 kbits/s

  11. Standard Draft (1991) JPEG-9-R7

  12. Standard Draft (1990) MPEG Video Committee Draft, MPEG 90/176 Rev. 2

  13. Takahashi M, Chang S-F, Messerschmitt DG (1993) Joint shape representation and anti-aliasing for arbitrarily-shaped image objects. IEEE International Workshop on Intelligent Signal Processing and Communication Systems, Sendai, Japan

  14. Soltanian-Zadeh H, Yagle AE (1993) Fast algorithms for extrapolation of discrete band-limited signals. IEEE International Conference on Acoustics, Speech, and Signal Processing, pp 591–594

  15. Foley JD, Dam A, Feiner S, Hughes J (1990) Computer graphics: principles and practice, 2nd ed, Addison-Wesley, Reading, Mass.

    Google Scholar 

  16. Karlsson G, Vetterli M (1989) Extension of finite length signals for sub-band coding. Signal Processing 17:161–168

    Google Scholar 

  17. Golub GH, Van Loan CF (1989) Matrix Computations, 2nd edition, John Hopkins University Press, Baltimore

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Shih-Fu Chang.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Chang, SF., Messerschmitt, D.G. Comparison of transform coding techniques for two-dimensional arbitrarily shaped images. Multimedia Systems 1, 231–239 (1994). https://doi.org/10.1007/BF01223345

Download citation

  • Issue Date:

  • DOI: https://doi.org/10.1007/BF01223345

Key words

Navigation