Skip to main content

Face-Tracking and Coding for Video Compression

  • Conference paper
  • First Online:
Computer Vision Systems (ICVS 1999)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 1542))

Included in the following conference series:

Abstract

While computing power and transmission bandwidth have both been steadily increasing over the last few years, bandwidth rather than processing power remains the primary bottleneck for many complex multimedia applications involving communication. Current video coding algorithms use intelligent encoding to yield higher compression ratios at the cost of additional computing requirements for encoding and decoding. The use of techniques from the fields of computer vision and robotics such as object recognition, scene interpretation, and tracking can further improve compression ratios as well as provide additional information about the video sequence being transmitted. We used a new face tracking system developed in the robotics area to normalize a video sequence to centered images of the face. The face-tracking allowed us to implement a compression scheme based on Principal Component Analysis (PCA), which we call Orthonormal Basis Coding (OBC). We designed and implemented the face tracker and video codecs entirely in software. Our current implementation of OBC operates on recorded video sequences, making it appropriate for applications such as video email.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. T. S. Huang and R. Lopez, “Computer vision in next generation image and video coding,” Lecture Notes in Computer Science, vol. 0, no. 1035, pp. 13–22, 1996.

    Article  Google Scholar 

  2. “La visioconférence sur IP selon PictureTel et Intel,” 01 Reseaux, pp. 64–65, January 1998.

    Google Scholar 

  3. J. L. Crowley, “Integration and control of reactive visual processes,” Robotics and Autonomous Systems, vol. 16, no. 1, pp. 17–28, 1995.

    Article  MathSciNet  Google Scholar 

  4. J. L. Crowley, “Vision for man-machine interaction,” Robotics and Autonomous Systems, vol. 19, no. 3–4, pp. 347–358, 1997.

    Article  Google Scholar 

  5. J. L. Crowley, “Multi-modal tracking of faces for video communications,” IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 640–645, June 1997.

    Google Scholar 

  6. ITU-T Study Group XV, “Recommendation H.263: Video coding for low bit rate communication,” tech. rep., Telecommunication Standardization Sector of the International Telecommunication Union, Geneva, Switzerland, http://www.itu.ch, 1996.

    Google Scholar 

  7. “The GZIP home page,” http://www.gzip.org, May 1998.

  8. ITU-T Study Group XV, “Recommendation H.261: Video codec for audiovisual services at px64 kbit/s,” tech. rep., Telecommunication Standardization Sector of the International Telecommunication Union, Geneva, Switzerland, http://www.itu.ch, 1993.

    Google Scholar 

  9. ISO/IEC 11172-2, “Information technology-coding of moving pictures and associated audio for digital storage media at up to about 1,5 mbit/s-part 2: Video,” tech. rep., International Organization of Standardization, Geneva, Switzerland, http://www.iso.ch/cate/d22411.html, 1993.

    Google Scholar 

  10. “Berkeley MPEG research,” http://bmrc.berkeley.edu/projects/mpeg, 1997.

  11. M. J. Turk and A. Pentland, “Eigenfaces for recognition,” in IFIP Working Conference on Engineering for Human-Computer Interaction, vol. 3, pp. 71–86, 1991.

    Google Scholar 

  12. M. Kirby and L. Sirovich, “Application of the karhunen-loeve procedure for the characterization of human faces,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 12, pp. 103–108, 1990.

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

Copyright information

© 1999 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Vieux, W.E., Schwerdt, K., Crowley, J.L. (1999). Face-Tracking and Coding for Video Compression. In: Computer Vision Systems. ICVS 1999. Lecture Notes in Computer Science, vol 1542. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-49256-9_10

Download citation

  • DOI: https://doi.org/10.1007/3-540-49256-9_10

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-65459-9

  • Online ISBN: 978-3-540-49256-6

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics