Skip to main content
Log in

Parallel Implementations of Block-Based Motion Vector Estimation for Video Compression on Four Parallel Processing Systems

  • Published:
International Journal of Parallel Programming Aims and scope Submit manuscript

Abstract

Parallel algorithms, based on a distributed memory machine model, for an exhaustive search technique for motion vector estimation in video compression are being designed and evaluated. Results from the execution on a 16,384 processor MasPar MP-1 (an SIMD machine), a 140 node Intel Paragon XP/S and a 16 node IBM SP2 (two M IMD machines), and the 16 processor PASM prototype (a partitionable SIMD/MIMD mixed-mode machine) are presented. The trade-offs of using different modes of parallelism (SIMD, SPMD, and mixed-mode) and different data partitioning schemes (the rectangular and stripe subimage methods) are examined. The analytical and experimental results shown in this application study will help practitioners to predict and contrast the performance of different approaches to parallel implementation of this important video compression technique. The results presented are also applicable to a large class of image and video processing tasks. Case studies, such as the one presented here, are a necessary step in developing software tools for mapping an application task onto a single parallel machine and for mapping a set of independent application tasks, or the subtasks of a single application task, onto a heterogeneous suite of parallel machines.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Similar content being viewed by others

REFERENCES

  1. P. H. Ang, P. A. Rutz, and D. Auld, Video Compression Makes Big Gains, IEEE Spectrum, 28(10):16–19 (October 1991).

    Google Scholar 

  2. D. LeGall, MPEG: A Video Compression Standard for Multimedia Applications, Commun. ACM, 34(4):47–58 ( April 1991).

    Google Scholar 

  3. J. R. Jain and A. K. Jain, Displacement Measurement and Its Application in Interframe Image Coding, IEEE Trans. Commun., COM-29(12):1799–1808 (December 1981).

    Google Scholar 

  4. A. N. Netravali and J. D. Robbins, Motion-Compensated Television Coding: Part I, Bell Syst. Techn. J., 58(3):631–669 (March 1979).

    Google Scholar 

  5. M. Ghanbari, The Cross-Search Algorithm for Motion Estimation, IEEE Trans. Commun., 38(7):950–953 ( July 1990).

    Google Scholar 

  6. B. Liu and A. Zaccarin, New Fast Algorithms for the Estimation of Block Motion Vectors, IEEE Trans. Circuits Syst. Video Technol., 3(2):148–157 (April 1993).

    Google Scholar 

  7. A. Puri, H. M. Hang, and D. L. Shilling, An Efficient Block-Matching Algorithm for Motion Compensated Coding, IEEE Int'l. Conf. Acoustics, Speech, and Signal Proc., pp. 25.4.1–25.4.4 (1987).

  8. H. J. Siegel, J. B. Armstrong, and D. W. Watson, Mapping Computer-Vision-Related Tasks onto Reconfigurable Parallel Processing Systems, IEEE Computer, 25(2):54–63 (February 1992).

    Google Scholar 

  9. M. J. Flynn, Very High-Speed Computing Systems, Proc. IEEE, 54(12):1901–1909 (December 1966).

    Google Scholar 

  10. H. J. Siegel, Interconnection Networks for Large-Scale Parallel Processing: Theory and Case Studies, Second Edition, McGraw-Hill, New York (1990).

    Google Scholar 

  11. T. Blank, The MasPar MP-1 Architecture, IEEE Compcon, pp. 20–24 (February 1990).

  12. T. Agerwala, J. L. Martin, J. H. Mirza, D. C. Sadler, D. M. Dias, and M. Snir, SP2 System Architecture, IBM Syst. J., 34(2):152–184 (1995).

    Google Scholar 

  13. G. S. Almasi and A. Gottlieb, Highly Parallel Computing, Second Edition, Benjamin/ Cummings, Redwood City, California (1994).

    Google Scholar 

  14. H. J. Siegel, T. D. Braun, M. B. Kulaczewksi, M. Maheswaran, P. H. Pero, J. M. Siegel, J. E. So, M. Tan, M. D. Theys, and L. Wang, The PASM project: A Study of Reconfigurable Parallel Computing, Second Int'l. Symp. Parallel Architectures, Algorithms, and Networks, pp. 529–536 (June 1996).

  15. H. J. Siegel, T. Schwederski, W. G. Nation, J. B. Armstrong, L. Wang, J. T. Kuehn, R. Gupta, M. D. Allemang, D. G. Meyer, and D. W. Watson, The Design and Prototyping of the PASM Reconfigurable Parallel Processing System. In Parallel Computing: Paradigms and Applications, A. Y. Zomaya (ed.), International Thomson Computer Press, London, United Kingdom, pp. 78–114 (1996).

    Google Scholar 

  16. H. J. Siegel, M. Maheswaran, D. W. Watson, J. K. Antonio, and M. J. Atallah, Mixed-Mode System Heterogeneous Computing. In Heterogeneous Computing, M. M. Eshaghian (ed.), Artech House, Norwood, Massachusetts, pp. 19–65 (1996).

    Google Scholar 

  17. J. A. Armstrong, M. Maheswaran, M. D. Theys, H. J. Siegel, M. A. Nichols, and K. H. Casey, Parallel Image Correlation: Case Study to Examine Trade-Offs in Algorithm-to-Machine Mappings, J. Supercomputing 12(1/2):7–35 ( January 1998). [Special Issue: High-Performance Computing and Applications in Computer Graphics, Image Processing, and Computer Vision.]

    Google Scholar 

  18. N. Giolmas, D. W. Watson, D. M. Chelberg, P. V. Henstock, J. H. Yi, and H. J. Siegel, Aspects of Computational Mode and Data Distribution for Parallel Range Image Segmentation, Parallel Computing (to appear). [A preliminary version appears in the Sixth Int'l. Parallel Proc. Symp., pp. 334–342 (March 1992).]

  19. J. J. E. So, T. J. Downar, R. Janardhan, and H. J. Siegel, Mapping Conjugate Gradient Algorithms for Neutron Diffusion Applications onto SIMD, MIMD, and Mixed-Mode Machines, IJPP, 26(2):183–207 (April 1998).

    Google Scholar 

  20. M. C. Wang, W. G. Nation, J. B. Armstrong, H. J. Siegel, S. D. Kim, M. A. Nichols, and M. Gherrity, Multiple Quadratic Forms: A Case Study in the Design of Data-Parallel Algorithms, J. Parallel Distribut. Comput. [Special Issue on Data-Parallel Algorithms and Programming, 21(1):124–139 (April 1994).]

    Google Scholar 

  21. L. H. Jamieson, Characterizing Parallel Algorithms. In The Characteristics of Parallel Algorithms, L. H. Jamieson, D. G. Gannon, and R. J. Douglass (eds.), The MIT Press, Cambridge, Massachusetts, pp. 65–100 (1987).

    Google Scholar 

  22. H. J. Siegel, H. G. Dietz, and J. K. Antonio, Software Support for Heterogeneous Computing. In The Computer Science and Engineering Handbook, A. B. Tucker, Jr. (ed.), CRC Press, Boca Raton, Florida, pp. 1886–1909 (1997).

    Google Scholar 

  23. R. Freund, D. Hensgen, T. Kidd, and L. Moore, SmartNet: A Scheduling Framework for Heterogeneous Computing, Second Int'l. Symp. Parallel Architectures, Algorithms, and Networks, pp. 514–521 (June 1996).

  24. American National Standards Institute, Inc., ``Digital Processing of Video Signals––Video Coder/Decoder for Audiovisual Services at 56 to 1.536 kbit/s,” American National Standard TI.p64-199x (October 1990).

  25. International Standards Organization/Motion Pictures Expert Group 90/176, Coding of Moving Pictures and Associated Audio, Committee Draft of Standard ISO11172 (December 1990).

  26. Z. Xu and K. Hwang, Modeling Communication Overhead: MPI and MPL performance on the IBM SP2, IEEE Parallel and Distribut. Technol., pp. 9–23 (1996).

  27. Z. Xu and K. Hwang, Early Prediction of MPP Performance: The SP2, T3D, and Paragon Experiences, Parallel Computing, pp. 917–942 (October 1996).

  28. H. J. Siegel, L. Wang, J. E. So, and M. Maheswaran, Data Parallel Algorithms. In Parallel and Distributed Computing Handbook, A. Y. Zomaya (ed.), McGraw-Hill, New York, 1996, pp. 466–499 (1996).

    Google Scholar 

  29. S. M. Akramullah, I. Ahmad, and M. L. Liou, A Data-Parallel Approach for Real-Time MPEG-2 Video Encoding, J. Parallel Distribut. Comput., 30(2):129–146 (November 1995).

    Google Scholar 

  30. T. Akiyama, H. Aono, K. Aoki, K. W. Ler, B. Wilson, T. Araki, T. Morihige, H. Takeno, A. Sato, S. Nakatani, and T. Senoh, MPEG2 Video codec Using Image Compression DSP, IEEE Trans. Consumer Electronics, 40(3):466–472 (August 1994).

    Google Scholar 

  31. R. J. Gove, The MVP: A Highly-Integrated Video Compression Chip, IEEE Data Compression Conf., pp. 215–224 (March 1994).

  32. H. H. Taylor, D. Chin, and A. W. Jessup, An MPEG Encoder Implementation on the Princeton Engine Video Supercomputer, IEEE Data Compression Conf., pp. 420–429 (March 1993).

  33. P. Wayner, Digital Video Goes Real-Time, Byte, 19(1):107–112 (January 1994).

    Google Scholar 

  34. Z. Huang, Y. Takeuchi, and H. Kunieda, Distributed Load Balancing Schemes for Parallel Video Encoding System, IEICE Trans. Fundamentals of Electronics, Commun. Computer Sci., E77-A(5):923–930 (May 1994).

    Google Scholar 

  35. Y. Yu and D. Anastassiou, Software Implementation of MPEG-II Video Encoding Using Socket Programming in LAN, SPIE Conf. Digital Video Compression: Algorithms and Technol., 2187:229–240 (February 1994).

    Google Scholar 

  36. K. Shen, L. A. Rowe, and E. J. Delp, A Parallel Implementation of an MPEG1 Encoder: Faster than Real Time!, SPIE Conf. Digital Video Compression: Algorithms and Technol., pp. 407–418 (February 1995).

  37. F. Darema, D. A. George, V. A. Norton, and G. F. Pfister, A Single-Program-Multiple-Data Computational Model for EPEX/FORTRAN, Parallel Computing, 7:11–24 (April 1988).

    Google Scholar 

  38. P. Christy, Software to Support Massively Parallel Computing on the MasPar MP-1, IEEE Compcon, pp. 29–33 (February 1990).

  39. J. R. Nickolls, The Design of the MasPar MP-1: A Cost Effective Massively Parallel Computer, IEEE Compcon, pp. 25–28 (February 1990).

  40. E. K. P. Chong and S. H. Zak, An Introduction to Optimization, John Wiley, New York (1996).

    Google Scholar 

  41. C. B. Stunkel, D. G. Shea, B. Abali, M. G. Atkins, C. A. Bender, D. G. Grice, P. H. Hochschild, D. J. Joseph, B. J. Nathanson, R. A. Swetz, R. F. Stucke, M. Tsao, and P. R. Varker, The SP2 High-Performance Switch, IBM Syst. J., 34(2):185–204 (1995).

    Google Scholar 

  42. M. A. Nichols, H. J. Siegel, and H. G. Dietz, Data Management and Control-Flow Aspects of an SIMD/SPMD Parallel Language/Compiler, IEEE Trans. Parallel Distribut. Syst., 4(2):222–234 (February 1993).

    Google Scholar 

  43. S. A. Fineberg, T. L. Casavant, and H. J. Siegel, Experimental Analysis of a Mixed-Mode Parallel Architecture Using Bitonic Sequence Sorting, J. Parallel Distribut. Comput., 11(3):239–251 (March 1991).

    Google Scholar 

  44. G. Saghi, H. J. Siegel, and J. L. Gray, Predicting Performance and Selecting Modes of Parallelism: A Case Study Using Cyclic Reduction on Three Parallel Machines, J. Parallel Distribut. Comput. [Special Issue on Performance of Supercomputers, 19(3):219–233 (November 1993). ]

    Google Scholar 

  45. H. J. Siegel, J. K. Antonio, R. C. Metzger, M. Tan, and Y. A. Li, Heterogeneous Computing. In Parallel and Distributed Computing Handbook, A. Y. Zomaya (ed.), McGraw-Hill, New York, pp. 725–761 (1996).

    Google Scholar 

Download references

Authors

Rights and permissions

Reprints and permissions

About this article

Cite this article

Tan, M., Siegel, J.M. & Siegel, H.J. Parallel Implementations of Block-Based Motion Vector Estimation for Video Compression on Four Parallel Processing Systems. International Journal of Parallel Programming 27, 195–225 (1999). https://doi.org/10.1023/A:1018785512609

Download citation

  • Issue Date:

  • DOI: https://doi.org/10.1023/A:1018785512609

Navigation