Abstract
Parallel algorithms, based on a distributed memory machine model, for an exhaustive search technique for motion vector estimation in video compression are being designed and evaluated. Results from the execution on a 16,384 processor MasPar MP-1 (an SIMD machine), a 140 node Intel Paragon XP/S and a 16 node IBM SP2 (two M IMD machines), and the 16 processor PASM prototype (a partitionable SIMD/MIMD mixed-mode machine) are presented. The trade-offs of using different modes of parallelism (SIMD, SPMD, and mixed-mode) and different data partitioning schemes (the rectangular and stripe subimage methods) are examined. The analytical and experimental results shown in this application study will help practitioners to predict and contrast the performance of different approaches to parallel implementation of this important video compression technique. The results presented are also applicable to a large class of image and video processing tasks. Case studies, such as the one presented here, are a necessary step in developing software tools for mapping an application task onto a single parallel machine and for mapping a set of independent application tasks, or the subtasks of a single application task, onto a heterogeneous suite of parallel machines.
Similar content being viewed by others
REFERENCES
P. H. Ang, P. A. Rutz, and D. Auld, Video Compression Makes Big Gains, IEEE Spectrum, 28(10):16–19 (October 1991).
D. LeGall, MPEG: A Video Compression Standard for Multimedia Applications, Commun. ACM, 34(4):47–58 ( April 1991).
J. R. Jain and A. K. Jain, Displacement Measurement and Its Application in Interframe Image Coding, IEEE Trans. Commun., COM-29(12):1799–1808 (December 1981).
A. N. Netravali and J. D. Robbins, Motion-Compensated Television Coding: Part I, Bell Syst. Techn. J., 58(3):631–669 (March 1979).
M. Ghanbari, The Cross-Search Algorithm for Motion Estimation, IEEE Trans. Commun., 38(7):950–953 ( July 1990).
B. Liu and A. Zaccarin, New Fast Algorithms for the Estimation of Block Motion Vectors, IEEE Trans. Circuits Syst. Video Technol., 3(2):148–157 (April 1993).
A. Puri, H. M. Hang, and D. L. Shilling, An Efficient Block-Matching Algorithm for Motion Compensated Coding, IEEE Int'l. Conf. Acoustics, Speech, and Signal Proc., pp. 25.4.1–25.4.4 (1987).
H. J. Siegel, J. B. Armstrong, and D. W. Watson, Mapping Computer-Vision-Related Tasks onto Reconfigurable Parallel Processing Systems, IEEE Computer, 25(2):54–63 (February 1992).
M. J. Flynn, Very High-Speed Computing Systems, Proc. IEEE, 54(12):1901–1909 (December 1966).
H. J. Siegel, Interconnection Networks for Large-Scale Parallel Processing: Theory and Case Studies, Second Edition, McGraw-Hill, New York (1990).
T. Blank, The MasPar MP-1 Architecture, IEEE Compcon, pp. 20–24 (February 1990).
T. Agerwala, J. L. Martin, J. H. Mirza, D. C. Sadler, D. M. Dias, and M. Snir, SP2 System Architecture, IBM Syst. J., 34(2):152–184 (1995).
G. S. Almasi and A. Gottlieb, Highly Parallel Computing, Second Edition, Benjamin/ Cummings, Redwood City, California (1994).
H. J. Siegel, T. D. Braun, M. B. Kulaczewksi, M. Maheswaran, P. H. Pero, J. M. Siegel, J. E. So, M. Tan, M. D. Theys, and L. Wang, The PASM project: A Study of Reconfigurable Parallel Computing, Second Int'l. Symp. Parallel Architectures, Algorithms, and Networks, pp. 529–536 (June 1996).
H. J. Siegel, T. Schwederski, W. G. Nation, J. B. Armstrong, L. Wang, J. T. Kuehn, R. Gupta, M. D. Allemang, D. G. Meyer, and D. W. Watson, The Design and Prototyping of the PASM Reconfigurable Parallel Processing System. In Parallel Computing: Paradigms and Applications, A. Y. Zomaya (ed.), International Thomson Computer Press, London, United Kingdom, pp. 78–114 (1996).
H. J. Siegel, M. Maheswaran, D. W. Watson, J. K. Antonio, and M. J. Atallah, Mixed-Mode System Heterogeneous Computing. In Heterogeneous Computing, M. M. Eshaghian (ed.), Artech House, Norwood, Massachusetts, pp. 19–65 (1996).
J. A. Armstrong, M. Maheswaran, M. D. Theys, H. J. Siegel, M. A. Nichols, and K. H. Casey, Parallel Image Correlation: Case Study to Examine Trade-Offs in Algorithm-to-Machine Mappings, J. Supercomputing 12(1/2):7–35 ( January 1998). [Special Issue: High-Performance Computing and Applications in Computer Graphics, Image Processing, and Computer Vision.]
N. Giolmas, D. W. Watson, D. M. Chelberg, P. V. Henstock, J. H. Yi, and H. J. Siegel, Aspects of Computational Mode and Data Distribution for Parallel Range Image Segmentation, Parallel Computing (to appear). [A preliminary version appears in the Sixth Int'l. Parallel Proc. Symp., pp. 334–342 (March 1992).]
J. J. E. So, T. J. Downar, R. Janardhan, and H. J. Siegel, Mapping Conjugate Gradient Algorithms for Neutron Diffusion Applications onto SIMD, MIMD, and Mixed-Mode Machines, IJPP, 26(2):183–207 (April 1998).
M. C. Wang, W. G. Nation, J. B. Armstrong, H. J. Siegel, S. D. Kim, M. A. Nichols, and M. Gherrity, Multiple Quadratic Forms: A Case Study in the Design of Data-Parallel Algorithms, J. Parallel Distribut. Comput. [Special Issue on Data-Parallel Algorithms and Programming, 21(1):124–139 (April 1994).]
L. H. Jamieson, Characterizing Parallel Algorithms. In The Characteristics of Parallel Algorithms, L. H. Jamieson, D. G. Gannon, and R. J. Douglass (eds.), The MIT Press, Cambridge, Massachusetts, pp. 65–100 (1987).
H. J. Siegel, H. G. Dietz, and J. K. Antonio, Software Support for Heterogeneous Computing. In The Computer Science and Engineering Handbook, A. B. Tucker, Jr. (ed.), CRC Press, Boca Raton, Florida, pp. 1886–1909 (1997).
R. Freund, D. Hensgen, T. Kidd, and L. Moore, SmartNet: A Scheduling Framework for Heterogeneous Computing, Second Int'l. Symp. Parallel Architectures, Algorithms, and Networks, pp. 514–521 (June 1996).
American National Standards Institute, Inc., ``Digital Processing of Video Signals––Video Coder/Decoder for Audiovisual Services at 56 to 1.536 kbit/s,” American National Standard TI.p64-199x (October 1990).
International Standards Organization/Motion Pictures Expert Group 90/176, Coding of Moving Pictures and Associated Audio, Committee Draft of Standard ISO11172 (December 1990).
Z. Xu and K. Hwang, Modeling Communication Overhead: MPI and MPL performance on the IBM SP2, IEEE Parallel and Distribut. Technol., pp. 9–23 (1996).
Z. Xu and K. Hwang, Early Prediction of MPP Performance: The SP2, T3D, and Paragon Experiences, Parallel Computing, pp. 917–942 (October 1996).
H. J. Siegel, L. Wang, J. E. So, and M. Maheswaran, Data Parallel Algorithms. In Parallel and Distributed Computing Handbook, A. Y. Zomaya (ed.), McGraw-Hill, New York, 1996, pp. 466–499 (1996).
S. M. Akramullah, I. Ahmad, and M. L. Liou, A Data-Parallel Approach for Real-Time MPEG-2 Video Encoding, J. Parallel Distribut. Comput., 30(2):129–146 (November 1995).
T. Akiyama, H. Aono, K. Aoki, K. W. Ler, B. Wilson, T. Araki, T. Morihige, H. Takeno, A. Sato, S. Nakatani, and T. Senoh, MPEG2 Video codec Using Image Compression DSP, IEEE Trans. Consumer Electronics, 40(3):466–472 (August 1994).
R. J. Gove, The MVP: A Highly-Integrated Video Compression Chip, IEEE Data Compression Conf., pp. 215–224 (March 1994).
H. H. Taylor, D. Chin, and A. W. Jessup, An MPEG Encoder Implementation on the Princeton Engine Video Supercomputer, IEEE Data Compression Conf., pp. 420–429 (March 1993).
P. Wayner, Digital Video Goes Real-Time, Byte, 19(1):107–112 (January 1994).
Z. Huang, Y. Takeuchi, and H. Kunieda, Distributed Load Balancing Schemes for Parallel Video Encoding System, IEICE Trans. Fundamentals of Electronics, Commun. Computer Sci., E77-A(5):923–930 (May 1994).
Y. Yu and D. Anastassiou, Software Implementation of MPEG-II Video Encoding Using Socket Programming in LAN, SPIE Conf. Digital Video Compression: Algorithms and Technol., 2187:229–240 (February 1994).
K. Shen, L. A. Rowe, and E. J. Delp, A Parallel Implementation of an MPEG1 Encoder: Faster than Real Time!, SPIE Conf. Digital Video Compression: Algorithms and Technol., pp. 407–418 (February 1995).
F. Darema, D. A. George, V. A. Norton, and G. F. Pfister, A Single-Program-Multiple-Data Computational Model for EPEX/FORTRAN, Parallel Computing, 7:11–24 (April 1988).
P. Christy, Software to Support Massively Parallel Computing on the MasPar MP-1, IEEE Compcon, pp. 29–33 (February 1990).
J. R. Nickolls, The Design of the MasPar MP-1: A Cost Effective Massively Parallel Computer, IEEE Compcon, pp. 25–28 (February 1990).
E. K. P. Chong and S. H. Zak, An Introduction to Optimization, John Wiley, New York (1996).
C. B. Stunkel, D. G. Shea, B. Abali, M. G. Atkins, C. A. Bender, D. G. Grice, P. H. Hochschild, D. J. Joseph, B. J. Nathanson, R. A. Swetz, R. F. Stucke, M. Tsao, and P. R. Varker, The SP2 High-Performance Switch, IBM Syst. J., 34(2):185–204 (1995).
M. A. Nichols, H. J. Siegel, and H. G. Dietz, Data Management and Control-Flow Aspects of an SIMD/SPMD Parallel Language/Compiler, IEEE Trans. Parallel Distribut. Syst., 4(2):222–234 (February 1993).
S. A. Fineberg, T. L. Casavant, and H. J. Siegel, Experimental Analysis of a Mixed-Mode Parallel Architecture Using Bitonic Sequence Sorting, J. Parallel Distribut. Comput., 11(3):239–251 (March 1991).
G. Saghi, H. J. Siegel, and J. L. Gray, Predicting Performance and Selecting Modes of Parallelism: A Case Study Using Cyclic Reduction on Three Parallel Machines, J. Parallel Distribut. Comput. [Special Issue on Performance of Supercomputers, 19(3):219–233 (November 1993). ]
H. J. Siegel, J. K. Antonio, R. C. Metzger, M. Tan, and Y. A. Li, Heterogeneous Computing. In Parallel and Distributed Computing Handbook, A. Y. Zomaya (ed.), McGraw-Hill, New York, pp. 725–761 (1996).
Rights and permissions
About this article
Cite this article
Tan, M., Siegel, J.M. & Siegel, H.J. Parallel Implementations of Block-Based Motion Vector Estimation for Video Compression on Four Parallel Processing Systems. International Journal of Parallel Programming 27, 195–225 (1999). https://doi.org/10.1023/A:1018785512609
Issue Date:
DOI: https://doi.org/10.1023/A:1018785512609