Abstract
A framework for mapping systematically 2-dimensional (2-D) separable transforms into a parallel architecture consisting of fully pipelined linear array stages is presented. The resulting model architecture is characterized by its generality, high degree of modularity, high throughput, and the exclusive use of distributed memory and control. There is no central shared memory block to facilitate the transposition of intermediate results, as it is commonly the case in row-column image processing architectures. Avoiding shared central memory has positive implications for speed, area, power dissipation and scalability of the architecture. The architecture presented here may be used to realize any separable 2-D transform by only changing the coefficients stored in the processing elements. Pipelined linear arrays for computing the 2-D Discrete Fourier Transform and 2-D separable convolution are presented as examples and their performance is evaluated.
Similar content being viewed by others
References
A. Akansu and R. Haddad, Multiresolution Signal Decomposi-tion, Academic Press, 1992.
K.R. Rao and P. Yip, Discrete Cosine Transform: Algorithms, Advantages, and Applications, Academic Press, 1990.
J.I. Guo, C.M. Liu, and C.W. Jen, "An Novel CORDIC-Based Array Architecture for the Multidimensional Discrete Hartley Transform," IEEE Transactions on Signal Processing, vol. 43, no. 1, 1995, pp. 331–336.
Y.-P. Lee, T.-H. Chen, L.-G. Chen, M.-J. Chen, and C.-W. Ku, "A Cost-Effective Architecture for 8 ×8 Two-Dimensional DCT/IDCT Using Direct Method," IEEE Transactions on Circuits and Systems for Video Technology, vol. 7, no. 3, 1997, pp. 459–467.
V. Bhaskaran and K. Konstantinides, Image and Video Com-pression Standars, Kluwer Academic Publishers, 1995, Boston, MA, USA.
I. Gertner and M. Shamash, "VLSI Architectures for Multidi-mensional Fourier Transform Processing," IEEE Transactions on Computers, vol. C-36, 1987, pp. 1265–1274.
C. Chakrabarti and J. J´aj´a, "VLSI Architectures for Multidimen-sional Transforms," IEEE Transactions on Computers, vol. 40, 1991, pp. 1053–1057.
W. Namgoong, N. Chaddha, and T.H.Y. Meng, "Low-Power Video Encoder/Decoder Using Wavelet/TSVQ With Condi-tional Replenishment," in Proc. Int'l. Conf. on Acoustics, Speech and Signal Processing, IEEE Signal Processing Society, 1996, pp. VI-3241, Atlanta, GA, USA.
H. Lim and E.E. Swartzlander, "Multidimensional Systolic Ar-rays for Multidimensional DFTs," in Proc. Int'l. Conf. on Acous-tics, Speech and Signal Processing, IEEE Signal Processing Society, 1996, pp. VI-3277, Atlanta, GA, USA.
C.-L. Wang and Y.-T. Chang, "Highly Parallel VLSI Architec-tures for the 2-D DCTand IDCT Computations," in IEEE Region 10's Ninth Annual Int'l. Conference, 1994, pp. 295–299.
A.L. Fisher and H.T. Kung, "Synchronizing Large Systolic Arrays," in Proc. 10th. Annual Int'l. Symposium on Computer Ar-chitecture, 1983, pp. 54–58.
P. Lee and Z. Kedem, "Synthesizing Linear Array Algorithms from Nested for Loop Algorithms," IEEE Transactions on Computers, vol. 37, no. 12, 1988, pp. 1578–1598.
E.E. Swartzlander Jr., VLSI Signal Processing Systems, Kluwer-Academic, 1986.
C.D. Thompson, "Fourier Transforms in VLSI," IEEE Transactions on Computers, vol. C-32, 1983, pp. 1047–1057.
J.A. Beraldin, T. Aboulnasr, and W. Steenaart, "Efficient One-Dimensional Systolic Array Realization of the Discrete Fourier Transform," IEEE Transactions on Circuit and Systems, vol. 36, no. 1, 1989, pp. 95–100.
D.E. Dudgeon and R.M. Mersereau, Multidimensional Digital Signal Processing, Prentice-Hall, 1984.
J.F. Abramatic, F. Germain, and E. Rosencher, "Design of 2-D Recursive Filters with Separable Denominator Transfer Functions," in Proc. IEEE Int't. Conf. Accoustics, Speech and Signal Processing, April 1979, pp. 24–27.
S. Treitel and J.L. Shanks, "The Design of Multistage Separable Planar Filters," IEEE Transactions on Geoscience Electronics, vol. GE-9, no. 1, 1971, pp. 10–27.
J. Fridman, "Parallel Algorithms and Architectures for Discrete Wavelet Transforms," Ph.D. Thesis, Northeastern University, 1996.
A. Jain, Fundamentals of Digital Image Processing, Prentice-Hall, 1989.
G. Golub and C. Van Loan, Matrix Computations, Johns Hopkins University Press, 1989, Baltimore, MD, USA.
R. Tolimieri, M. An, and C. Lu, Algorithms for Discrete Fourier Transform and Convolution, New York: Springer-Verlag, 1989.
S.Y. Kung, VLSI Array Processors, Prentice Hall, 1989.
W. Shang and J.A.B. Fortes, "On Time Mapping of Uniform Dependence Algorithms into Lower Dimensional Processor Arrays," IEEE Trans. on Parallel and Distributed Systems 203 vol. 3, no. 3, 1992, pp. 350–363.
J. Bu, E.F. Deprettere, and L. Thiele, "Systolic Array Imple-mentation of Nested Loop Programs," in Proc. of IEEE Int'l Conference on Application Specific Array Processors, 1990, pp. 31–42.
S. Rajopadhye, "Synthesizing Systolic Arrays with Control Sig-nals from Recurrence Equations," Distributed Computing, vol. 3, 1988, pp. 88–105.
J. Ullman, Computational Aspects of VLSI, Computer Science Press, 1984, Rockville, MD, USA.
J. Fridman and E. Manolakos, "On the Synthesis of Regular VLSI Architectures for the 1-D Discrete Wavelet Transform," in Wavelet Applications in Signal and Image Processing II Proceedings, A. Lane and M. Unser (Eds.), SPIE, July 1994, pp. 91–104, San Diego, CA, USA.
J. Fridman and E.S. Manolakos, "Distributed Memory and Con-trol VLSI Architectures for the 1-D Discrete Wavelet Transform," in VLSI Signal Processing VII, J. Rabaey, P. Chau, and J. Eldon (Eds.), IEEE Signal Processing Society, October 1994, pp. 388–397, New York, NY, USA.
J. Fridman and E.S. Manolakos, "1-D Discrete Wavelet Trans-form: Data Dependence Analysis and Synthesis of Distributed Memory and Control Array Architectures," IEEE Transactions on Signal Processing, vol. 45, no. 5, 1997, pp. 1291–1308.
A. Stone and E.S. Manolakos, "DG2VHDL: A Tool to Facilitate the High Level Synthesis of Parallel Processing Array Archi-tectures," Journal of VLSI Signal Processing Systems, vol. 24, no. 1, 2000, pp. 99–120.
"DG2VHDL: A Tool to Facilitate the Synthesis of Parallel VLSI Architectures," http://www.cdsp.neu.edu/info/faculty/ manolakos/dg2vhdl root.html.
D.W. Knapp, Behavioral Synthesis. Digital System Design Using the Synopsys Behavioral Compiler, Prentice Hall, 1996, Upper Saddle River, NJ, USA.
A. Stone and E.S. Manolakos, "Using DG2VHDL to Synthe-size an FPGA Implementation of the 1-D Discrete Wavelet Transform," in Proc. of IEEE Signal Processing Systems (SiPS), October 1998, pp. 489–498.
J. Bonk, A. Stone, and E.S. Manolakos, "Synthesis of Array Architectures of Block Matching Motion Estimation: Design Exploration Using the Tool DG2VHDL," in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, IEEE Press, 1999, pp. 1925–1928, Phoenix, AZ USA.
R. Anamalai, "Design and Synthesis of Maximum Throughout Parallel Array Architectures for Real-Time Image Transforms, MS Thesis, ECE Dept. Northeastern University, 1998.
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Fridman, J., Manolakos, E.S. Distributed Memory Parallel Architecture Based on Modular Linear Arrays for 2-D Separable Transforms Computation. The Journal of VLSI Signal Processing-Systems for Signal, Image, and Video Technology 28, 187–203 (2001). https://doi.org/10.1023/A:1011113507906
Published:
Issue Date:
DOI: https://doi.org/10.1023/A:1011113507906