skip to main content
10.1145/3090354.3090445acmotherconferencesArticle/Chapter ViewAbstractPublication PagesbdcaConference Proceedingsconference-collections
research-article

Novel parallel Givens QR decomposition implementation on VLIW architecture with Efficient memory access for real time image processing applications

Published: 29 March 2017 Publication History

Abstract

Compressed Sensing (CS) methods impact is important in the health care systems. The acquisition and the processing speed of many medical imaging applications is highly improved using this technique. Orthogonal Matching Pursuit (OMP) is one of the widely used image reconstruction algorithms in CS techniques. Traditionally, OMP is divided on two main tasks: optimization stage and least square problem (LSP) resolution stage. The LSP resolution is the most complex and time consuming step in OMP. QRD is one of the main techniques used to solve the LSP in a very short time. However, the increasing image and video resolutions (UHDTV) makes the real time constraints very tight. In this paper, a novel implementation of QRD on VLIW DSP architecture with an efficient memory access approach is introduced. An instruction, data and loop levels parallelism (ILP, DLP and LLP) based Givens QRD kernel is designed. A robust memory access management based on intelligent loading/storing strategies and proficient data alignment techniques is adopted for the proposed parallel implementation. Therefore, memory read misses and CPU stalls are significantly diminished. As a final result the processing time is greatly reduced. The proposed scheme is implemented on C6678 DSP and reaches 2.22 GFLOPS. The performances achieved are 7.4, 3.83 and 2 times faster than standard Givens, Texas Instrument optimized QRD and state of the art, respectively.

References

[1]
D. Donoho, "Compressed Sensing, IEEE Transactions on Information Theory", Vol. 52, Issue, pp. 1289--1306, 2006.
[2]
E. Candes, T. Tao, "Near optimal Signal recovery from random projections: universal encoding strategies, IEEE Transactions on Information Theory", Vol. 52, Issue 12, pp. 5406--5425, 2006.
[3]
F. Santosa and W. Symes, "Linear inversion of band-limited reflection seismograms", SIAM J. Scientific Statistical Comput., vol. 7, pp-1307, 1986
[4]
L. Rudin, S. Osher, and E. Fatemi, "Nonlinear total variation based noise removal algorithms", Physica D: Nonlinear Phenomena, vol. 60, no. 1--4, pp. 259--268, 1992.
[5]
Lustig, M., Donoho, D., Santos, J., Pauly, J.: Compressed sensing MRI. Sig. Process. Mag. IEEE 25(2), 72--82 (2008)
[6]
G. Wright, "Magnetic resonance imaging," IEEE Signal Processing Mag., vol. 14, no. 1, pp. 56--66, Jan. 1997.
[7]
Dong Liang and Leslie Ying, Compressed-Sensing Dynamic MR Imaging with Partially Known Support, Conf Proc IEEE Eng Med Biol Soc. 2010;2010:2829--32.
[8]
M. Usman, C. Prieto, F. Odille, D. Atkinson, T. Schaeffter and P. G. Batchelor, A computationally efficient OMP-based compressed sensing reconstruction for dynamic MRI, PHYSICS IN MEDICINE AND BIOLOGY, 56 (2011) N99--N114.
[9]
Hassan Rabah, Abbes Amira, Basant Kumar Mohanty, Somaya Almaadeed and Pramod Kumar Meher, FPGA Implementation of Orthogonal Matching Pursuit for Compressive Sensing Reconstruction, IEEE Transactions on Very Large Scale Integration (VLSI) Systems, Volume: 23, Issue: 10, pp. 2209--2220, Oct. 2015.
[10]
Mohamed Najoui, Anas Hatim, Mounir Bahtat and Said Belkouch, Efficient Implementation of Givens QR Decomposition on VLIW DSP Architecture for Orthogonal Matching Pursuit Image Reconstruction, Proceedings of the Mediterranean Conference on Information & Communication Technologies 2015, Lecture Notes in Electrical Engineering 380, © Springer 2016. DOI 10.1007/978-3-319-30301-7_16
[11]
El-Amawy, A., Dharmarajan, K.R.: Parallel VLSI algorithm for stable inversion of dense matrices, IEEE Proc. 136(6) (1989)
[12]
Heath, M.T.: Numerical methods for large sparse linear least squares problems, ORNL/CSD-114, Distribution category UC-32 (1983)
[13]
Shewen Sun, Song Wei and Cong Wang, DPCC and QR Factorization-Based Color Medical Image Authentication Algorithm, IEEE International Conference on Image Analysis and Signal Processing, 2009. IASP 2009.
[14]
Yashar Naderahmadian and Saied Hosseini-Khayat, Fast and robust watermarking in still images based on QR decomposition, Multimed Tools Appl (2014) 72: 2597.
[15]
Nikolic, Z., Nguyen, H.T., Frantz, G.: Design and implementation of numerical linear algebra algorithms on fixed point DSPs, EURASIP J. Adv. Sig. Process. 2007, p. 22 (2007).
[16]
Dongjin Kim and Kyu-Ho Park; Tiled QR Decomposition and Its Optimization on CPU and GPU Computing System; 2013 42nd International Conference on Parallel Processing; pp. 744--753
[17]
Monica Lam, Software Pipelining: An Effective Scheduling Technique for VLIW Machines, Proc. SIGPLAN '88 Conference on Programming Language Design and Implementation, Atlanta, Georgia, June 22-24, 1988, pp. 318--328.
[18]
J. P. Grossman, Compiler and Architectural Techniques for Improving the Effectiveness of VLIW Compilation, International Conference on Computer Design. 2007.
[19]
TMDXEVM6678L EVM Technical Reference Manual Version 2.0, http://wfcache.advantech.com/support/TMDXEVM6678L_Technical_Reference_Manual_2V00.pdf; (Texas Instruments, USA,2011).
[20]
DSPLIB User's Manual, Release 3.4.0.0, Copyright 2014, Texas Instruments, August 27, 2014

Cited By

View all
  • (2022)Ultra-fast and efficient implementation schemes of complex matrix multiplication algorithm for VLIW architecturesComputers and Electrical Engineering10.1016/j.compeleceng.2022.108294102:COnline publication date: 1-Sep-2022
  • (2019)Optimized Implementation of Modified Gram Schmidt Algorithm on VLIW Architecture2019 4th World Conference on Complex Systems (WCCS)10.1109/ICoCS.2019.8930756(1-7)Online publication date: Apr-2019

Index Terms

  1. Novel parallel Givens QR decomposition implementation on VLIW architecture with Efficient memory access for real time image processing applications

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    BDCA'17: Proceedings of the 2nd international Conference on Big Data, Cloud and Applications
    March 2017
    685 pages
    ISBN:9781450348522
    DOI:10.1145/3090354
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    In-Cooperation

    • Ministère de I'enseignement supérieur: Ministère de I'enseignement supérieur

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 29 March 2017

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Compressed Sensing
    2. Givens
    3. Medical imaging
    4. OMP
    5. QRD
    6. VLIW

    Qualifiers

    • Research-article
    • Research
    • Refereed limited

    Conference

    BDCA'17

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)5
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 14 Feb 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2022)Ultra-fast and efficient implementation schemes of complex matrix multiplication algorithm for VLIW architecturesComputers and Electrical Engineering10.1016/j.compeleceng.2022.108294102:COnline publication date: 1-Sep-2022
    • (2019)Optimized Implementation of Modified Gram Schmidt Algorithm on VLIW Architecture2019 4th World Conference on Complex Systems (WCCS)10.1109/ICoCS.2019.8930756(1-7)Online publication date: Apr-2019

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media