Skip to main content
Log in

High-throughput parallel DWT hardware architecture implemented on an FPGA-based platform

  • Original Research Paper
  • Published:
Journal of Real-Time Image Processing Aims and scope Submit manuscript

Abstract

To detect fast myoclonus jerks, doctors require a full-HD video capture of the patient at 100 frames per second. Real-time video compression becomes mandatory to archive/transmit the generated data. To achieve this goal, we used a certified medical imaging coder based on discrete wavelet transform (DWT). Thus, a major challenge was to design a 2D DWT architecture, achieving the throughput of 100 full-HD frames/s. The novel unified 2D DWT computation architecture performs both horizontal and vertical transform simultaneously and eliminates the problem of column-wise image pixel accesses to/from the off-chip DDR RAM. All of these factors have led to the reduction of the required off-chip DDR RAM bandwidth by more than 2X. The proposed concept uses four-port line buffers leading to pipelined parallel processing of direct memory access (DMA) read, horizontal 1D DWT, vertical 1D DWT, and DMA write. The proposed architecture has cycles per pixel of just 1/8, making it far exceeds 100 full-HD fps and well positioned for the 4K and 8K video processing. Finally, we highlighted that the developed architecture is highly scalable, outperforms state of the art and is deployed in a first video EEG medical prototype.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15
Fig. 16
Fig. 17
Fig. 18
Fig. 19
Fig. 20
Fig. 21
Fig. 22
Fig. 23
Fig. 24

Similar content being viewed by others

References

  1. Lambert, L., Despatin, J., Dhif, I., Mhedhbi, I., Ibraheem, M.S., Syed-Zahid, A., Granado, B., Hachicha, K., Pinna, A., Garda, P., Kaddouh, F., Terosiet, M., Histace, A., Romain, O., Bellet, C., Durand, F., Commes, J.P., Hochberg, S., Heudes, D., Lozeron, P., Kubis, N.: Telemedecine, electroencephalography and current issues. SMART-EEG: an innovative solution. Eur. Res. Telemed. 4(3), 81–86 (2015)

    Article  Google Scholar 

  2. Taubman, D.S., Michael, W.M., Rabbani, M.: JPEG 2000: image compression fundamentals, standards and practice. J. Electron. Imaging 11(2), 286–287 (2002)

    Article  Google Scholar 

  3. Mhedhbi, I., Kaddouh, F., Hachicha, K., Heudes, D., Hochberg, S., Garda, P.: Mask motion adaptive medical image coding. In: IEEE-EMBS International Conference on Biomedical and Health Informatics (BHI), pp. 408–411 (2014)

  4. Ysitcom.: Waaves compression format (2017) http://en.ysitcom.com/

  5. Philippe, H.S.: Procede de decodage d’image et dispositif de mise en oeuvre de ce procede. WO Patent App. PCT/FR2004/001,942 (2005)

  6. Öktem, L., Astola, J.: Hierarchical enumerative coding of locally stationary binary data. Electron. Lett. 35(17), 1428–1429 (1999)

    Article  Google Scholar 

  7. Lambert, L., Hachicha, K., Ahmed, S.Z., Pinna, A., Garda, P.: Synchronizing physiological data and video in a telemedicine application: a multimedia approach. In: 2015 37th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), pp. 181–185 (2015)

  8. Mallat, S.G.: A theory for multiresolution signal decomposition: the wavelet representation. IEEE Trans. Pattern Anal. Mach. Intell. 11(7), 674–693 (1989)

    Article  Google Scholar 

  9. Daubechies, I., Sweldens, W.: Factoring wavelet transforms into lifting steps. J. Fourier Anal. Appl. 4(3), 247–269 (1998)

    Article  MathSciNet  Google Scholar 

  10. Xiong, C., Tian, J., Liu, J.: Efficient architectures for two-dimensional discrete wavelet transform using lifting scheme. IEEE Trans. Image Process. 16(3), 607–614 (2007)

    Article  MathSciNet  Google Scholar 

  11. Xiong, C.-Y., Tian, J.-W., Liu, J.: Efficient high-speed/low-power line-based architecture for two-dimensional discrete wavelet transform using lifting scheme. IEEE Trans. Circuits Syst. Video Technol. 16(2), 309–316 (2006)

    Article  Google Scholar 

  12. Mohanty, B.K., Meher, P.K.: Memory-efficient high-speed convolution-based generic structure for multilevel 2-D DWT. IEEE Trans. Circuits Syst. Video Technol. 23(2), 353–363 (2013)

    Article  Google Scholar 

  13. Hu, Y., Jong, C.C.: A memory-efficient high-throughput architecture for lifting-based multi-level 2-D DWT. IEEE Trans. Signal Process. 61(20), 4975–4987 (2013)

    Article  MathSciNet  Google Scholar 

  14. Andra, K., Chakrabarti, C., Acharya, T.: A VLSI architecture for lifting-based forward and inverse wavelet transform. IEEE Trans. Signal Process. 50(4), 966–977 (2002)

    Article  Google Scholar 

  15. Wu, B.-F., Lin, C.-F.: A high-performance and memory-efficient pipeline architecture for the 5/3 and 9/7 discrete wavelet transform of JPEG2000 codec. IEEE Trans. Circuits Syst. Video Technol. 15(12), 1615–1628 (2005)

    Article  Google Scholar 

  16. Mohanty, B.K., Meher, P.K.: Memory efficient modular VLSI architecture for highthroughput and low-latency implementation of multilevel lifting 2-D DWT. IEEE Trans. Signal Process. 59(5), 2072–2084 (2011)

    Article  MathSciNet  Google Scholar 

  17. Aziz, S.M., Pham, D.M.: Efficient parallel architecture for multi-level forward discrete wavelet transform processors. Comput. Electr. Eng. 38(5), 1325–1335 (2012)

    Article  Google Scholar 

  18. Darji, A., Agrawal, S., Oza, A., Sinha, V., Verma, A., Merchant, S.N., Chandorkar, A.N.: Dual-scan parallel flipping architecture for a lifting-based 2-D discrete wavelet transform. IEEE Trans. Circuits Syst. II Express Briefs 61(6), 433–437 (2014)

    Article  Google Scholar 

  19. Sweldens, W.: The lifting scheme: a construction of second generation wavelets. SIAM J. Math. Anal. 29(2), 511–546 (1998)

    Article  MathSciNet  Google Scholar 

  20. Liao, H., Mandal, M.K., Cockburn, B.F.: Efficient architectures for 1-D and 2-D lifting-based wavelet transforms. IEEE Trans. Signal Process. 52(5), 1315–1326 (2004)

    Article  MathSciNet  Google Scholar 

  21. Darji, A.D., Kushwah, S.S., Merchant, S.N., Chandorkar, A.N.: High-performance hardware architectures for multi-level lifting-based discrete wavelet transform. EURASIP J. Image Video Process. 1, 1–19 (2014)

    Google Scholar 

  22. Sameen, I., Chang, Y.C., Ng, M.S., Goi, B.-M., Ooi, C.-P.: A unified FPGA-based system architecture for 2-D discrete wavelet transform. J. Signal Process. Syst. 71(2), 123–142 (2013)

    Article  Google Scholar 

  23. Hu, Y., Jong, C.C.: A memory-efficient scalable architecture for lifting-based discrete wavelet transform. IEEE Trans. Circuits Syst. II Express Briefs 60(8), 502–506 (2013)

    Article  Google Scholar 

  24. Cohen, A., Daubechies, I., Feauveau, J.-C.: Biorthogonal bases of compactly supported wavelets. Commun. Pure Appl. Math. 45(5), 485–560 (1992)

    Article  MathSciNet  Google Scholar 

  25. Mathworks.: Matlab (2017) https://www.mathworks.com/

  26. Wang, Z., Bovik, A.C., Sheikh, H.R., Simoncelli, E.P.: Image quality assessment: from error visibility to structural similarity. IEEE Trans. Image Process. 13(4), 600–612 (2004)

    Article  Google Scholar 

  27. Hsia, C.H., Chiang, J.S., Guo, J.M.: Memory-efficient hardware architecture of 2-D dual-mode lifting-based discrete wavelet transform. IEEE Trans. Circuits Syst. Video Technol. 23(4), 671–683 (2013)

    Article  Google Scholar 

  28. Bai, Y., Ahmed, S.Z., Granado, B.: FPGA implementation of hierarchical enumerative coding for locally stationary image source. In: 2013 23rd International Conference on Field programmable Logic and Applications, pp 1–6 (2013)

Download references

Acknowledgements

This paper is based upon the work in the Smart EEG project. This project is conducted jointly with the companies CIRA, Partelec, ACACIA, 2CSI, the hospitals Georges Pompidou and Lariboisiere, ETIS and LIP6 labs. The authors thank all the project members for their constructive debate.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Khalil Hachicha.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Ibraheem, M.S., Hachicha, K., Ahmed, S.Z. et al. High-throughput parallel DWT hardware architecture implemented on an FPGA-based platform. J Real-Time Image Proc 16, 2043–2057 (2019). https://doi.org/10.1007/s11554-017-0711-6

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11554-017-0711-6

Keywords

Navigation