Skip to main content
Log in

Recurrent pattern matching based stereo image coding using linear predictors

  • Published:
Multidimensional Systems and Signal Processing Aims and scope Submit manuscript

Abstract

The Multidimensional Multiscale Parser (MMP) is a pattern-matching-based generic image encoding solution which has been investigated earlier for the compression of stereo images with successful results. While first MMP-based proposals for stereo image coding employed dictionary-based techniques for disparity compensation, posterior developments have demonstrated the advantage of using predictive methods. In this paper, we focus on recent investigations on the use of predictive methods in the MMP algorithm and propose a new prediction framework for efficient stereo image coding. This framework comprises an advanced intra directional prediction model and a new linear predictive scheme for efficient disparity compensation. The linear prediction model is the main novelty of this work, combining adaptive linear models estimated by least-squares algorithm with fixed linear models provided by the block-matching algorithm. The performance of the proposed intra prediction and disparity compensation methods when applied in an MMP encoder has been evaluated experimentally. Comparisons with the current stereo image coding standards showed that the proposed MMP algorithm significantly outperforms the Stereo High Profile of H.264/AVC standard. In addition, it presents a competitive performance relative to the MV-HEVC standard. These results also suggest that current stereo image coding standards may benefit from the proposed linear prediction scheme for disparity compensation, as an extension to the omnipresent block-matching solution.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10

Similar content being viewed by others

Notes

  1. The authors would like to thank Poznan University of Technology, Nagoya University-Tanimoto Lab, HHI, GIST, NICT, Nokia and Microsoft for providing Poznan Street and Poznan Hall2, Kendo and Balloons, Book Arrival, Newspaper, Shark, GT Fly and Undo Dancer sequences, Ballet and Breakdancers, respectively.

References

  • Accame, M., De Natale, F., & Giusto, D. (1995). Hierarchical block matching for disparity estimation in stereo sequences. International Conference on Image Processing, 2, 374–377.

    Article  Google Scholar 

  • Bjøntegaard, G. (2001). Calculation of average psnr differences between RD-curves. ITU-T SG 16 Q6 VCEG, Doc VCEG-M33.

  • Carvalho, M., da Silva, E., & Finamore, W. (2002). Multidimensional signal compression using multiscale recurrent patterns. Elsevier Signal Processing, 82, 1559–1580.

    Article  MATH  Google Scholar 

  • Chen, Y., Wang, Y. K., Ugur, K., Hannuksela, M. M., Lainema, J., & Gabbouj, M. (2008). The emerging MVC standard for 3D video services. EURASIP Journal on Advances in Signal Processing, 2009, 1–13.

    Article  Google Scholar 

  • Dinstein, I., Guy, G., Rabany, J., Tzelgov, J., & Henik, A. (1988). On stereo image coding. 9th International Conference on Pattern Recognition, 1, 357–359.

    Google Scholar 

  • Duarte, M., Carvalho, M., da Silva, E., Pagliari, C., & Mendonca, G. (2005). Multiscale recurrent patterns applied to stereo image coding. IEEE Transactions on Circuits and Systems for Video Technology, 15(11), 1434–1447.

    Article  Google Scholar 

  • Ellinas, J. N., & Sangriotis, M. S. (2006). Stereo image coder based on the MRF model for disparity compensation. EURASIP Journal on Advances in Signal Processing, 2006(1), 1–13. doi:10.1155/ASP/2006/73950.

    Article  MATH  Google Scholar 

  • Frajka, A., & Zeger, K. (2002). Residual image coding for stereo image compression. In IEEE international conference on image processing (Vol. 2). http://ieeexplore.ieee.org/xpl/articleDetails.jsp?arnumber=1039926.

  • Francisco, N. C., Rodrigues, N. M. M., da Silva, E. A. B., de Carvalho, M. B., de Faria, S. M. M., da Silva, V. M. M., & Reis, M. J. C. S. (2008). Multiscale recurrent pattern image coding with a flexible partition scheme. In 15th IEEE international conference on image processing (pp. 141–144). http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=4711711.

  • Francisco, N., Rodrigues, N., da Silva, E., Carvalho, M., Faria, S., & Silva, V. (2010). Scanned compound document encoding using multiscale recurrent patterns. IEEE Transactions on Image Processing, 19(10), 2712–2724.

    Article  MathSciNet  Google Scholar 

  • Francisco, N., Rodrigues, N., da Silva, E., & Faria, S. (2012). A generic post-deblocking filter for block based image compression algorithms. Signal Processing: Image Communication, 27(9), 985–997.

    Google Scholar 

  • Graziosi, D. B., Rodrigues, N. M. M., da Silva, E. A. B., de Faria, S. M. M., & de Carvalho, M. B. (2009). Improving multiscale recurrent pattern image coding with least-squares prediction mode. In 16th IEEE international conference on image processing (ICIP) (pp. 2813–2816). http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=5414219.

  • ITU-T, ISO/IEC JTC 1/SC 29 (MPEG). (2013). High efficiency video coding. Recommendation ITU-T H.265 and ISO/IEC 23008-2.

  • ITU-T, ISO/IEC JTC1 (2010) Advanced video coding for generic audiovisual services. ITU-T Recommendation H.264 and ISO/IEC 14496-10 (MPEG-4 AVC).

  • Kaup, A., & Fecker, U. (2006). Analysis of multi-reference block matching for multi-view video coding. In Proceedings of 7th workshop digital broadcasting (pp. 33–39).

  • Li, X., & Orchard, M. (2001). Edge-directed prediction for lossless compression of natural images. IEEE Transactions on Image Processing, 10(6), 813–817.

    Article  MATH  Google Scholar 

  • Lucas, L., Rodrigues, N., de Faria, S., da Silva, E., Carvalho, M., & da Silva, V. (2010). Intra-prediction for color image coding using YUV correlation. 17th IEEE international conference on image processing (pp. 1329–1332).

  • Lucas, L., Rodrigues, N., da Silva, E., & Faria, S. (2011a). Adaptive least squares prediction for stereo image coding. 18th IEEE international conference on image processing (pp. 2013–2016).

  • Lucas, L., Rodrigues, N., da Silva, E., & Faria, S. (2011b). Stereo image coding using dynamic template-matching prediction. IEEE EUROCON2011—International conference on computer as a tool (pp. 1–4).

  • Marpe, D., Schwarz, H., & Wiegand, T. (2003). Context-based adaptive binary arithmetic coding in the H.264/AVC video compression standard. IEEE Transactions on Circuits and Systems for Video Technology, 13(7), 620–636.

    Article  Google Scholar 

  • Merkle, P., Smolic, A., Muller, K., & Wiegand, T. (2007). Efficient prediction structures for multiview video coding. IEEE Transactions on Circuits and Systems for Video Technology, 17(11), 1461–1473.

    Article  Google Scholar 

  • Mueller, K., Schwarz, H., Marpe, D., Bartnik, C., Bosse, S., Brust, H., et al. (2013). 3D high efficiency video coding for multi-view video and depth data. IEEE Transactions on Image Processing, 22(9), 3366–3378.

    Article  MathSciNet  Google Scholar 

  • Muller, K., & Vetro, A. (2014). Common test conditions of 3DV core experiments. Joint Collaborative Team on 3D Video Coding Extension Development of ITU-T SG16 WP3 and ISO/IEC JTC1/SC29/WG11, 7th Meeting, San Jos, USA.

  • Muller, K., Merkle, P., & Wiegand, T. (2011). 3-D video representation using depth maps. Proceedings of the IEEE, 99(4), 643–656.

    Article  Google Scholar 

  • Palaz, D., Tosic, I., & Frossard, P. (2011). Sparse stereo image coding with learned dictionaries. 2011 18th IEEE international conference on image processing (ICIP) (pp. 133–136).

  • Perkins, M. (1992). Data compression of stereopairs. IEEE Transactions on Communications, 40(4), 684–696.

    Article  Google Scholar 

  • Rodrigues, N., da Silva, E., Carvalho, M., Faria, S., & Silva, V. (2005). Universal image coding using multiscale recurrent patterns and prediction. IEEE international conference on image processing.

  • Rodrigues, N., da Silva, E., Carvalho, M., Faria, S., & Silva, V. (2008). On dictionary adaptation for recurrent pattern image coding. IEEE Transactions on Image Processing, 17(9), 1640–1653.

    Article  MathSciNet  Google Scholar 

  • Seo, S. H., Azimi-Sadjadi, M., & Tian, B. (2000). A least-squares-based 2-D filtering scheme for stereo image compression. IEEE Transactions on Image Processing, 9(11), 1967–1972.

    Article  Google Scholar 

  • Sethuraman, S., Siegel, M., & Jordan, A. G. (1995). Multiresolutional region-based segmentation scheme for stereoscopic image compression. Proceedings of SPIE, 2419, 265–274. doi:10.1117/12.206365.

    Article  Google Scholar 

  • Siegel, S. (1956). Non-parametric statistics for the behavioral sciences (pp. 75–83). New York: McGraw-Hill.

    Google Scholar 

  • Stankowski, J., Domanski, M., Stankiewicz, O., Konieczny, J., Siast, J., & Wegner, K. (2012). Extensions of the HEVC technology for efficient multiview video coding. 2012 19th IEEE international conference on image processing (ICIP) (pp. 225–228).

  • Tech, G., Wegner, K., Chen, Y., Hannuksela, M. M., & Boyce, J. (2013). MV-HEVC draft text 5. Joint Collaborative Team on 3D Video Coding Extensions (JCT-3V) Document JCT3V-E1004, 5th Meeting. Vienna, Austria

  • Vetro, A. (2010). Frame compatible formats for 3d video distribution. 17th IEEE international conference on image processing (pp. 2405–2408).

  • Woo, W., & Ortega, A. (2000). Overlapped block disparity compensation with adaptive windows for stereo image coding. IEEE Transactions on Circuits and Systems for Video Technology, 10(2), 194–200.

    Article  Google Scholar 

  • Woo, W., & Ortega, A. (1996). Stereo image compression with disparity compensation using the MRF model. Proc SPIE VCIP (pp. 28–41)

  • Woo, W., & Ortega, A. (1999). Optimal blockwise dependent quantization for stereo image coding. IEEE Transactions on Circuits and Systems for Video Technology, 9(6), 861–867.

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Luís F. R. Lucas.

Additional information

This project was funded by FCT—“Fundação para a Ciência e Tecnologia”, Portugal, under the Grant SFRH/BD/79553/2011. This work was partially financed by CAPES/Pro-Defesa under Grant Number 23038.009094/2013-83.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Lucas, L.F.R., Rodrigues, N.M.M., Pagliari, C.L. et al. Recurrent pattern matching based stereo image coding using linear predictors. Multidim Syst Sign Process 28, 1393–1416 (2017). https://doi.org/10.1007/s11045-016-0417-0

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11045-016-0417-0

Keywords

Navigation