Elsevier

Digital Signal Processing

Volume 29, June 2014, Pages 45-53
Digital Signal Processing

Binocular energy response based quality assessment of stereoscopic images

https://doi.org/10.1016/j.dsp.2014.03.003Get rights and content

Abstract

Perceptual quality assessment in three-dimensional (3D) is challenging. In this research, we propose a binocular energy response based quality assessment metric of stereoscopic images. To be more specific, we first construct binocular energy responses of the original and distorted images, and measure the similarity between them as Image Quality Metric (IQM). Then, the binocular response and the binocular masking components are used to modulate the IQM, respectively. Finally, two evaluation results are nonlinearly integrated into an overall score by considering the importance of each component. Experimental results show that compared with the relevant existing metrics, the proposed metric can achieve higher consistency with the subjective assessment of stereoscopic image.

Introduction

Three-dimensional (3D) imaging technologies have been researched widely recently [1], the application of which ranges from content creation, video coding, network transmission and stereoscopic display. Therefore, designing for objective perceptual 3D image quality assessment (3D-IQA) approach is increasingly important [2], since such perceptual issue is hardly considered in the traditional 2D image quality assessment (2D-IQA). Following the research of 2D-IQA, 3D-IQA approaches fall into two categories: subjective assessment and objective assessment. Specifically, the development of objective quality assessment models has been a fruitful area of work.

In the aspect of objective/subjective assessment, the term ‘quality of experience (QoE)’ should be considered to capture the various factors that contribute to the overall visual experience of the 3D visual signal [3]. In contrast to the 2D case, QoE of 3D involves not only evaluating 2D image quality, but also additional aspects of quality, e.g., depth perception, visual comfort, and other visual experience. Many 2D-IQA metrics were proposed during the last decade, such as Structural SIMilarity (SSIM) [4], visual signal-to-noise ratio (VSNR) [5], visual information fidelity (VIF) [6], etc. However, the direct use of 2D-IQA in measuring 3D-IQA may not be straightforward, since the above 3D perceptual attributes needed to be considered. Lambooij et al. constructed a 3D quality model as a weighted sum of 2D image quality and perceived depth, and the model was validated by subjective experiments [7]. Chen et al. explored 3D QoE by constructing the visual experience as a weight sum of image quality, depth quantity and visual comfort, and subjective experiments were conducted to test the model [8]. However, these methods remained on a subjective level to explore the combination of various perceptual scales.

Currently, some publicly available 3D databases were provided, such as LIVE 3D IQA database [9], EPFL 3D image database [10], IRCCyN/IVC 3D image database [11], etc., by adding different types of stimuli (e.g., image distortion or camera distance) on both left and right images. Some objective 3D-IQA metrics were proposed by verifying on the databases. Research on objective 3D-IQA can be divided into two categories based on the involved information for evaluation. The most direct use of state-of-the-art 2D-IQA approaches in 3D-IQA is to evaluate the two views of the stereoscopic images, disparity/depth images separately by 2D metrics, and then combined into an overall score. Benoit et al. presented a linear combination solution for disparity distortion and 2D image quality on both views [11]. You et al. integrated the disparity information into quality assessment, and investigated the capabilities of some combination schemes [12]. Ha et al. designed a quality assessment method by considering the factors of temporal variation and disparity distribution [13]. Hewage et al. performed the evaluation for 3D video by using the extracted information from the depth maps and color images [14]. Obviously, it is not effective to assess the quality of perceived depth using image quality assessment methods (e.g., SSIM), because stimuli toward perceived depth are different with those for 2D image quality.

From another point of view, visual perceptual properties (e.g., monocular and binocular properties) were other important cues in 3D-IQA. Maalouf et al. computed the ‘Cyclopean’ image from left and right images to simulate the brain perception, and used contrast sensitivity coefficients of cyclopean image as the basis of evaluation [15]. Lin et al. utilized binocular integration (i.e., binocular combination and the binocular frequency integration) behaviors as the bases for measuring the quality of stereoscopic 3D images [16]. Ryu et al. formulated a model for stereoscopic images based on binocular perception model considering asymmetric properties of stereoscopic images [17]. Ko et al. proposed a structural distortion parameter based binocular perception model for 3D image quality assessment [18]. Wang et al. proposed a metric by considering the binocular spatial sensitivity to reflect the binocular fusion and suppression properties [19]. Bensalma et al. proposed a Binocular Energy Quality Metric (BEQM) by modeling the simple cells responsible for the local spatial frequency analysis and the complex cells responsible for the generation of the binocular energy [20]. However, these methods are simple extensions of the monocular visual properties into the binocular vision, and how these monocular visual properties affect the binocular vision is still not accounted.

From the observation of the existing 3D-IQA metrics, both image quality and depth perception are expected to measure the QoE of 3D. However, the combination of these two parts is somewhat ill-defined in the existing metrics, since disparity map is estimated from the stereoscopic image, while the perception of depth from disparity is generally not well understood. In order to tackle the problem, we propose a binocular energy response based stereoscopic image quality assessment metric in this paper. The main contributions of this work are as follows: (1) We construct binocular energy responses of the original and distorted images based on Gabor filters and disparity information, and measure the similarity between them as Image Quality Metric (IQM); (2) By considering the binocular response and binocular masking characteristics, we use the binocular energy and the binocular just noticeable difference (BJND) components to modulate the IQM, respectively; (3) Two evaluation results are nonlinearly integrated into an overall score by considering the importance of each component. The rest of the paper is organized as follows. Section 2 discusses the background and motivation of the proposed metric. Section 3 presents the proposed quality assessment metric. The experimental results are analyzed in Section 4, and finally conclusions are drawn in Section 5.

Section snippets

Background and motivation

In order to explain the ideas of the proposed 3D-IQA metric, we first review some relevant works and point out the problems, and then present our innovation in the design of 3D-IQA.

Proposed quality assessment metric

The framework of the proposed quality assessment metric is illustrated in Fig. 3. Given the original and distorted stereoscopic images, the corresponding binocular energy responses are generated based on Gabor filter responses and the estimated disparity maps. Then, the similarity between the binocular energy responses of the original and distorted stereoscopic images is measured as Image Quality Metric (IQM). Finally, the binocular energy and binocular masking (e.g., BJND) features are used to

Stereoscopic image quality databases

In the experiment, we have used two databases, NBU 3D IQA database [31], [32] and LIVE 3D IQA database [9], to verify the performance of the proposed metric. The NBU 3D IQA database has been released and can be downloaded at http://cise.nbu.edu.cn/MPC-lab/resourse.htm. The NBU 3D IQA database consists of 312 distorted stereoscopic pairs generated from 12 reference images. Five types of distortions are symmetrically applied to the reference images at various levels: Gaussian Blur (60), White

Conclusions

This paper has presented a quality assessment method of stereoscopic image based on binocular energy response. Compared with the existing two-dimensional (2D) metrics, the technical contribution of the proposed method is that we try to use binocular energy and binocular masking features to quantify the binocular visual characteristics. The prominent advantage of the proposed method is as follows: 1) We construct binocular energy responses of the original and distorted images, respectively, and

Acknowledgements

This work was supported by the National Natural Science Foundation of China (grant 61271021, 61271270, U1301257), the Natural Science Foundation of Zhejiang Province (grant Y1111061). It was also sponsored by K.C. Wong Magna Fund in Ningbo University.

Feng Shao received his B.S. and Ph.D. degrees from Zhejiang University, Hangzhou, China, in 2002 and 2007, respectively, all in Electronic Science and Technology. He is currently an Associate Professor in Faculty of Information Science and Engineering, Ningbo University, China. He was a visiting Fellow with the School of Computer Engineering, Nanyang Technological University, Singapore, from February 2012 to August 2012. His research interests include 3D video coding, 3D quality assessment, and

References (36)

  • W. Chen et al.

    Quality of experience model for 3DTV

  • L. Goldmann et al.

    A comprehensive database and subjective evaluation methodology for quality of experience in stereoscopic video

  • A. Benoit et al.

    Quality assessment of stereoscopic images

    EURASIP J. Image Video Process.

    (2008)
  • J. You et al.

    Perceptual quality assessment for stereoscopic images based on 2D image quality metrics and disparity analysis

  • K. Ha et al.

    A perceptual quality assessment metric using temporal complexity and disparity information for stereoscopic video

  • C.T.E.R. Hewage et al.

    Quality evaluation of color plus depth map-based stereoscopic video

    IEEE J. Sel. Top. Signal Process.

    (April 2009)
  • A. Maalouf et al.

    CYCLOP: a stereo color image quality assessment metric

  • Y. Lin et al.

    Quality assessment of stereoscopic 3D image compression by binocular integration behaviors

    IEEE Trans. Image Process.

    (April 2014)
  • Cited by (20)

    • Binocular spatial activity and reverse saliency driven no-reference stereopair quality assessment

      2017, Signal Processing: Image Communication
      Citation Excerpt :

      While these early advances on the NR S3D IQA problem are promising, further improvements are possible by better accounting for human stereo perception mechanisms. Algorithms for conducting S3D IQA have been developed by modeling a variety of binocular vision characteristics, e.g., binocular energy responses [17], gain-control mechanisms [18], models of visual information extraction [10] and of Gabor filter responses [19]. However, there is still room for improvement beyond available S3D IQA models with regards to correlation with human subjective judgments.

    • Stereoscopic image quality assessment method based on binocular combination saliency model

      2016, Signal Processing
      Citation Excerpt :

      Maalouf et al. [33] integrated the left and right images into a cyclopean image to simulate human brain perception to derive a quality index. Shao et al. [34], based on binocular energy responses, proposed another effective quality assessment method of stereoscopic images. What׳s more, Chen et al. [35] addressed binocular rivalry issues by modeling the binocular suppression behavior and developed an effective model to measure the quality of stereoscopic images.

    • Cooperative content and radio resource allocation for visual information maximization in a digital signage scenario

      2015, Digital Signal Processing: A Review Journal
      Citation Excerpt :

      The volume of depth information is mainly determined by the degree of pixel disparity. However, when the pixel disparity is excessively large, the stereoscopic 3D pair cannot be fused as clear single vision (a cyclopean image); rather, it is perceived as double vision (diplopia) [7,8]. The depth range within which the binocular vision system is able to fuse the stereoscopic 3D pair to form a cyclopean image is affected by angular disparity, with the horopter (fixation) considered as the center.

    • Stereoscopic Image Quality Assessment Method Using Projection-Based Weight Normalization

      2020, Tianjin Daxue Xuebao (Ziran Kexue yu Gongcheng Jishu Ban)/Journal of Tianjin University Science and Technology
    • Non-Reference Stereoscopic Image Quality Evaluation Based on Fusion Image

      2019, Tianjin Daxue Xuebao (Ziran Kexue yu Gongcheng Jishu Ban)/Journal of Tianjin University Science and Technology
    View all citing articles on Scopus

    Feng Shao received his B.S. and Ph.D. degrees from Zhejiang University, Hangzhou, China, in 2002 and 2007, respectively, all in Electronic Science and Technology. He is currently an Associate Professor in Faculty of Information Science and Engineering, Ningbo University, China. He was a visiting Fellow with the School of Computer Engineering, Nanyang Technological University, Singapore, from February 2012 to August 2012. His research interests include 3D video coding, 3D quality assessment, and image perception, etc.

    Gangyi Jiang received his M.S. degree from Hangzhou University in 1992, and received his Ph.D. degree from Ajou University, Korea, in 2000. He is now a Professor in Faculty of Information Science and Engineering, Ningbo University, China. His research interests mainly include digital video compression, multi-view video coding, etc.

    Mei Yu received her M.S. degree from Hangzhou Institute of Electronics Engineering, China, in 1993, and Ph.D. degree from Ajou University, Korea, in 2000. She is now a Professor in Faculty of Information Science and Engineering, Ningbo University, China. Her research interests include image/video coding and video perception.

    Fucui Li received her M.S. degree from HeFei University of Technology, China, in 2004, and is currently pursuing the Ph.D. degree in Ningbo University, China. Her research interests include image/video coding and video perception.

    Zongju Peng received his B.S. degree from Sichuan Normal College, China, in 1995, and M.S. degree from Sichuan University, China, in 1998, and received his Ph.D. degree from Institute of Computing Technology, Chinese Academy of Science, in 2010. He is now an Associate Professor in Faculty of Information Science and Engineering, Ningbo University, China. His research interests mainly include image/video compression, 3D video coding and video perception.

    Randi Fu received his M.S. degree from the PLA Information Engineering University, China, in 2001. He is now an Associate Professor in Faculty of Information Science and Engineering, Ningbo University, China. His research interests mainly include remote sensing image processing and pattern recognition.

    View full text