Full-reference IPTV image quality assessment by deeply learning structural cues

https://doi.org/10.1016/j.image.2020.115779Get rights and content

Highlights

  • We leverage structural cues to calculate the similarity between the original image and the test one.

  • A novel distance metric is proposed to measure the difference between the reference image and each test image.

  • We propose a structure-preserved deep neural networks to extract deep representation from these images.

Abstract

Image quality assessment (IQA) attempts to quantify the quality-aware visual attributes perceived by humans. They can be divided into subjective and objective image quality assessment. Subjective IQA algorithms rely on human judgment of image quality, where the human visual perception functions as the dominant factor However, they cannot be widely applied in practice due to the heavy reliance on different individuals. Motivated by the fact that objective IQA largely depends on image structural information, we propose a structural cues-based full-reference IPTV IQA algorithm. More specifically, we first design a grid-based object detection module to extract multiple structural information from both the reference IPTV image (i.e., video frame) and the test one. Afterwards, we propose a structure-preserved deep neural networks to generate the deep representation for each IPTV image. Subsequently, a new distance metric is proposed to measure the similarity between the reference image and the evaluated image. A test IPV image with a small calculated distance is considered as a high quality one. Comprehensive comparative study with the state-of-the-art IQA algorithms have shown that our method is accurate and robust.

Introduction

Image quality assessment (IQA) is an indispensable technique in computer vision and pattern recognition, such as image clustering, retrieval, compression and classification. The target of IQA is to assess image quality objectively based on some designed algorithm, which attempts to mimic human visual perception. That means the IQA score obtained by the algorithm should approximate the visual quality assessment by human beings [1]. Nowadays, IQA plays a significant role in modern advertising systems as well as photo sharing platform. For example, IQA-guided recommendation systems can display photographs or video clips according to users’ favorite content. This function replaces the traditional manual selection of various sections of website. In addition, IQA-guided image sharing platform (e.g., Flickr and Google Picasa) can intelligently construct user groups by analyzing the quality tendency of the uploaded images.

Image quality is determined by multiple factors, e.g., color distribution, illumination, and objects, within each image. For example, harmonic color distribution can reflect the high image quality, while low-resolution or illumination images typically exhibit poor quality. As shown in Fig. 1, the aesthetically pleasing illumination and color harmony will lead to a high image quality. The color hue wheels proposed by Luo et al. [2] can quantify the color harmony within each image. They argue that an image can be considered as harmonic if its hue overlaps most regions in the pre-defined hue wheels. In their work, seven different hue wheels were proposed, each of which represents a particular color harmony sample. The hue of image is related to illumination and color matching. Images with harmonic hue are deemed as high quality.

In general, images with various visual distortion have low quality score, e.g., images with the Gaussian blur, salt-pepper impulsive noise. Subjective IQA can conveniently distinguish different quality scores of images. However, this is still a challenge task for objective IQA algorithms which aim to design mathematical models that mimic human visual perception of different images. Realizing that structural cues within each image are perceived by humans when they viewing scenes, we consider that structural cues play the most important role in IQA. As far as we know, however, such informative attribute has not been explicitly encoded. Besides, some other limitations of the conventional IQA algorithms are as follows:

  • (1)

    Existing designed mathematical model cannot reflect human visual perception effectively. Thus, the results of objective IQA algorithms are far from understanding human visual perception.

  • (2)

    A carefully-designed distance metric is significant to IQA, since the inherent differences between the distorted images and the original one might be difficult to distinguish. Nevertheless, conventional IQA methods cannot exploit distance metric effectively.

To solve these limitations, we propose a structural cues based full-reference IQA algorithm. Specifically, we first propose a grid-based object detection algorithm to extract structural clues from both the reference image and the evaluated one. Afterwards, we propose a structure-reservation based deep neural network to generate the deep feature. Subsequently, a new distance metric is constructed to calculate similarity between reference image and evaluated image. A test image with a small calculated distance is deemed as high quality and thus will be assigned with a high score. Extensive experimental results on popular IQA databases have demonstrated the usefulness of our method. The main contributions of our proposed IQA method are as follows: (1) We leverage structural cues to calculate the similarity between the original image and the test one, where both local and global image features are exploited. (2) A novel distance metric is proposed to measure the difference between the reference image and each test image. (3) We propose a structure-preserved deep neural networks to extract deep representation from these images, where image structural feature can be well preserved.

Section snippets

Related work

IQA can be roughly grouped into two categories: subjective IQA and objective IQA. The former methods rely on human participation, that is, subjective IQA should optimally reflect human visual perception of various images. These methods are based on statistical models, where observers should be closely participated in during IQA in order to guarantee the statistical significance of subjective IQA. Subjective assessment can be further divided into absolute and relative assessment. Generally

Proposed method

In this paper, we propose a novel FR-IQA framework based on image structural cues, where a novel distance metric is designed to calculate the similarity between reference image and evaluated image.

Experimental results

Many IQA algorithms exhibit impressive performance in evaluating the quality of distortion images generated from the same original images. Nevertheless, the effectiveness of these IQA algorithms might decrease when evaluating different types of distorted images or those generated from different original images. In this way, cross-image and cross-distortion validation are significant indexes to evaluate the performance of different IQA algorithms.

Conclusions

In this paper, we propose a deep FR-IQA framework based on exploiting the structural features within each image. More specifically, we design a grid-based object detection algorithm to extract structural information from both the reference image and the evaluated image. Afterward, a structure-preserved deep neural network is formulated to generate the deep representation. Next, a new distance metric is built to calculate the similarity between the reference image and the evaluated image. A test

CRediT authorship contribution statement

YanQiang Kong: Funding acquisition, Investigation, Methodology, Project administration. Liu Cui: Data curation, Formal analysis, Resources, Software. Rui Hou: Writing - original draft, Writing - review and editing, Supervision, Validation.

Funding

The Postdoctoral Innovative Talent Support Program of China, Grant No. BX20180098

The China Postdoctoral Science Foundation, Grant No.2018M640102

National Key R&D Plan: grant No. 2018YFB0605504

Fundamental Research Funds for the Central Universities : grant No. JB2019078

Yan Qiang Kong male, born in 1989, he was received a Ph.D degree of Power Engineering and Engineering Thermophysics, North China Electric Power University, Beijing, China, 2018, June; He was received a master’s degree of Engineering Thermophysics, North China Electric Power University, Beijing, China; His research interests include solar thermal power generation, hydrogen production from renewable sources, and cold end optimization of power station.

References (28)

  • WangZ. et al.

    Video quality assessment based on structural distortion measurement

    Signal Process.: Image Commun.

    (2004)
  • SheikhH.R. et al.

    A statistical evaluation of recent full reference image quality assessment algorithms

    IEEE Trans. Image Process.

    (2006)
  • LuoW. et al.

    Content-based photo quality assessment

  • LinT.Y. et al.

    Microsoft coco: Common objects in context

  • W. Osberger, N. Bergmann, A. Maeder, An automatic image quality assessment technique incorporating high level...
  • R.J. Safranek, J.D. Johnston, A perceptually tuned sub-band image coder with image dependent quantization and...
  • E.C. Larson, D.M. Chandler, Unveiling relationships between regions of interest and image fidelity metrics, in: Proc....
  • U. Engelke, V.X. Nguyen, H.-J. Zepernick, Regional attention to structural degradations for perceptual image quality...
  • Perceptual criteria for image quality evaluation

  • J. Chen, Y. Zhang, L. Liang, S. Ma, R. Wang, W. Gao, A no-reference blocking artifacts metric using selective gradient...
  • WangZ. et al.

    Image quality assessment: from error visibility to structural similarity

    IEEE Trans. Image Process.

    (2004)
  • WangZ. et al.

    Multiscale structural similarity for image quality assessment

  • LiuZ. et al.

    Deep active learning with contaminated tags for image aesthetics assessment

    IEEE Trans. Image Process.

    (2018)
  • ZhangL. et al.

    FSIM: A feature similarity index for image quality assessment

    IEEE Trans. Image Process.

    (2011)
  • Cited by (7)

    • A no-reference underwater image quality evaluator via quality-aware features

      2023, Journal of Visual Communication and Image Representation
    • Stitched image quality assessment based on local measurement errors and global statistical properties

      2021, Journal of Visual Communication and Image Representation
      Citation Excerpt :

      Thus, designing image quality assessment (IQA) for stitched images as an evaluator is highly demanded. IQA has been extensively researched in the last two decades, where a variety of full-reference and no-reference methods are presented to assess the image quality, such as structural similarity index matrix (SSIM) [6], feature-similarity index matrix (FSIM) [7], pixel-wise gradient magnitude similarity (GMS) [8], picture-wise just noticeable difference (JND) [9], distortion identification-based image verity and integrity evaluation (DIIVINE) [10], blind/reference image spatial quality evaluator (BRISQUE) [11], natural image quality evaluator (NIQE) [12], quality assessment based on spatial and spectral entropies (SSEQ) [13], blind image quality indices (BIQI) [14], blind image integrity notator using DCT statistics (BLIINDS-II) [15], etc. Despite the promising performance achieved with these methods in 2D IQA tasks, these methods are unable to capture the stitching artifacts in panoramic images with the following issues: Firstly, distortions of the stitched image are mainly induced by local errors, e.g., misalignment and ghosting.

    View all citing articles on Scopus

    Yan Qiang Kong male, born in 1989, he was received a Ph.D degree of Power Engineering and Engineering Thermophysics, North China Electric Power University, Beijing, China, 2018, June; He was received a master’s degree of Engineering Thermophysics, North China Electric Power University, Beijing, China; His research interests include solar thermal power generation, hydrogen production from renewable sources, and cold end optimization of power station.

    Rui Hou male, born in 1979, 2017 June, he was received a Ph.D degree of communication engineering, Tianjing University, Tianjin, China; 2005 June, he was received a master’s degree of software engineering, NanKai University, Tianjin, China; He has been an PMP member since 2013.. His research interests include mobile platform security, wireless network, IoT, AI, New energy and CCUS.

    No author associated with this paper has disclosed any potential or pertinent conflicts which may be perceived to have impending conflict with this work. For full disclosure statements refer to https://doi.org/10.1016/j.image.2020.115779.

    View full text