DOI: 10.1145/3394171.3413717

Research Article

RIRNet: Recurrent-In-Recurrent Network for Video Quality Assessment

Published: 12 October 2020

Abstract

Video quality assessment (VQA), which automatically predicts the perceptual quality of a video, especially when no reference information is available, has become a major concern for video service providers due to the growing demand by end users for video quality of experience (QoE). While recent deep learning techniques have achieved significant advances, they often produce misleading results in VQA tasks because they describe 3D spatio-temporal regularities at only a single, fixed temporal frequency. Partially inspired by psychophysical and vision science studies revealing the speed-tuning property of neurons in the visual cortex during motion perception (i.e., sensitivity to different temporal frequencies), we propose a novel no-reference (NR) VQA framework named Recurrent-In-Recurrent Network (RIRNet), which incorporates this characteristic to promote an accurate representation of motion perception in the VQA task. By efficiently fusing motion information derived from different temporal frequencies, the resulting temporal modeling scheme quantifies the temporal motion effect via a hierarchical distortion description. The proposed framework agrees more closely with the quality perception of distorted videos because it integrates concepts from motion perception in the human visual system (HVS), manifested in a network structure composed of low- and high-level processing. A holistic validation of our method on four challenging video quality databases demonstrates superior performance over state-of-the-art methods.
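The recurrent-in-recurrent idea in the abstract, inner recurrences over the frame sequence sampled at several temporal frequencies, and an outer recurrence fusing the per-frequency summaries into one quality score, can be illustrated with a minimal NumPy sketch. It assumes GRU cells (the kind of recurrent unit commonly used for such temporal models) and pre-extracted per-frame features; the layer sizes, subsampling strides, and linear scoring head are illustrative assumptions, not the authors' exact RIRNet architecture.

```python
import numpy as np

rng = np.random.default_rng(0)

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

class GRUCell:
    """Minimal GRU cell operating on 1-D feature vectors."""
    def __init__(self, in_dim, hid_dim):
        s = 1.0 / np.sqrt(hid_dim)
        # stacked weights for the update (z), reset (r) and candidate gates
        self.W = rng.uniform(-s, s, (3, hid_dim, in_dim))
        self.U = rng.uniform(-s, s, (3, hid_dim, hid_dim))
        self.b = np.zeros((3, hid_dim))
        self.hid_dim = hid_dim

    def run(self, xs):
        h = np.zeros(self.hid_dim)
        for x in xs:
            z = sigmoid(self.W[0] @ x + self.U[0] @ h + self.b[0])
            r = sigmoid(self.W[1] @ x + self.U[1] @ h + self.b[1])
            h_cand = np.tanh(self.W[2] @ x + self.U[2] @ (r * h) + self.b[2])
            h = (1 - z) * h + z * h_cand
        return h  # final hidden state summarises the sequence

class RecurrentInRecurrent:
    """Inner GRUs each summarise the frame-feature sequence subsampled at one
    temporal frequency; an outer GRU fuses the per-frequency summaries (coarse
    to fine) into a single clip descriptor, mapped to a scalar quality score."""
    def __init__(self, feat_dim, hid_dim, strides=(4, 2, 1)):
        self.strides = strides  # hypothetical temporal subsampling rates
        self.inner = [GRUCell(feat_dim, hid_dim) for _ in strides]
        self.outer = GRUCell(hid_dim, hid_dim)
        self.head = rng.uniform(-0.1, 0.1, hid_dim)  # linear scoring head

    def predict(self, frame_feats):
        # one summary vector per temporal frequency
        summaries = [cell.run(frame_feats[::st])
                     for st, cell in zip(self.strides, self.inner)]
        fused = self.outer.run(summaries)  # outer recurrence over frequencies
        return float(self.head @ fused)    # scalar quality estimate

# toy usage: 32 frames of 16-D (hypothetical) frame features
feats = rng.standard_normal((32, 16))
model = RecurrentInRecurrent(feat_dim=16, hid_dim=8)
score = model.predict(feats)
print(isinstance(score, float))  # prints True
```

The weights here are random, so the score is meaningless until trained; the sketch only shows how the two recurrence levels nest and how multiple temporal frequencies feed one prediction.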

Supplementary Material

MP4 File (3394171.3413717.mp4)
This is a video presentation of the oral paper "RIRNet: Recurrent-In-Recurrent Network for Video Quality Assessment" at the MM '20 conference. It gives a brief background introduction and framework description; for more details, please refer to the paper.





Published In

cover image ACM Conferences
MM '20: Proceedings of the 28th ACM International Conference on Multimedia
October 2020
4889 pages
ISBN:9781450379885
DOI:10.1145/3394171
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]


Publisher

Association for Computing Machinery

New York, NY, United States



Author Tags

  1. motion perception
  2. speed tuning
  3. temporal frequency
  4. video quality assessment

Qualifiers

  • Research-article

Funding Sources

  • Key Project of Shanxi Provincial Department of Education (Collaborative Innovation Center)
  • Science and Technology Plan of Xi'an
  • National Natural Science Foundation of China
  • Natural Science Foundation of Jiangsu Province
  • Six Talent Peaks High-level Talents in Jiangsu Province

Conference

MM '20

Acceptance Rates

Overall Acceptance Rate 2,145 of 8,556 submissions, 25%


Article Metrics

  • Downloads (last 12 months): 65
  • Downloads (last 6 weeks): 3
Reflects downloads up to 16 Feb 2025

Cited By
  • (2025) Subjective and Objective Quality Assessment of Colonoscopy Videos. IEEE Transactions on Medical Imaging, Vol. 44, 2 (2025), 841--854. DOI: 10.1109/TMI.2024.3461737
  • (2025) A no-reference video quality assessment method with bidirectional hierarchical semantic representation. Signal Processing, Vol. 230 (2025), 109819. DOI: 10.1016/j.sigpro.2024.109819
  • (2025) Luminance decomposition and reconstruction for high dynamic range Video Quality Assessment. Pattern Recognition, Vol. 158 (2025), 111011. DOI: 10.1016/j.patcog.2024.111011
  • (2024) Highly Efficient No-reference 4K Video Quality Assessment with Full-Pixel Covering Sampling and Training Strategy. In Proc. 32nd ACM International Conference on Multimedia, 9913--9922. DOI: 10.1145/3664647.3680907
  • (2024) Semantic-Aware and Quality-Aware Interaction Network for Blind Video Quality Assessment. In Proc. 32nd ACM International Conference on Multimedia, 9970--9979. DOI: 10.1145/3664647.3680598
  • (2024) Quantizing Neural Networks with Knowledge Distillation for Efficient Video Quality Assessment. In Proc. IEEE International Conference on Visual Communications and Image Processing (VCIP), 1--5. DOI: 10.1109/VCIP63160.2024.10849844
  • (2024) Blind Video Quality Prediction by Uncovering Human Video Perceptual Representation. IEEE Transactions on Image Processing, Vol. 33 (2024), 4998--5013. DOI: 10.1109/TIP.2024.3445738
  • (2024) A Spatial-Temporal Video Quality Assessment Method via Comprehensive HVS Simulation. IEEE Transactions on Cybernetics, Vol. 54, 8 (2024), 4749--4762. DOI: 10.1109/TCYB.2023.3338615
  • (2024) Video Quality Assessment for Online Processing: From Spatial to Temporal Sampling. IEEE Transactions on Circuits and Systems for Video Technology, Vol. 34, 12 (2024), 13441--13451. DOI: 10.1109/TCSVT.2024.3450085
  • (2024) Deep Learning Approach for No-Reference Screen Content Video Quality Assessment. IEEE Transactions on Broadcasting, Vol. 70, 2 (2024), 555--569. DOI: 10.1109/TBC.2024.3374042
