ABSTRACT
Video applications continue to gain a lot of traction and have an enormous demand. As a result, there is a strong demand to decrease the video transmission bitrate without reducing visual quality [1],[2]. The efforts to achieve bitrate savings, especially for high-definition video content started in 2010, when the ITU-T Video Coding Experts Group (VCEG) and ISO/IEC Moving Pictures Expert Group (MPEG) established a Joint Collaborative Team on Video Coding (JCT-VC) to work on the High-Efficiency Video Coding (HEVC) standard. Then, in 2013, the 1st version of the HEVC specification was approved by ITU-T as Recommendation H.265 and by ISO/IEC as MPEG-H, Part 2 [3]. When developing the H.265/MPEG-HEVC standard, high-resolution and high frame-rate video coding was considered as one of its main potential application scenarios, while keeping it applicable to almost all existing use cases that were already targeted by H.264/MPEG-AVC.
However, more efficient video compression techniques were still desired, especially for streaming Ultra HD video content as well as Panorama video content (so called, 360° video content) from concerts, shows, sport events, etc. Therefore, in order to fulfill this demand, the exploration phase for future video coding technologies beyond HEVC (ITU-T H.265 | ISO/IEC 23008-2) started in October 2015 by establishing a Joint Video Exploration Team (JVET) on Future Video Coding of ITU-T VCEG and ISO/IEC MPEG [4]. The development process of the video coding standard beyond HEVC was driven by the most recent scientific and technological achievements in the video coding field, and under the JVET development it was titled "Versatile Video Coding" [5], or in short, VVC [6]. The 1st version of the VVC standard (i.e. VVC v1) was officially finalized during the 19th JVET meeting, which took place between June 22 and July 1, 2020, and it was approved by ITU-T as Recommendation H.266 and by ISO/IEC as MPEG-I, Part 3 [6].
While the joint video coding standardization activities of ITU-T and ISO/IEC organizations rely on an open and collaborative process driven by its active members, several companies individually developed their own video coding formats [7], [8]. Such in 2015, the Alliance for Open Media(AOM) was formed with the objective to work towards next-generation media formats in general and with a particular short-term focus on the development of a video-coding scheme [9],[10],[11]. In April 2016, AOM released a baseline version of the developed video-coding scheme, which got the name of AV1. In turn, the final version of the 1st edition of AV1 released in 2018, claiming to provide a significant coding-efficiency gain over the current state-of-the-art video codecs [10]. In May 2020, AV1 codec had a major update and its 2nd edition, i.e. AV1 2.0, was released [12],[13]. In turn, AV1 3.0 with better compression performance was released in 2021 [12],[13]. However, experimental results related to coding efficiency comparison of AV1 versus standardized codecs such as HEVC and VVC, which are reported in the literature [14], are not consistent and are sometimes even contradictory. As a result, there is a lot of confusion about the ability of future versions of AV1 to compete with HEVC/VVC-based encoder implementations.
In order to put things into perspective and to provide relevant information in a reproducible and reliable form, this work presents detailed experimental results of a coding-efficiency comparison of AV1 (aomenc) v3.6 [12] versus VVC-based codecs -the open-source VVenC codec v1.7 [15] and VVC reference software model (VTM) v17.0, and versus HEVC-based codecs - the open-source x265 codec v3.5 [16],[17] and HEVC reference software model (HM) v17.0 [18], along with a detailed discussion of the selected software implementations, the choice of coding parameters, and the corresponding evaluation setup. All tested encoders have been set to the best quality operation modes (i.e. the slowest encoding options) with a Group of Pictures (GOP) size equal to 16. Specifically, in this work, the authors focus on 4K and 1080p video content (selected from JVET CTC [19]), since it is considered to be the most popular nowadays, especially within the consumer multimedia applications, and encoding such content is typically the most challenging due to significantly larger computational complexity in terms of encoding times compared to lower video resolutions. For the rate-distortion (R-D) performance assessment, the authors used a Bjøntegaard-Delta bit-rate (BD-BR) measurement method for calculating average bit-rate differences between R-D curves for the same objective quality (e.g., for the same PSNRYUV) [20].
According to the experimental results, the coding efficiency of both AV1 and x265 was found to be significantly inferior to VVenC, while having the overhead of 31.2% and 118%, respectively. In terms of the encoding speed, the typical encoding time of AV1 is ~5 times slower than HM, and ~2 times slower than VVenC. On the other hand, the encoding time of x265 for the given best quality mode configuration is similar to that of HM, while HM provides bitrate savings of more than 22%. When compared to VTM, the HM runtime is more than 12 times faster, but with an overhead of ~52%.
- "Cisco Visual Networking Index: Forecast and Methodology, 2018--2023", Online: https://www.cisco.com/c/en/us/solutions/collateral/executive-perspectives/annual-internet-report/white-paper-c11-741490.pdf, Cisco Systems Inc., 9 Mar. 2020.Google Scholar
- D. Grois, and A. Giladi, "Perceptual quantization matrices for high dynamic range H.265/MPEG-HEVC video coding", Proc. SPIE 11137, Applications of Digital Image Processing XLII, 111370O, 2020.Google ScholarCross Ref
- ITU-T, Recommendation H.265 (04/13), Series H: Audiovisual and Multimedia Systems, Infrastructure of audiovisual services - Coding of Moving Video, High Efficiency Video Coding.Google Scholar
- JVET JEM reference software, Online: https://jvet.hhi.fraunhofer.de/svn/svn_HMJEMSoftware/Google Scholar
- JVET VTM refence software, Online: https://vcgit.hhi.fraunhofer.de/jvet/VVCSoftware_VTMGoogle Scholar
- ISO/IEC 23090-3:2021, "Information technology --- Coded representation of immersive media --- Part 3: Versatile video coding", Online: https://www.iso.org/standard/73022.htmlGoogle Scholar
- D. Grois, D. Marpe, A. Mulayoff, B. Itzhaky, and O. Hadar, "Performance comparison of H.265/MPEG-HEVC, VP9, and H.264/MPEG-AVC encoders," Picture Coding Symposium (PCS), 2013, pp.394--397, 8--11 Dec. 2013.Google Scholar
- D. Grois, D. Marpe, T. Nguyen, and O. Hadar, "Comparative Assessment of H.265/MPEG-HEVC, VP9, and H.264/MPEG-AVC Encoders for Low-Delay Video Applications", Proc. SPIE 9217, Applications of Digital Image Processing XXXVII, 92170Q, Sept. 23, 2014.Google Scholar
- D. Grois, T. Nguyen, and D. Marpe, "Coding Efficiency Comparison of AV1/VP9, H.265/MPEG-HEVC, and H.264/MPEG-AVC Encoders," Picture Coding Symposium (PCS), 2016, 4--7 Dec. 2016.Google Scholar
- Alliance for Open Media, Press Release Online: http://aomedia.org/press-release/Google Scholar
- D. Grois, T. Nguyen, and D. Marpe, "Performance comparison of AV1, JEM, VP9, and HEVC encoders," Proc. SPIE 10396, Applications of Digital Image Processing XL, 103960L, 2018.Google ScholarCross Ref
- Alliance for Open Media, aomenc git repository, Online at: https://aomedia.googlesource.com/aomGoogle Scholar
- D. Grois et al., "Performance Comparison of Emerging EVC and VVC Video Coding Standards with HEVC and AV1," in SMPTE Motion Imaging Journal, vol. 130, no. 4, pp. 1--12, May 2021.Google ScholarCross Ref
- Y. Liu, "AV1 beats x264 and libvpx-vp9 in practical use case", https://engineering.fb.com/2018/04/10/video-engineering/av1-beats-x264-and-libvpx-vp9-in-practical-use-case/Google Scholar
- "VVenC Software," Online: https://github.com/fraunhoferhhi/vvenc.Google Scholar
- Projects from VideoLAN, x265 software library and application, Online: https://www.videolan.org/developers/x265.html.Google Scholar
- x265 Documentation, Online: https://x265.readthedocs.io/en/stable/index.htmlGoogle Scholar
- JCT-VC HM refence software, Online: https://hevc.hhi.fraunhofer.de/Google Scholar
- F. Bossen, J. Boyce, X. Li, and V. Seregin, K. Sühring, "JVET common test conditions and software reference configurations for SDR video," document JVET-N1010, 14th JVET meeting, Geneva, CH, 19--27 Mar. 2019.Google Scholar
- G. Bjøntegaard, "Calculation of average PSNR differences between RD-curves", ITU-T Q.6/SG16 VCEG 13th Meeting, Document VCEG-M33, Austin, USA, Apr. 2001.Google Scholar
Index Terms
- Performance Assessment of AV1, x265 and VVenC Open-Source Encoder Implementations Compared to VVC and HEVC Reference Software Models
Recommendations
Novel Histogram-Based Scene Change Detection Scheme for x265 Open-Source Video Encoder
MHV '23: Proceedings of the 2nd Mile-High Video ConferenceThere is an increasing need in bandwidth requirements, particularly, due to the increase in the device resolution [1]. As a result, there is a strong demand to decrease the video transmission bitrate without reducing visual quality. In 2010, a Joint ...
Novel Motion-Compensated Spatio-Temporal Filtering Scheme for x265 Open-Source Video Encoder
MHV '23: Proceedings of the 2nd Mile-High Video ConferenceThere is a strong demand to decrease the video transmission bitrate without reducing visual quality [1]. The x265 encoder [2]-[4] is a popular open-source encoder, which generates bitstreams compliant with the H.265/MPEG-HEVC video coding standard [5]. ...
Bitrate and Adaptive Streaming: What are We Measuring and Why?
MHV '23: Proceedings of the 2nd Mile-High Video ConferenceCurrently, a significant increase in bandwidth requirements is expected in the next couple of years, particularly due to an increase in the device resolution [1]: a typical bit-rate for the 4K video is between 15 to 18 Mbps, while it is considered to be ...
Comments