Wavelet-Based Learned Scalable Video Coding | IEEE Conference Publication | IEEE Xplore

Wavelet-Based Learned Scalable Video Coding


Abstract:

Scalability is an important requirement for video coding when coded videos stream over dynamic-bandwidth networks. The state-of-the-art scalable video coding schemes adop...Show More

Abstract:

Scalability is an important requirement for video coding when coded videos stream over dynamic-bandwidth networks. The state-of-the-art scalable video coding schemes adopt layer-based methods upon H.265, represented by the SHVC standard. Compared to layer-based schemes, wavelet-based schemes were suspected less efficient for a long while. We try to improve the compression efficiency of wavelet-based scalable video coding by leveraging the recent progresses of deep learning. First, we propose an entropy coding method, using trained convolutional neural networks (CNNs) for probability estimation, to compress the wavelet-transformed subbands. Second, we design a CNN-based method for inverse temporal wavelet transform. We integrate the two proposed methods into a traditional wavelet-based scalable video coding scheme, named Interframe-EZBC. The two methods together achieve more than 20% bits savings. Then, our scheme outperforms the SHVC reference software by 9.09%, 6.55%, and 8.66% BD-rate reductions in YUV respectively.
Date of Conference: 27 May 2022 - 01 June 2022
Date Added to IEEE Xplore: 11 November 2022
ISBN Information:

ISSN Information:

Conference Location: Austin, TX, USA
References is not available for this document.

I. Introduction

Video streaming is a fundamental technology for many video-based applications, such as live streaming, video on demand, real-time communications, and so on. In order to enable users to obtain the best video quality according to their current network bandwidth when using video streaming-based applications, scalable video coding technologies are believed promising. Scalable video coding compresses videos into coarse-to-fine hierarchical bitstreams. Users can determine the amount of bits used to decode videos according to their current bandwidth, which enables that the user can get the best possible video quality.

Select All
1.
B.-J. Kim and W. A. Pearlman, "An embedded wavelet video coder using three-dimensional set partitioning in hierarchical trees (SPIHT)" in DCC, IEEE, pp. 251-260, 1997.
2.
Y. Wu, K. Hanke, T. Rusert and J. W. Woods, "Enhanced MC-EZBC scalable video coder", IEEE Transactions on Circuits and Systems for Video Technology, vol. 18, no. 10, pp. 1432-1436, 2008.
3.
A. Secker and D. Taubman, "Lifting-based invertible motion adaptive transform (LIMAT) framework for highly scalable video compression", IEEE Transactions on Image Processing, vol. 12, no. 12, pp. 1530-1542, 2003.
4.
H. Schwarz, D. Marpe and T. Wiegand, "Overview of the scalable video coding extension of the H.264/AVC standard", IEEE Transactions on Circuits and Systems for Video Technology, vol. 17, no. 9, pp. 1103-1120, 2007.
5.
J. M. Boyce, Y. Ye, J. Chen and A. K. Ramasubramonian, "Overview of SHVC: Scalable extensions of the high efficiency video coding standard", IEEE Transactions on Circuits and Systems for Video Technology, vol. 26, no. 1, pp. 20-34, 2015.
6.
S.-T. Hsiang and J. W. Woods, "Embedded video coding using invertible motion compensated 3-D subband/wavelet filter bank", Signal Processing: Image Communication, vol. 16, no. 8, pp. 705-724, 2001.
7.
D. Taubman, "High performance scalable image compression with EBCOT", IEEE Transactions on Image Processing, vol. 9, no. 7, pp. 1158-1170, 2000.
8.
J. M. Shapiro, "An embedded hierarchical image coder using zerotrees of wavelet coefficients", DCC, pp. 214-223, 1993.
9.
H. Ma, D. Liu, R. Xiong and F. Wu, "A CNN-based image compression scheme compatible with JPEG-2000" in ICIP, IEEE, pp. 704-708, 2019.
10.
H. Ma, D. Liu, N. Yan, H. Li and F. Wu, "End-to-end optimized versatile image compression with wavelet-like transform", IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 44, no. 3, pp. 1247-1263, 2022.
11.
Y. Liu and J. W. Woods, "New and efficient interframe extensions of EZBC and JPEG 2000", MMSP, pp. 1-6, 2017.
12.
F. Auli-Llinas, "2-step scalar deadzone quantization for bitplane image coding", IEEE Transactions on Image Processing, vol. 22, no. 12, pp. 4678-4688, 2013.
13.
Z. Cheng, H. Sun, M. Takeuchi and J. Katto, "Learned image compression with discretized gaussian mixture likelihoods and attention modules", CVPR, pp. 7939-7948, 2020.
14.
D. Ma, F. Zhang and D. Bull, "BVI-DVC: A training database for deep video compression", IEEE Transactions on Multimedia, 2021.
15.
F. Bosson, "Common HM test conditions and software reference configurations", Document JCTVC-L1100 12th JCT-VC Meeting, January 2013.

Contact IEEE to Subscribe

References

References is not available for this document.