Loading [MathJax]/extensions/MathMenu.js
Learning a Single Tucker Decomposition Network for Lossy Image Compression With Multiple Bits-per-Pixel Rates | IEEE Journals & Magazine | IEEE Xplore

Learning a Single Tucker Decomposition Network for Lossy Image Compression With Multiple Bits-per-Pixel Rates


Abstract:

Lossy image compression (LIC), which aims to utilize inexact approximations to represent an image more compactly, is a classical problem in image processing. Recently, de...Show More

Abstract:

Lossy image compression (LIC), which aims to utilize inexact approximations to represent an image more compactly, is a classical problem in image processing. Recently, deep convolutional neural networks (CNNs) have achieved interesting results in LIC by learning an encoder-quantizer-decoder network from a large amount of data. However, existing CNN-based LIC methods generally train a network for a specific bits-per-pixel (bpp). Such a “one-network-per-bpp” problem limits the generality and flexibility of CNNs to practical LIC applications. In this paper, we propose to learn a single CNN which can perform LIC at multiple bpp rates. A simple yet effective Tucker Decomposition Network (TDNet) is developed, where there is a novel tucker decomposition layer (TDL) to decompose a latent image representation into a set of projection matrices and a core tensor. By changing the rank of core tensor and its quantization, we can easily adjust the bpp rate of latent image representation within a single CNN. Furthermore, an iterative non-uniform quantization scheme is presented to optimize the quantizer, and a coarse-to-fine training strategy is introduced to reconstruct the decompressed images. Extensive experiments demonstrate the state-of-the-art compression performance of TDNet in terms of both PSNR and MS-SSIM indices.
Published in: IEEE Transactions on Image Processing ( Volume: 29)
Page(s): 3612 - 3625
Date of Publication: 09 January 2020

ISSN Information:

PubMed ID: 31940535

Funding Agency:


I. Introduction

As an indispensable step in many image processing applications, lossy image compression (LIC) is a classical yet still active topic. The goal of LIC is to reduce the image storage space without sacrificing much the image quality, and thus provide an economic solution to image storage and transmission systems. Recently, with the development of portable imaging devices and social media (i.e., Facebook), billions of images are transmitted and stored daily on social networks [1]. The explosive growth of the amount of shared images on Internet raises higher requirements on LIC for more effective visual communication systems.

Contact IEEE to Subscribe

References

References is not available for this document.