Journals & Magazines >IEEE Transactions on Pattern ... >Volume: 44 Issue: 4

Learning Image-Adaptive 3D Lookup Tables for High Performance Photo Enhancement in Real-Time

Download PDF
Download References
Request Permissions
Save to
Alerts

Abstract:

Recent years have witnessed the increasing popularity of learning based methods to enhance the color and tone of photos. However, many existing photo enhancement methods ...Show More

Metadata

Abstract:

Recent years have witnessed the increasing popularity of learning based methods to enhance the color and tone of photos. However, many existing photo enhancement methods either deliver unsatisfactory results or consume too much computational and memory resources, hindering their application to high-resolution images (usually with more than 12 megapixels) in practice. In this paper, we learn image-adaptive 3-dimensional lookup tables (3D LUTs) to achieve fast and robust photo enhancement. 3D LUTs are widely used for manipulating color and tone of photos, but they are usually manually tuned and fixed in camera imaging pipeline or photo editing tools. We, for the first time to our best knowledge, propose to learn 3D LUTs from annotated data using pairwise or unpaired learning. More importantly, our learned 3D LUT is image-adaptive for flexible photo enhancement. We learn multiple basis 3D LUTs and a small convolutional neural network (CNN) simultaneously in an end-to-end manner. The small CNN works on the down-sampled version of the input image to predict content-dependent weights to fuse the multiple basis 3D LUTs into an image-adaptive one, which is employed to transform the color and tone of source images efficiently. Our model contains less than 600K parameters and takes less than 2 ms to process an image of 4K resolution using one Titan RTX GPU. While being highly efficient, our model also outperforms the state-of-the-art photo enhancement methods by a large margin in terms of PSNR, SSIM and a color difference metric on two publically available benchmark datasets. Code will be released at https://github.com/HuiZeng/Image-Adaptive-3DLUT.

Published in: IEEE Transactions on Pattern Analysis and Machine Intelligence ( Volume: 44, Issue: 4, 01 April 2022)

Page(s): 2058 - 2073

Date of Publication: 25 September 2020

ISSN Information:

PubMed ID: 32976094

DOI: 10.1109/TPAMI.2020.3026740

Funding Agency:

Hui Zeng

Department of Computing, The Hong Kong Polytechnic University, Hong Kong

Hui Zeng received the BSc and MSc degrees from the Faculty of Electronic Information and Electrical Engineering, Dalian University of Technology, in 2014 and 2016, respectively. He is currently working toward the PhD degree from the Department of Computing, The Hong Kong Polytechnic University. His research interests include image processing and computer vision.

Jianrui Cai

Department of Computing, The Hong Kong Polytechnic University, Hong Kong

Jianrui Cai received the BSc and MSc degrees from the College of Computer Science and Electronic Engineering, Hunan University, China in 2012 and 2015, respectively. He is currently working toward the PhD degree from the Department of Computing, The Hong Kong Polytechnic University. His research interests include image processing, computational photography and computer vision.

Lida Li

Department of Computing, The Hong Kong Polytechnic University, Hong Kong

Lida Li received the BS and MSc degrees from the School of Software Engineering, Tongji University, Shanghai, China, in 2013 and 2016, respectively. He is currently working toward the PhD degree at the Department of Computing, The Hong Kong Polytechnic University. His research interests include machine learning and face analysis.

Zisheng Cao

Camera Group of DJI Innovations Company, Ltd., Shenzhen, China

Zisheng Cao received the BS and MS degrees from Tsinghua University, in 2005 and 2007, respectively, and the PhD degree from the University of Hong Kong, in 2014. He is currently at the imaging group of DJI. His research interests include image signal processing and machine learning. Before joining in DJI, he was a research scientist in Philips.

Lei Zhang

Department of Computing, The Hong Kong Polytechnic University, Hong Kong

Contents

1 Introduction

In the digital camera imaging process, it is an indispensable step to enhance the perceptual quality of output photos by using several cascaded modules such as exposure compensation, hue/saturation adjustment, color space conversion and manipulation, tone mapping and gamma correction [1]. These modules are often manually tuned by experienced engineers, which is very cumbersome since the results need to be evaluated in many different scenes. The images output by digital cameras may still need post-processing/retouching to further enhance their visual quality. Unfortunately, photo retouching is also a demanding and tedious task, which requires expertise in photograph and has complicated procedures when using professional image editing tools such as PhotoShop. It is highly desirable to learn an automatic photo enhancement model, which can robustly and efficiently enhance the perceptual quality of images captured under various scenes [2].

Hui Zeng

Department of Computing, The Hong Kong Polytechnic University, Hong Kong

Jianrui Cai

Department of Computing, The Hong Kong Polytechnic University, Hong Kong

Lida Li

Department of Computing, The Hong Kong Polytechnic University, Hong Kong

Zisheng Cao

Camera Group of DJI Innovations Company, Ltd., Shenzhen, China

Lei Zhang

Department of Computing, The Hong Kong Polytechnic University, Hong Kong

Lei Zhang (Fellow, IEEE) received the BSc degree from the Shenyang Institute of Aeronautical Engineering, Shenyang, P.R. China, in 1995, and the MSc and PhD degrees in control theory and engineering from Northwestern Polytechnical University, Xi'an, P.R. China, in 1998 and 2001, respectively. From 2001 to 2002, he was a research associate with the Department of Computing, The Hong Kong Polytechnic University. From January 2003 to January 2006 he worked as a postdoctoral fellow with the Department of Electrical and Computer Engineering, McMaster University, Canada. In 2006, he joined the Department of Computing, The Hong Kong Polytechnic University, as an assistant professor. Since July 2017, he has been a chair professor with the same department. His research interests include Computer Vision, Image and Video Analysis, Pattern Recognition, and Biometrics, etc. He has published more than 200 papers in those areas. As of 2020, his publications have been cited more than 54,000 times in literature. He is a senior associate editor of the IEEE Transactions on Image Processing, and is/was an associate editor of the IEEE Transactions on Pattern Analysis and Machine Intelligence, the SIAM Journal of Imaging Sciences, the IEEE Transactions on CSVT, and the Image and Vision Computing, etc. He is a “Clarivate Analytics Highly Cited Researcher” from 2015 to 2019. For more information, please visit http://www4.comp.polyu.edu.hk/~cslzhang/.

References is not available for this document.

Learning Image-Adaptive 3D Lookup Tables for High Performance Photo Enhancement in Real-Time

Abstract:

Metadata

Abstract:

ISSN Information:

Funding Agency:

1 Introduction

References

IEEE Account

Purchase Details

Profile Information

Need Help?

Learning Image-Adaptive 3D Lookup Tables for High Performance Photo Enhancement in Real-Time

Alerts

Abstract:

Metadata

Abstract:

ISSN Information:

Funding Agency:

1 Introduction

References

IEEE Account

Purchase Details

Profile Information

Need Help?