Loading [MathJax]/extensions/MathMenu.js
Detailed and Controllable Old Photo Restoration with Diffusion Priors | IEEE Conference Publication | IEEE Xplore

Detailed and Controllable Old Photo Restoration with Diffusion Priors


Abstract:

Restoring old photos that contain numerous unknown and complex defects is a challenging and ill-posed problem. Traditional methods often struggle to address both structur...Show More

Abstract:

Restoring old photos that contain numerous unknown and complex defects is a challenging and ill-posed problem. Traditional methods often struggle to address both structured and unstructured defects in real old photos, frequently leading to over-smoothed and uncompleted results. In this paper, we exploit powerful diffusion priors to construct a novel solution for the restoration of old photos. Our framework begins with a coarse stage model aimed at eliminating unstructured defects, followed by a conditional diffusion model that further refines the content and enriches details. Specifically, we introduce an edge control module, designed to encode the restored edge map into the denoising network. This integration effectively guides the restoration process, allowing the edge conditional diffusion to achieve more precise and controllable results. Additionally, we incorporate a feature fusion VAE to ensure the fidelity of the final outputs. Through extensive qualitative, quantitative, and ablative experiments, we demonstrate the innovation and effectiveness of the proposed method, which offers a complete, detailed, and personalized restoration of old photos.
Date of Conference: 19-21 June 2024
Date Added to IEEE Xplore: 31 July 2024
ISBN Information:

ISSN Information:

Conference Location: Toronto, ON, Canada
References is not available for this document.

I. Introduction

Old photo restoration involves a collection of processes including denoising, deblurring, inpainting, and tonal adjustment. This task is complex and inherently ill-posed due to the variety of defects that old photos may suffer. Defects in old photos can be broadly categorized into two types: structured defects, such as scratches, mold spots, creases, and stains, and unstructured defects, such as noise, fading, and blurring. To address these challenges, image inpainting and image restoration techniques have been developed to deal with structured and unstructured defects respectively.

Select All
1.
Deepak Pathak, Philipp Krahenbuhl, Jeff Donahue, Trevor Darrell and Alexei A Efros, "Context encoders: Feature learning by inpainting", Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 2536-2544, 2016.
2.
Roman Suvorov, Elizaveta Logacheva, Anton Mashikhin, Anastasia Remizova, Arsenii Ashukha, Aleksei Silvestrov, et al., "Resolution-robust large mask inpainting with fourier convolutions", Proceedings of the IEEE/CVF winter conference on applications of computer vision, pp. 2149-2159, 2022.
3.
Kamyar Nazeri, Eric Ng, Tony Joseph, Faisal Z Qureshi and Mehran Ebrahimi, "Edgeconnect: Generative image inpainting with adversarial edge learning", arXiv preprint arXiv:1901.00212, 2019.
4.
Xintao Wang, Ke Yu, Shixiang Wu, Jinjin Gu, Yihao Liu, Chao Dong, et al., "Esrgan: Enhanced super-resolution generative adversarial networks", Proceedings of the European conference on computer vision (ECCV) workshops, pp. 0-0, 2018.
5.
Xintao Wang, Liangbin Xie, Chao Dong and Ying Shan, "Real-esrgan: Training real-world blind super-resolution with pure synthetic data", Proceedings of the IEEE/CVF international conference on computer vision, pp. 1905-1914, 2021.
6.
Masanori Suganuma, Xing Liu and Takayuki Okatani, "Attention-based adaptive selection of operations for image restoration in the presence of unknown combined distortions", Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 9039-9048, 2019.
7.
Boyun Li, Xiao Liu, Peng Hu, Zhongqin Wu, Jiancheng Lv and Xi Peng, "All-in-one image restoration for unknown corruption", Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 17452-17462, 2022.
8.
Ziyu Wan, Bo Zhang, Dongdong Chen, Pan Zhang, Dong Chen, Jing Liao, et al., "Bringing old photos back to life", proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2747-2757, 2020.
9.
Jonathan Ho, Ajay Jain and Pieter Abbeel, "Denoising diffusion probabilistic models", Advances in neural information processing systems, vol. 33, pp. 6840-6851, 2020.
10.
Robin Rombach, Andreas Blattmann, Dominik Lorenz, Patrick Esser and Bjorn Ommer, "High-resolution image synthesis with latent diffu- sion models", Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 10684-10695, 2022.
11.
Xibei Liu, Xinning Chai, Hengsheng Zhang, Rong Xie, Xiao Gu, Li Song, et al., "Old-photo restoration with detail-and structure-enhanced cascaded learning", 2023 IEEE International Conference on Multimedia and Expo Workshops (ICMEW)., pp. 308-313, 2023.
12.
Ian Goodfellow, Jean Pouget-Abadie, Mehdi Mirza, Bing Xu, David Warde-Farley, Sherjil Ozair, et al., "Generative adversarial nets", Advances in neural information processing systems, vol. 27, 2014.
13.
Ting-Chun Wang, Ming-Yu Liu, Jun-Yan Zhu, Andrew Tao, Jan Kautz and Bryan Catanzaro, "High-resolution image synthesis and semantic manipulation with conditional gans", Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 8798-8807, 2018.
14.
Chong Mou, Xintao Wang, Liangbin Xie, Jian Zhang, Zhongang Qi, Ying Shan, et al., "T2i-adapter: Learning adapters to dig out more controllable ability for text-to-image diffusion models", arXiv preprint arXiv:2302.08453, 2023.
15.
Jiaming Song, Chenlin Meng and Stefano Ermon, "Denoising diffusion implicit models", International Conference on Learning Representations, 2020.
16.
Phillip Isola, Jun-Yan Zhu, Tinghui Zhou and Alexei A Efros, "Imageto-image translation with conditional adversarial networks", Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 1125-1134, 2017.
17.
Mark Everingham, SM Ali Eslami, Luc Van Gool, Christopher KI Williams, John Winn and Andrew Zisserman, "The pascal visual object classes challenge: A retrospective", International journal of computer vision, vol. 111, pp. 98-136, 2015.
18.
Eirikur Agustsson and Radu Timofte, "Ntire 2017 challenge on single image super-resolution: Dataset and study", Proceedings of the IEEE conference on computer vision and pattern recognition workshops, pp. 126-135, 2017.
19.
Junnan Li, Dongxu Li, Silvio Savarese and Steven Hoi, "Blip-2: Bootstrapping language-image pre-training with frozen image encoders and large language models", arXiv preprint arXiv:2301.12597, 2023.
20.
Richard Zhang, Phillip Isola, Alexei A Efros, Eli Shechtman and Oliver Wang, "The unreasonable effectiveness of deep features as a perceptual metric", Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 586-595, 2018.
21.
Martin Heusel, Hubert Ramsauer, Thomas Unterthiner, Bernhard Nessler and Sepp Hochreiter, "Gans trained by a two time-scale update rule converge to a local nash equilibrium", Advances in neural information processing systems, vol. 30, 2017.

Contact IEEE to Subscribe

References

References is not available for this document.