Loading [MathJax]/extensions/MathZoom.js
Spatial-Frequency Mutual Learning for Face Super-Resolution | IEEE Conference Publication | IEEE Xplore

Spatial-Frequency Mutual Learning for Face Super-Resolution


Abstract:

Face super-resolution (FSR) aims to reconstruct high-resolution (HR) face images from the low-resolution (LR) ones. With the advent of deep learning, the FSR technique ha...Show More

Abstract:

Face super-resolution (FSR) aims to reconstruct high-resolution (HR) face images from the low-resolution (LR) ones. With the advent of deep learning, the FSR technique has achieved significant breakthroughs. However, existing FSR methods either have a fixed receptive field or fail to maintain facial structure, limiting the FSRperformance. To circumvent this problem, Fourier transform is introduced, which can capture global facial structure information and achieve image-size receptive field. Relying on the Fourier transform, we devise a spatial-frequency mutual network (SFMNet) for FSR, which is the first FSR method to explore the correlations between spatial and frequency domains as far as we know. To be specific, our SFMNet is a two-branch network equipped with a spatial branch and a frequency branch. Benefiting from the property of Fourier transform, the frequency branch can achieve image-size receptive field and capture global dependency while the spatial branch can extract local dependency. Considering that these dependencies are complementary and both favorable for FSR, we further develop a frequency-spatial interaction block (FSIB) which mutually amalgamates the complementary spatial and frequency information to enhance the capability of the model. Quantitative and qualitative experimental results show that the proposed method out-performs state-of-the-art FSR methods in recovering face images. The implementation and model will be released at https://github.com/wcy-cs/SFMNet.
Date of Conference: 17-24 June 2023
Date Added to IEEE Xplore: 22 August 2023
ISBN Information:

ISSN Information:

Conference Location: Vancouver, BC, Canada
References is not available for this document.

1. Introduction

Face super-resolution (FSR), also known as face halluci-nation, is a technology which can transform low-resolution (LR) face images into the corresponding high-resolution (HR) ones. Limited by low-cost cameras and imaging con-ditions, the obtained face images are always low-quality, resulting in a poor visual effect and deteriorating the down-stream tasks, such as face recognition, face attribute analysis, face editing, etc. Therefore, FSR has become an emerging scientific tool and has gained more of the spotlight in the computer vision and image processing communities [20].

Decomposition and reconstruction of face image in the frequency domain. (a) Denote face images; (b) are their amplitude spectrum; (c) show their phase spectrum; (d) present the reconstructed images with amplitude information only; (e) are the re-constructed images with phase information only.

Select All
1.
Simon Baker and Takeo Kanade, "Hallucinating faces", IEEE Int. conf. automatic Face I and Gesture Recognition, pp. 83-88, 2000.
2.
Tadas Baltrusaitis, Peter Robinson and Louis-Philippe Morency, "Constrained local neural fields for robust facial landmark detection in the wild", Proceedings of the IEEE International Conference on Computer Vision Workshops, pp. 354-361, 2013.
3.
Tadas Baltrusaitis, Amir Zadeh, Yao Chong Lim and Louis-Philippe Morency, "Openface 2. 0: Facial behavior analysis toolkit", Proceedings of the IEEE International Conference on Automatic Face & Gesture Recognition, pp. 59-66, 2018.
4.
Adrian Bulat and Georgios Tzimiropoulos, "Super-fan: In-tegrated facial landmark localization and super-resolution of real-world low resolution faces in arbitrary poses with GANs", Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 109-117, 2018.
5.
Qingxing Cao, Liang Lin, Yukai Shi, Xiaodan Liang and Guanbin Li, "Attention-aware face hallucination via deep reinforcement learning", Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 690-698, 2017.
6.
Ayan Chakrabarti, A. N. Rajagopalan and Rama Chellappa, "Super-resolution of face images using kernel pea-based prior", IEEE Transactions on Multimedia, vol. 9, no. 4, pp. 888-892, 2007.
7.
Kelvin CK Chan, Xintao Wang, Xiangyu Xu, Jinwei Gu and Chen Change Loy, "Glean: Generative latent bank for large-factor image super-resolution", Proceedings of the IEEE/CVF conference on computer vision and pattern rcognition, pp. 14245-14254, 2021.
8.
Chaofeng Chen, Dihong Gong, Hao Wang, Zhifeng Li and Kwan-Yee K. Wong, "Learning spatial attention for face super-resolution", IEEE Trans. Image Processing, vol. 30, pp. 1219-1231, 2021.
9.
Yu Chen, Ying Tai, Xiaoming Liu, Chunhua Shen and Jian Yang, "FSRNet: End-to-end learning face super-resolution with facial priors", Proceedings of The IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 2492-2501, 2018.
10.
Chao Dong, Chen Change Loy, Kaiming He and Xiaoou Tang, "Image super-resolution using deep convolutional networks", IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 38, no. 2, pp. 295-307, 2015.
11.
Hao Dou, Chen Chen, Xiyuan Hu, Zuxing Xuan, Zhisen Hu and Silong Pengo, "Pca-srgan: Incremental orthogonal projection discrimination for face super-resolution", Proceedings of the ACM International Conference on Multimedia, pp. 1891-1899, 2020.
12.
Dario Fuoli, Luc Van Gool and Radu Timofte, "Fourier space losses for efficient perceptual image super-resolution", Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 2360-2369, 2021.
13.
Yuchao Gu, Xintao Wang, Liangbin Xie, Chao Dong, Gen Li, Ying Shan, et al., "VQFR: Blind face restoration with vector-quantized dictionary and parallel de-coder", ECCV, 2022.
14.
Kaiming He, Xiangyu Zhang, Shaoqing Ren and Sun Jian, "Deep residual learning for image recognition", Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770-778, 2016.
15.
G. B. Huang, M. Mattar, T. Berg and E. Learned-Miller, Labeled faces in the wild: A database for studying face recognition in unconstrained environments, 2007.
16.
Huaibo Huang, Ran He, Zhenan Sun and Tieniu Tan, "Wavelet-SRNet: A wavelet-based cnn for multi-scale face super resolution", Proceedings of the IEEE International Conference on Computer Vision, pp. 1689-1697, 2017.
17.
Huaibo Huang, Ran He, Zhenan Sun and Tieniu Tan, "Wavelet domain generative adversarial network for multi-scale face hallucination", International Journal of Computer Vision, vol. 127, no. 6-7, pp. 763-784, 2019.
18.
Jie Huang, Yajing Liu, Feng Zhao, Keyu Yan, Jinghao Zhang, Yukun Huang, et al., "Deep fourier-based exposure correction network with spatial-frequency interaction", Computer Vision-ECCV 2022: 17th European Conference, pp. 163-180, October 23-27, 2022.
19.
Junjun Jiang, Ruimin Hu, Zhongyuan Wang and Zhen Han, "Face super-resolution via multilayer locality-constrained it-erative neighbor embedding and intermediate dictionary learning", IEEE Trans. Image Process., vol. 23, no. 10, pp. 4220-4231, 2014.
20.
Junjun Jiang, Chenyang Wang, Xianming Liu and Jiayi Ma, "Deep learning-based face super-resolution: A survey", ACM Computing Surveys, vol. 55, no. 1, pp. 1-36, 2023.
21.
Junjun Jiang, Yu Yi, Jinhui Hu, Suhua Tang and Jiayi Ma, "Deep cnn denoiser and multi-layer neighbor component embedding for face hallucination", Proceedings of the Inter-national Joint Conference on Artificial Intelligence, pp. 771-778, 2018.
22.
Liming Jiang, Bo Dai, Wayne Wu and Chen Change Loy, "Focal frequency loss for image reconstruction and synthesis", Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 13919-13929, 2021.
23.
Cheolkon Jung, Licheng Jiao, Liu Bing and Maoguo Gong, "Position-patch based face hallucination using convex opti-mization", IEEE Signal Processing Letters, vol. 18, no. 6, pp. 367-370, 2011.
24.
Junhyung Kwak and Donghee Son, "Fractal residual network and solutions for real super-resolution", Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), pp. 2114-2121, 2019.
25.
Vuong Le, Jonathan Brandt, Zhe Lin, Lubomir Bourdev and Thomas S. Huang, "Interactive facial feature localization", Proceedings of the European Conference on Computer Vision, pp. 679-692, 2012.
26.
Xiaoming Li, Chaofeng Chen, Shangchen Zhou, Xianhui Lin, Wangmeng Zuo and Lei Zhang, "Blind face restoration via deep multi-scale component dictionaries", computer vision-eccv 2020: 16th european conference, pp. 399-415, august 23-28, 2020.
27.
Xiaoming Li, Wenyu Li, Dongwei Ren, Hongzhi Zhang, Meng Wang and Wangmeng Zuo, "Enhanced blind face restoration with multi-exemplar images and adaptive spatial feature fusion", Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 2706-2715, 2020.
28.
Xiaoming Li, Ming Liu, Yuting Ye, Wangmeng Zuo, Liang Lin and Ruigang Yang, "Learning warped guidance for blind face restoration", Proceedings of the European conference on computer vision (ECCV), pp. 272-289, 2018.
29.
Bee Lim, Sanghyun Son, Heewon Kim, Seungjun Nah and Kyoung Mu Lee, "Enhanced deep residual networks for single image super-resolution", Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Wo rkshops, pp. 1132-1940, 2017.
30.
Ziwei Liu, Luo Ping, Xiaogang Wang and Xiaoou Tang, "Deep learning face attributes in the wild", Proceedings of the IEEE International Conference on Computer Vision, pp. 3730-3738, 2016.
Contact IEEE to Subscribe

References

References is not available for this document.