Generative adversarial networks using variational autoencoder discrimination for super-resolution | IEEE Conference Publication | IEEE Xplore

Generative adversarial networks using variational autoencoder discrimination for super-resolution


Abstract:

Training generative adversarial networks (GANs) relies on the game between the generator and the discriminator, so the improvement of the discriminator can promote the im...Show More

Abstract:

Training generative adversarial networks (GANs) relies on the game between the generator and the discriminator, so the improvement of the discriminator can promote the improvement of the generator. A variational autoencoder (VAE) is useful for classification because it can learn the probability distribution of an image and provide latent variables as output features. Therefore, we propose a new network structure using a variational autoencoder in the GAN discriminator for super-resolution (SR). This network uses the latent variables generated by the VAE to extract the probability distribution of a reconstructed image and an original high-resolution (HR) image so the latent variables are used as features for discrimination. In addition, we propose to train the whole GAN and the VAE network alternately to optimize the network parameters. We verify the proposed method on five GAN-based image super-resolution methods. Experimental results show that the proposed algorithm leads to state-of-the-art results when plugged into existing GAN based SR methods.
Date of Conference: 22-24 March 2024
Date Added to IEEE Xplore: 04 October 2024
ISBN Information:
Conference Location: Xi'an, China

Funding Agency:

References is not available for this document.

I. Introduction

Single image super-resolution reconstructs HR images from one or more low-resolution images [1], [2], which is a hot topic in computer vision. In recent years, neural networks have been widely used in single image super-resolution(SR) of their nonlinear feature representation ability.

Select All
1.
Z. Wang, J. Chen and S. C. H. Hoi, "Deep learning for image super-resolution: A survey", IEEE Transactions on Pattern Analysis and Machine Intelligence, pp. 1-22, 2021.
2.
M. E. Helou, R. Zhou and S. Susstrunk, Stochastic frequency masking to improve super-resolution and denoising networks, pp. 1-18, 2020, [online] Available: .
3.
S. Anwar and N. Barnes, "Densely residual laplacian super-resolution", IEEE Transactions on Pattern Analysis and Machine Intelligence, pp. 1-12, 2021.
4.
B. Niu, W. Wen, W. Ren, X. Zhang, L. Yang, S. Wang, et al., "Single image super-resolution via a holistic attention network", European Conference on Computer Vision, pp. 1-16, 2020, [online] Available: .
5.
Y. Blau and T. Michaeli, "The perception-distortion tradeoff", IEEE Conference on Computer Vision and Pattern Recognition, pp. 6228-6237, 2018.
6.
C. Ledig, L. Theis, F. Huszar, J. Caballero, A. Cun-ningham, A. Acosta, et al., "Photo-realistic single image super-resolution using a generative adversarial network", IEEE Conference on Computer Vision and Pattern Recognition, pp. 4681-4690, 2017.
7.
I. J. Goodfellow, J. Pouget-Abadie, M. Mirza, B. Xu, D. Warde-Farley, S. Ozairy, et al., "Generative adversarial nets" in Neural Information Processing Systems, Montral, Montral,CANADA, pp. 2672-2680, 2014.
8.
X. Wang, K. Yu, S. Wu, J. Gu, Y. Liu, C. Dong, et al., "Esrgan: Enhanced super-resolution generative adversarial networks", European Conference on Computer Vision Workshop, pp. 63-79, 2018.
9.
X. Wang, L. Xie, C. Dong and Y. Shan, "Real-esrgan: Training real-world blind super-resolution with pure synthetic data", IEEE International Conference on Computer Vision, 2021, [online] Available: .
10.
J. Liang, H. Zeng and L. Zhang, "Details or artifacts: A locally discriminative learning approach to realistic image super-resolution", IEEEE Conference on Computer Vision and Pattern Recognition, pp. 5647-5656, 2022.
11.
T. I. Suhail Hamdan, Yohei Fukumizu and H. Yamauchi, "Face image super-resolution with adaptive patch size to scaling factor", Journal of Image and Graphics, vol. 6, pp. 167-173, 2018.
12.
W. Li, K. Zhou, L. Qi, L. Lu, N. Jiang, J. Lu, et al., "Best-buddy gans for highly detailed image super-resolution", 2021.
13.
D. P. Kingma and M. Welling, "Auto-encoding variational bayes", 2013.
14.
H. Hu, M. Liao, W. Mao, W. Liu, C. Zhang and Y. Jing, "Variational auto-encoder for text generation", IEEE Information Technology and Mechatronics Engineering Conference, pp. 595-598, 2020.
15.
H. Guojie, J. Hui and Y. Yibiao, "Enhanced variational auto-encoder for voice conversion using non-parallel corpora", IEEE International Conference on Signal Processing, pp. 46-49, 2018.
16.
C. Bhushan, Z. Yang, N. Virani and N. Iyer, "Variational encoder-based reliable classification", IEEE International Conference on Image Processing, pp. 1-5, 2020.
17.
J. Li, Q. Huang, Y. Du, X. Zhen, S. Chen and L. Shao, "Variational abnormal behavior detection with motion consistency", IEEE Transactions on Image Processing, vol. 31, pp. 275-286, 2022.
18.
A. Abdella and I. Uysal, "A statistical comparative study on image reconstruction and clustering with novel vae cost function", IEEE Access, vol. 8, pp. 25626-25637, 2020.
19.
T. Chen, H. Liu, Z. Ma, Q. Shen, X. Cao and Y. Wang, "End-to-end learnt image compression via non-local attention optimization and improved context modeling", IEEE Transactions on Image Processing, vol. 30, pp. 3179-3191, 2021.
20.
J. Peng, D. Liu, S. Xu and H. Li, "Generating diverse structure for image inpainting with hierarchical vq-vae", IEEE Conference on Computer Vision and Pattern Recognition, pp. 10770-10779, 2021.
21.
F. Wu and X. Zhuang, "Unsupervised domain adaptation with variational approximation for cardiac segmentation", IEEE Transactions on Medical Imaging, vol. 40, no. 12, pp. 3555-3567, 2021.
22.
W. Sun, S. Yu and B. Dong, "Vdgan: A collaborative filtering framework based on variational denoising with gans", International Joint Conference on Neural Networks, pp. 1-8, 2021.
23.
M. Yang and J. Qi, "Reference-based image super-resolution by dual-variational autoencoder", International Conference on Communications Computing Cy-bersecurity and Informatics, pp. 1-5, 2021.
24.
R. Chen and Y. Zhang, "Learning dynamic generative attention for single image super-resolution", IEEE Transactions on Circuits and Systems for Video Technology, vol. 32, no. 12, pp. 8368-8382, 2022.
25.
Z.-S. Liu, W.-C. Siu, L.-W. Wang, C.-T. Li, M.-P. Cani and Y.-L. Chan, "Unsupervised real image super-resolution via generative variational autoencoder", IEEE Conference on Computer Vision and Pattern Recognition Workshops, pp. 1788-1797, 2020.
26.
Z.-S. Liu, W.-C. Siu and Y.-L. Chan, "Photo-realistic image super-resolution via variational autoencoders", IEEE Transactions on Circuits and Systems for Video Technology, vol. 31, no. 4, pp. 1351-1365, 2021.
27.
Z.-S. Liu, W.-C. Siu and L.-W. Wang, "Variational autoencoder for reference based image super-resolution", IEEE Conference on Computer Vision and Pattern Recognition Workshops, pp. 516-525, 2021.
28.
Z.-S. Liu, W.-C. Siu and Y.-L. Chan, "Reference based face super-resolution", IEEE Access, vol. 7, pp. 129112-129126, 2019.
29.
T. G. Keigo Kano and S. Hirano, "Performance improvement of face image super- resolution processing by high-precision skin color detection", Journal of Image and Graphics, vol. 7, pp. 82-89, 2019.
30.
I. Goodfellow, Y. Bengio and A. Cou, Deep Learning., Cambridge, Massachusetts, USA:The MIT Press, 2016.
Contact IEEE to Subscribe

References

References is not available for this document.