Loading [MathJax]/extensions/MathMenu.js
Video Super-Resolution Based on 3D-CNNS with Consideration of Scene Change | IEEE Conference Publication | IEEE Xplore

Video Super-Resolution Based on 3D-CNNS with Consideration of Scene Change


Abstract:

In video super-resolution, the spatio-temporal coherence between, and among the frames must be exploited appropriately for the accurate prediction of the high resolution ...Show More

Abstract:

In video super-resolution, the spatio-temporal coherence between, and among the frames must be exploited appropriately for the accurate prediction of the high resolution frames. Although 2D-CNNs are powerful in modelling images, 3D-CNNs are more suitable for spatio-temporal feature extraction as they can preserve the temporal information. To this end, we propose an effective 3D-CNN for video super-resolution that does not require motion alignment as preprocessing. The proposed 3DSRnet maintains the temporal depth of spatio-temporal feature maps to maximally capture the temporally nonlinear characteristics between low and high resolution frames, and adopts residual learning in conjunction with the sub-pixel outputs. It outperforms the state-of-the-art method with average 0.45 dB and 0.36 dB higher in PSNR, for scale 3 and 4, in the Vidset4 benchmark. Our 3DSRnet first deals with the performance drop due to scene change, which is important in practice but has not been previously considered.
Date of Conference: 22-25 September 2019
Date Added to IEEE Xplore: 26 August 2019
ISBN Information:

ISSN Information:

Conference Location: Taipei, Taiwan
References is not available for this document.

1. INTRODUCTION

Super-resolution (SR) methods upscale low resolution (LR) images or video frames to high resolution (HR) ones. One application of SR is in solving the lack of Ultra High Definition (UHD) visual content. Likewise, many SR applications are in videos, where the reconstruction of HR frames may benefit from additional information contained in the previous and future LR frames. This can be used to constrain the solution space of SR, which is an ill-posed problem. While video frames exhibit high temporal coherence, the camera or object motion can also provide a different angle or scale of the parts in the current frame in the consecutive surrounding frames, which can be effectively utilized as crucial clues in constructing high quality HR frames. Consequently, a video SR algorithm should fully exploit the temporal relations between the frames to aggregate them with the spatial information.

Select All
1.
W. Shi, J. Caballero, F. Huszár, J. Totz, A. P Aitken, R. Bishop, et al., "Real-time single image and video super-resolution using an efficient sub-pixel convolutional neural network", Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1874-1883, 2016.
2.
C. Dong, C. C. Loy, K. He and X. Tang, "Image super-resolution using deep convolutional networks", IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 38, no. 2, pp. 295-307, 2016.
3.
C. Dong, C. C. Loy, K. He and X. Tang, "Learning a deep convolutional network for image super-resolution", Proceedings of the European Conference on Computer Vision, pp. 184-199, 2014.
4.
J. Kim, J. K. Lee and K. M. Lee, "Accurate image super-resolution using very deep convolutional networks", Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1646-1654, 2016.
5.
A. Kappeler, S. Yoo, Q. Dai and A. K Katsaggelos, "Video super-resolution with convolutional neural networks", IEEE Transactions on Computational Imaging, vol. 2, no. 2, pp. 109-122, 2016.
6.
J. Caballero, C. Ledig, A. Aitken, A. Acosta, J. Totz, Z. Wang, et al., "Real-time video super-resolution with spatio-temporal networks and motion compensation", Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2848-2857, 2017.
7.
X. Tao, H. Gao, R. Liao, J. Wang and J. Jia, "Detail-revealing deep video super-resolution", Proceedings of the IEEE International Conference on Computer Vision, pp. 4472-4480, 2017.
8.
M. SM Sajjadi, R. Vemulapalli and M. Brown, "Frame-recurrent video super-resolution", Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 6626-6634, 2018.
9.
A. Lucas, S. Lopez-Tapiad, R. Molinae and A. K. Katsaggelos, "Generative adversarial networks and perceptual losses for video super-resolution", IEEE Transactions on Image Processing, 2019.
10.
D. Tran, L. Bourdev, R. Fergus, L. Torresani and M. Paluri, "Learning spatiotemporal features with 3d convolutional networks", IEEE International Conference on Computer Vision, pp. 4489-4497, 2015.
11.
S. Ji, W. Xu, M. Yang and K. Yu, "3d convolutional neural networks for human action recognition", IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 35, no. 1, pp. 221-231, 2013.
12.
I. Teivas, "Video event classification using 3d convolutional neural networks", M.S. thesis, 2016.
13.
X. Glorot, A. Bordes and Y. Bengio, "Deep sparse rectifier neural networks", Proceedings of the Fourteenth International Conference on Artificial Intelligence and Statistics, pp. 315-323, 2011.
14.
K. He, X. Zhang, S. Ren and J. Sun, "Deep residual learning for image recognition", Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770-778, 2016.
15.
X. Glorot and Y. Bengio, "Understanding the difficulty of training deep feedforward neural networks", Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics, pp. 249-256, 2010.
16.
D. P Kingma and J. Ba, "Adam: A method for stochastic optimization", Proceedings of the IEEE International Conference on Learning Representations, 2015.
17.
A. Vedaldi and K. Lenc, "Matconvnet: Convolutional neural networks for matlab", Proceedings of the 23rd ACM International Conference on Multimedia, pp. 689-692, 2015.
18.
"3d convolution implementation", [online] Available: https://github.com/pengsun/MexConv3D.
19.
C. Liu and D. Sun, "On bayesian adaptive video super resolution", IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 36, no. 2, pp. 346-360, 2014.
20.
R. Liao, X. Tao, R. Li, Z. Ma and J. Jia, "Video super-resolution via deep draft-ensemble learning", Proceedings of the IEEE International Conference on Computer Vision, pp. 531-539, 2015.
21.
D. Liu, Z. Wang, Y. Fan, X. Liu, Z. Wang, S. Chang, et al., "Robust video super-resolution with learned temporal dynamics", Proceedings of the IEEE International Conference on Computer Vision, pp. 2526-2534, 2017.
22.
S. Y. Kim, J. Lim, T. Na and M. Kim, "3DSRnet: Video Super-resolution using 3D Convolutional Neural Networks", Dec. 2018.

Contact IEEE to Subscribe

References

References is not available for this document.