Deformable 3D Convolution for Video Super-Resolution | IEEE Journals & Magazine | IEEE Xplore

Deformable 3D Convolution for Video Super-Resolution


Abstract:

The spatio-temporal information among video sequences is significant for video super-resolution (SR). However, the spatio-temporal information cannot be fully used by exi...Show More

Abstract:

The spatio-temporal information among video sequences is significant for video super-resolution (SR). However, the spatio-temporal information cannot be fully used by existing video SR methods since spatial feature extraction and temporal motion compensation are usually performed sequentially. In this paper, we propose a deformable 3D convolution network (D3Dnet) to incorporate spatio-temporal information from both spatial and temporal dimensions for video SR. Specifically, we introduce deformable 3D convolution (D3D) to integrate deformable convolution with 3D convolution, obtaining both superior spatio-temporal modeling capability and motion-aware modeling flexibility. Extensive experiments have demonstrated the effectiveness of D3D in exploiting spatio-temporal information. Comparative results show that our network achieves state-of-the-art SR performance. Code is available at: https://github.com/XinyiYing/D3Dnet.
Published in: IEEE Signal Processing Letters ( Volume: 27)
Page(s): 1500 - 1504
Date of Publication: 31 July 2020

ISSN Information:

Funding Agency:


I. Introduction

Video super-resolution (SR) aims at recovering high-resolution (HR) images from low-resolution (LR) video sequences. This technique has been widely employed in many applications such as video surveillance [1] and high-definition devices [2], [3]. Since video frames provide additional information in temporal dimension, it is important to fully use the spatio-temporal dependency to enhance the performance of video SR.

Contact IEEE to Subscribe

References

References is not available for this document.