1. Introduction
Super-resolution (SR) is an essential visual task with the aim of creating a high-resolution (HR) image based on a low-resolution image (LR) by compensating for the missing details in the low-resolution image. Video super-resolution (VSR) is an important embranchment of research in super-resolution methods. Since the video input is composed of consecutive frames, the correlation between video frames is particularly important for the performance of video super-resolution. In recent years, due to the breakthroughs in convolutional neural networks (CNN), CNN-based VSR methods have performed better than traditional VSR methods in terms of Peak Signal-to-Noise Ratio (PSNR), Structural Similarity index (SSIM) and other evaluation metrics.