I. Introduction
Nowadays, computer vision technology plays an important role in research and industry communities, but high-resolution information is usually not easy to obtain, especially for videos. Hence, video super-resolution is a good solution. However, video super-resolution algorithms are facing two challenges. On the one hand, the accuracy is not so satisfactory. On the other hand, many video applications require high-speed models, even real-time models. Traditional algorithms, such as bicubic interpolation and bilinear interpolation, cannot gain the ideal output, while machine learning-based algorithms can get better results than previous methods, but usually at the cost of time-consuming training and enormous model parameters.