I. Introduction
Super-resolution (SR) aims to generate high-resolution (HR) images from their low-resolution (LR) counterparts. As a typical low-level computer vision problem, SR has been investigated for decades [1]–[3]. Recently, converting LR videos into HR ones, namely video SR, has come into great demand due to the prevalence of high-definition displays. Unlike a single image, a video clip contains adjacent frames that provide additional information for SR. Therefore, exploiting the temporal dependency between consecutive frames plays an important role in video SR.