I. Introduction
Super-resolution (SR) [1] refers to the process of generating high-resolution (HR) images or videos from low-resolution (LR) inputs and has been an important topic of research for several decades. Early methods relied on spatial interpolation techniques [2], but their inability to generate high-frequency details soon motivated the introduction of more complex approaches, such as statistical [3], prediction-based [4], patch-based [5] or edge-based methods [6]. The most significant advances were however delivered by emerging deep-learning (DL) techniques [7] and in particular by convolutional neural networks (CNNs).