I. Introduction
High dynamic range television (HDRTV1) videos have significantly improved the visual quality. Compared to standard dynamic range television (SDRTV) videos, HDRTV videos offer a broader range of brightness levels and a wider color gamut, enabling a more realistic representation of the natural world. With the increasing support for HDR displays in devices like televisions, smartphones, and computers, there is a growing demand for converting existing SDRTV videos to HDRTV in order to enhance video broadcasting services. In this paper, we refer to the task of converting SDRTV videos to HDRTV videos as SDRTV-to-HDRTV, adopting the definition provided by HDRTVNet [1]. It is essential to note that the intent of SDRTV-to-HDRTV differs from Inverse Tone Mapping (ITM), as illustrated in Fig. 2. ITM aims to restore the original luminance of the captured scene, focusing on recovering information in saturated areas. In contrast, SDRTV-to-HDRTV aims to improve the visual quality of video on HDR displays, offering comprehensive enhancements in dynamic range, gamut, and local contrast.
We add a suffix TV after HDR to indicate content in HDR-TV format and standard(HDR10 and Dolby Vision, etc.).
Efficiency comparison between our method and other works. EffiHDR-Base is our SDRTV-to-HDRTV Reconstruction model with the downsampling factor 8. EffiHDR-Enhancer is the cascade of our EffiHDR-Base model and HDRTV Enhancement model.
SDRTV-to-HDRTV differs from Inverse Tone Mapping (ITM) functionally. ITM focuses on restoring the original luminance of the captured scene. To display predicted HDR contents, tone mapping may be needed to compress the dynamic range. Because the dynamic range that HDR displays can present is still limited. In contrast, SDRTV-to-HDRTV aims to directly predict the HDRTV videos for HDR display.