I. Introduction
When taking photos in front of semi-reflectors like glass windows, photographers prevailingly attempt to capture transmission scenes behind the glass, while reflection contamination often degrades the image quality. Consequently, single-image reflection removal has become an attractive topic in computational photography [4], [5], [15], [22], [47], [64], which aims at removing undesirable glass reflections and recovering the clear transmission scene from a contaminated mixture image. The mixture image (denoted as \mathbf {M}) can be considered as the combination of two components: the transmission scene (denoted as \mathbf {T}_\mathrm{S}) and the reflection layer (denoted as \mathbf {R}_\mathrm{L}) [4], [22]. The major challenge of such an ill-posed layer separation problem is that both transmission scenes and reflection layers are from natural scenes, whose image content can be arbitrary, arousing the difficulty of differentiating the dominant content for mixture images. We call it content ambiguity in this article.