Noise Robust Face Hallucination via Locality-Constrained Representation | IEEE Journals & Magazine | IEEE Xplore

Noise Robust Face Hallucination via Locality-Constrained Representation


Abstract:

Recently, position-patch based approaches have been proposed to replace the probabilistic graph-based or manifold learning-based models for face hallucination. In order t...Show More

Abstract:

Recently, position-patch based approaches have been proposed to replace the probabilistic graph-based or manifold learning-based models for face hallucination. In order to obtain the optimal weights of face hallucination, these approaches represent one image patch through other patches at the same position of training faces by employing least square estimation or sparse coding. However, they cannot provide unbiased approximations or satisfy rational priors, thus the obtained representation is not satisfactory. In this paper, we propose a simpler yet more effective scheme called Locality-constrained Representation (LcR). Compared with Least Square Representation (LSR) and Sparse Representation (SR), our scheme incorporates a locality constraint into the least square inversion problem to maintain locality and sparsity simultaneously. Our scheme is capable of capturing the non-linear manifold structure of image patch samples while exploiting the sparse property of the redundant data representation. Moreover, when the locality constraint is satisfied, face hallucination is robust to noise, a property that is desirable for video surveillance applications. A statistical analysis of the properties of LcR is given together with experimental results on some public face databases and surveillance images to show the superiority of our proposed scheme over state-of-the-art face hallucination approaches.
Published in: IEEE Transactions on Multimedia ( Volume: 16, Issue: 5, August 2014)
Page(s): 1268 - 1281
Date of Publication: 11 March 2014

ISSN Information:

Funding Agency:


I. Introduction

With the rapid development of intelligent surveillance systems, surveillance cameras have been deployed in various areas including security and protection systems. Surveillance images, especially face images, can provide very important clues to criminal investigation. However, the resolution of a video camera is usually not High-Definition (HD) (see Fig. 1(a)), and the low resolution of the interested face in the picture resulted from the long distance between the object and the camera (see Fig. 1(c)) makes it almost impossible to provide useful information (see Fig. 1(b)). Moreover, in real surveillance scenarios, the qualities of the surveillance images are deteriorated by many environmental factors, such as underexposure, optical blurring, and defocusing. Consequently, the face images of interest are too blurred to be identifiable by humans. In order to obtain enough facial feature details for recognition, a new technique called face super-resolution or face hallucination is adopted to generate High-Resolution (HR) face image from Low-Resolution (LR) images. Existing image hallucination methods mainly fall into two categories: reconstruction-based techniques and learning-based techniques. Based on registration and alignment of multiple LR images of the same scene in sub-pixel accuracy, the former are more susceptible to ill- conditioned registration and inappropriate blurring operators [1], while the latter can generate better performance and higher magnification factor—with the help of a set of training examples. We focus on learning-based method in the sequel. Typical frames from surveillance videos. (a) and (c) are the surveillance images from a camera with CIF size ( pixels) and a camera with 720P size ( pixels) respectively; (b) shows two interested faces extracted from (a) and (c).

Contact IEEE to Subscribe

References

References is not available for this document.