Abstract:
This paper presents an analysis of the performance of two different skin chrominance models and of nine different chrominance spaces for the color segmentation and subsequent detection of human faces in two-dimensional static images. For each space, we use the single Gaussian model based on the Mahalanobis metric and a Gaussian mixture density model to segment faces from scene backgrounds. In the case of the mixture density model, the skin chrominance distribution is estimated by use of the expectation-maximisation (EM) algorithm. Feature extraction is performed on the segmented images by use of invariant Fourier-Mellin moments. A multilayer perceptron neural network (NN), with the invariant moments as the input vector, is then applied to distinguish faces from distractors. With the single Gaussian model, normalized color spaces are shown to produce the best segmentation results, and subsequently the highest rate of face detection. The results are comparable to those obtained with the more sophisticated mixture density model. However, the mixture density model improves the segmentation and face detection results significantly for most of the un-normalized color spaces. Ultimately, we show that, for each chrominance space, the detection efficiency depends on the capacity of each model to estimate the skin chrominance distribution and, most importantly, on the discriminability between skin and "non-skin" distributions.
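The single Gaussian model described above can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes a normalized r-g chrominance space (one possible normalized space), synthetic skin samples, and a hypothetical squared-distance threshold of 9.0; the function names are illustrative only.

```python
import numpy as np


def to_normalized_rg(rgb):
    """Map RGB pixels of shape (..., 3) to normalized r-g chrominance (..., 2)."""
    rgb = np.asarray(rgb, dtype=float)
    total = rgb.sum(axis=-1, keepdims=True)
    total = np.where(total == 0, 1.0, total)  # avoid dividing by zero on black pixels
    return (rgb / total)[..., :2]


def fit_single_gaussian(chroma):
    """Estimate the mean vector and covariance matrix from (n, 2) skin samples."""
    mean = chroma.mean(axis=0)
    cov = np.cov(chroma, rowvar=False)
    return mean, cov


def mahalanobis_sq(chroma, mean, cov):
    """Squared Mahalanobis distance of each chrominance vector to the skin model."""
    diff = np.asarray(chroma, dtype=float) - mean
    inv_cov = np.linalg.inv(cov)
    return np.einsum("...i,ij,...j->...", diff, inv_cov, diff)


def segment_skin(rgb_image, mean, cov, threshold=9.0):
    """Label a pixel as skin when its squared distance is within the threshold.

    The threshold value is an assumption made for this sketch.
    """
    return mahalanobis_sq(to_normalized_rg(rgb_image), mean, cov) <= threshold


# Synthetic "skin" training pixels (an assumption, not data from the paper).
rng = np.random.default_rng(0)
skin_rgb = rng.normal([200.0, 140.0, 120.0], 10.0, size=(500, 3)).clip(1, 255)
mean, cov = fit_single_gaussian(to_normalized_rg(skin_rgb))

# A 1x2 test image: one skin-like pixel and one saturated blue pixel.
image = np.array([[[200, 140, 120], [20, 40, 220]]])
mask = segment_skin(image, mean, cov)
print(mask)
```

A mixture density model would replace the single mean and covariance with several weighted components fitted by EM; the thresholding step would then act on the mixture likelihood rather than on a single Mahalanobis distance.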
Published in: Proceedings Fourth IEEE International Conference on Automatic Face and Gesture Recognition (Cat. No. PR00580)
Date of Conference: 28-30 March 2000
Date Added to IEEE Xplore: 06 August 2002
Print ISBN: 0-7695-0580-5