Spatial clustering of pixels in the mouth area of face images

We propose a method of image segmentation using a Gaussian mixture model of the colour image histogram. The model construction is based on the model validation philosophy of architecture selection (Kittler et al., 2001). In contrast with the k-means clustering approach, the number of segments in the proposed scheme is determined completely automatically. We show that the modelling method can be strengthened by incorporating spatial contextual information. The proposed approach speeds up the modelling process by a factor of three. The advocated methodology is successfully applied to the problem of lip pixel segmentation in face images.

[1]  R. Hathaway Another interpretation of the EM algorithm for mixture distributions , 1986 .

[2]  Alexander H. Waibel,et al.  A real-time face tracker , 1996, Proceedings Third IEEE Workshop on Applications of Computer Vision. WACV'96.

[3]  Gérard Govaert,et al.  Convergence of an EM-type algorithm for spatial clustering , 1998, Pattern Recognit. Lett..

[4]  Andrew R. Barron,et al.  Minimum complexity density estimation , 1991, IEEE Trans. Inf. Theory.

[5]  H. Akaike A new look at the statistical model identification , 1974 .

[6]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[7]  Juergen Luettin,et al.  Audio-Visual Speech Modeling for Continuous Speech Recognition , 2000, IEEE Trans. Multim..

[8]  Josef Kittler,et al.  Model complexity validation for PDF estimation using Gaussian mixtures , 1998, Proceedings. Fourteenth International Conference on Pattern Recognition (Cat. No.98EX170).

[9]  H. McGurk,et al.  Hearing lips and seeing voices , 1976, Nature.

[10]  William H. Press,et al.  The Art of Scientific Computing Second Edition , 1998 .

[11]  G. Schwarz Estimating the Dimension of a Model , 1978 .

[12]  J. Rissanen Stochastic complexity and the mdl principle , 1987 .

[13]  Jiri Matas,et al.  XM2VTSDB: The Extended M2VTS Database , 1999 .

[14]  Jiri Matas,et al.  Statistical chromaticity-based lip tracking with B-splines , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[15]  Josef Kittler,et al.  Model Validation for Model Selection , 2001, ICAPR.

[16]  Josef Kittler,et al.  Segmentation of lip pixels for lip tracker initialisation , 2001, Proceedings 2001 International Conference on Image Processing (Cat. No.01CH37205).

[17]  Andrew Blake,et al.  Real-time lip trackers for use in audio-visual speech recognition , 1996 .