FPGA implementation of real-time skin color detection with mean-based surface flattening

Skin color is widely used in many applications because of its merit in human-machine interactions. However, detecting skin color requires repetitive operations on all pixels in the image, similar to other vision-based applications. Since the per-pixel processing is difficult to perform efficiently in conventional computers, many real-time image processing applications have problems with performance. In this paper, we propose FPGA implementation of a real-time skin color detection system. Among the various skin color detection methods, we chose a parametric skin distribution modeling method based on a Gaussian mixture, due to its acceptable training amount and skin detection performance. In addition, a mean-based surface flattening method was also proposed and implemented to improve the detection performance. The proposed method flattens the surface of objects in the scene by replacing the pixel value with the mean of its similar neighborhoods to remove the color noise. After this flattening process, the pixel values of the analogous adjacent pixels are located within a narrow range and are easily segmented to a different region. To consider the inherent parallelism of local image processing, all these functions are implemented within the FPGA to meet the demands of real-time performance.

[1]  P.C. Arribas Real time hardware vision system applications: optical flow and time to contact detector units , 2004, Proceedings of the Fifth IEEE International Caracas Conference on Devices, Circuits and Systems, 2004..

[2]  César Torres-Huitzil,et al.  A reconfigurable vision system for real-time applications , 2002, 2002 IEEE International Conference on Field-Programmable Technology, 2002. (FPT). Proceedings..

[3]  Vladimir Vezhnevets,et al.  A Survey on Pixel-Based Skin Color Detection Techniques , 2003 .

[4]  Azhar A. Sufi,et al.  An FPGA-Based Verification Framework for Real-Time Vision Systems , 2006, 2006 Conference on Computer Vision and Pattern Recognition Workshop (CVPRW'06).

[5]  Dorin Comaniciu,et al.  Mean Shift: A Robust Approach Toward Feature Space Analysis , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[6]  Richard W. Conners,et al.  A development system for creating real-time machine vision hardware using field programmable gate arrays , 1999, Proceedings of the 32nd Annual Hawaii International Conference on Systems Sciences. 1999. HICSS-32. Abstracts and CD-ROM of Full Papers.

[7]  Changjiu Zhou,et al.  Vision based motion planning of humanoid robots , 2004 .

[8]  William J. Dally,et al.  VLSI architecture: past, present, and future , 1999, Proceedings 20th Anniversary Conference on Advanced Research in VLSI.

[9]  Francis K. H. Quek,et al.  Extraction of Hand Gestures with Adaptive Skin Color Models and Its Applications to Meeting Analysis , 2006, Eighth IEEE International Symposium on Multimedia (ISM'06).