Multi-clue based multi-camera face capturing

This paper presents a novel face capturing approach that integrates several cooperative cameras using multiple clues. Being different from face detection, face capturing means active face gathering by cameras. Our idea is that face pose can be separated into adjacent pose subspaces by cooperative cameras in different directions. In this way, face capturing may be easier than face detection since face detection probably surfers from the pose problem if people do no like to see cameras directly. When different cameras are mounted in different face directions, high performance face detection algorithms can be used in each camera. Consequently, the multi-pose face capturing problem is transformed to face correspondence among cameras after face detection. In the proposed approach, face detection is first carried out in each camera. The detected faces are then corresponded by dynamic programming. For efficient face correspondence, the object features (face position, face appearance) and context features (clothes color, face order) are elaborately combined in the feature extraction and feature matching step. Experiments are carried out in a real scene. The results showed very good performance with 95.6% correct correspondence and 0.9% errors.

[1]  Hideki Hashimoto,et al.  Global Color Model Based Object Matching in the Multi-Camera Environment , 2006, 2006 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[2]  D. Scharstein,et al.  A Taxonomy and Evaluation of Dense Two-Frame Stereo Correspondence Algorithms , 2001, Proceedings IEEE Workshop on Stereo and Multi-Baseline Vision (SMBV 2001).

[3]  Pradeep K. Khosla,et al.  Agent-based moving object correspondence using differential discriminative diagnosis , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[4]  Jiansheng Chen,et al.  Real-time object correspondence in stereo camera system , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.

[5]  Mohammed Ghanbari,et al.  Establishing Object Correspondences by Utilizing Surrounding Information , 2006, 2006 International Conference on Image Processing.

[6]  Miguel A. Patricio,et al.  Robust data fusion in a visual sensor multi-agent architecture , 2007, 2007 10th International Conference on Information Fusion.

[7]  Ming Liu,et al.  Robust Multi-View Multi-Camera Face Detection inside Smart Rooms Using Spatio-Temporal Dynamic Programming , 2006, 7th International Conference on Automatic Face and Gesture Recognition (FGR06).

[8]  Azriel Rosenfeld,et al.  Face recognition: A literature survey , 2003, CSUR.

[9]  Emilio Maggio,et al.  Learning Scene Context for Multiple Object Tracking , 2009, IEEE Transactions on Image Processing.