Fisheye camera modeling for human segmentation refinement in indoor videos

In this paper, we concentrate on refining the results of segmenting human presence from indoors videos acquired by a fisheye camera, using a 3D mathematical model of the camera. The model has been calibrated according to the specific indoor environment that is being monitored. Human segmentation is implemented using a standard established technique. The fisheye camera used for video acquisition is modeled using a spherical element, while the parameters of the camera model are determined only once, using the correspondence of a number of user-defined landmarks, both in real world coordinates and on the acquired video frame. Subsequently, each pixel of the video frame is inversely mapped to the direction of view in the real world and the relevant data are stored in look-up tables for very fast utilization in real-time video processing. The proposed fisheye camera model enables the inference of possible real world positions of a segmented cluster of pixels in the video frame. In this work, we utilize the constructed camera model to achieve a simple geometric reasoning that corrects gaps and mistakes of the human figure segmentation. Initial results are also presented for a small number of video sequences, which prove the efficiency of the proposed method.

[1]  Keiichi Kemmotsu,et al.  Recognizing human behaviors with vision sensors in a network robot system , 2006, Proceedings 2006 IEEE International Conference on Robotics and Automation, 2006. ICRA 2006..

[2]  Alex Pentland,et al.  Pfinder: Real-Time Tracking of the Human Body , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[3]  Nigel J. B. McFarlane,et al.  Segmentation and tracking of piglets in images , 1995, Machine Vision and Applications.

[4]  Rita Cucchiara,et al.  Detecting Moving Objects, Ghosts, and Shadows in Video Streams , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[5]  W. Eric L. Grimson,et al.  Adaptive background mixture models for real-time tracking , 1999, Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149).

[6]  Hongdong Li,et al.  Plane-Based Calibration and Auto-calibration of a Fish-Eye Camera , 2006, ACCV.

[7]  Kostas Delibasis,et al.  Near real-time human silhouette and movement detection in indoor environments using fixed cameras , 2012, PETRA '12.

[8]  Shanq-Jang Ruan,et al.  Illumination-Sensitive Background Modeling Approach for Accurate Moving Object Detection , 2011, IEEE Transactions on Broadcasting.

[9]  Tomás Pajdla,et al.  Structure from motion with wide circular field of view cameras , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[10]  Xi Chen,et al.  Activity Analysis, Summarization, and Visualization for Indoor Human Activity Monitoring , 2008, IEEE Transactions on Circuits and Systems for Video Technology.

[11]  Bart Vanrumste,et al.  How to detect human fall in video? An overview , 2009 .

[12]  Jake K. Aggarwal,et al.  Intrinsic parameter calibration procedure for a (high-distortion) fish-eye lens camera with distortion model and accuracy estimation , 1996, Pattern Recognit..

[13]  John F. Canny,et al.  A Computational Approach to Edge Detection , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[14]  Ned Greene,et al.  Environment Mapping and Other Applications of World Projections , 1986, IEEE Computer Graphics and Applications.

[15]  Anup Basu,et al.  Modeling fish-eye lenses , 1993, Proceedings of 1993 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS '93).

[16]  Jin Wei,et al.  Fisheye Video Correction , 2012, IEEE Transactions on Visualization and Computer Graphics.