Non-parametric human segmentation using support vector machine

The human category is among the most commonly captured and therefore important subject in digital cameras. As a result, human segmentation also plays an important role in the camera system. This paper presents a framework of non-parametric learning that achieves human segmentation using support vector machine (SVM). In the training stage, the training human windows are warped to a normalized size and oversegmented into regular superpixels. A two-dimensional (2D) array of SVM models is then trained by extracting various edge and color features from each superpixel. In this process of SVM training, the 2D array of SVM models automatically and effectively learns the characteristics of the human shape. Given an affinely warped human window for testing, the proposed method calculates superpixels' initial scores of belonging to a human (foreground) using the trained SVM models. Finally, the initial prediction scores are effectively propagated by optimizing a well-defined energy function using an estimated confidence map. In experiments on a publicly available challenging dataset, the proposed framework rapidly yields excellent results in human segmentation both qualitatively and quantitatively1.

[1]  Xiaokang Yang,et al.  Efficient Spatio-temporal Segmentation for Extracting Moving Objects in Video Sequences , 2007, IEEE Transactions on Consumer Electronics.

[2]  Kang-Ping Liu,et al.  Exploring interaction modalities for a selfie drone , 2015, SIGGRAPH Asia Posters.

[3]  Sergio Escalera,et al.  Learning to Segment Humans by Stacking Their Body Parts , 2014, ECCV Workshops.

[4]  Vibhav Vineet,et al.  Conditional Random Fields as Recurrent Neural Networks , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[5]  Lena Gorelick,et al.  GrabCut in One Cut , 2013, 2013 IEEE International Conference on Computer Vision.

[6]  Kwanghoon Sohn,et al.  Real-time Human Detection based on Personness Estimation , 2015, BMVC.

[7]  Jason Weston,et al.  Gene Selection for Cancer Classification using Support Vector Machines , 2002, Machine Learning.

[8]  Andrew Zisserman,et al.  Symbiotic Segmentation and Part Localization for Fine-Grained Categorization , 2013, 2013 IEEE International Conference on Computer Vision.

[9]  E. Rückert Detecting Pedestrians by Learning Shapelet Features , 2007 .

[10]  N. A. Abdul Rahim,et al.  RGB-H-CbCr skin colour model for human face detection , 2006 .

[11]  Carsten Rother,et al.  DenseCut: Densely Connected CRFs for Realtime GrabCut , 2015, Comput. Graph. Forum.

[12]  Shi-Min Hu,et al.  PoseShop: Human Image Database Construction and Personalized Content Synthesis , 2013, IEEE Transactions on Visualization and Computer Graphics.

[13]  David A. McAllester,et al.  Object Detection with Discriminatively Trained Part Based Models , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[14]  Bernt Schiele,et al.  Ten Years of Pedestrian Detection, What Have We Learned? , 2014, ECCV Workshops.

[15]  Jian Sun,et al.  BoxSup: Exploiting Bounding Boxes to Supervise Convolutional Networks for Semantic Segmentation , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[16]  Cheng-Jian Lin,et al.  Image backlight compensation using neuro-fuzzy networks with immune particle swarm optimization , 2009, Expert Syst. Appl..

[17]  Kwanghoon Sohn,et al.  Non-parametric human segmentation using support vector machine , 2016, 2016 IEEE International Conference on Consumer Electronics (ICCE).

[18]  Ramakant Nevatia,et al.  Detection of multiple, partially occluded humans in a single image by Bayesian combination of edgelet part detectors , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[19]  Sang Uk Lee,et al.  Region-based backlight compensation algorithm for images and videos , 2010, 2010 IEEE International Conference on Image Processing.

[20]  Marie-Pierre Jolly,et al.  Interactive Graph Cuts for Optimal Boundary and Region Segmentation of Objects in N-D Images , 2001, ICCV.

[21]  Shu Wang,et al.  A new edge feature for head-shoulder detection , 2013, 2013 IEEE International Conference on Image Processing.

[22]  Andrew Blake,et al.  "GrabCut" , 2004, ACM Trans. Graph..

[23]  L. R. Dice Measures of the Amount of Ecologic Association Between Species , 1945 .

[24]  Thomas Deselaers,et al.  Measuring the Objectness of Image Windows , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[25]  Pietro Perona,et al.  Microsoft COCO: Common Objects in Context , 2014, ECCV.

[26]  Mario Cifrek,et al.  A brief introduction to OpenCV , 2012, 2012 Proceedings of the 35th International Convention MIPRO.

[27]  Subbarao Kambhampati,et al.  What We Instagram: A First Analysis of Instagram Photo Content and User Types , 2014, ICWSM.

[28]  Vibhav Vineet,et al.  Human Instance Segmentation from Video using Detector-based Conditional Random Fields , 2011, BMVC.

[29]  Chih-Jen Lin,et al.  LIBLINEAR: A Library for Large Linear Classification , 2008, J. Mach. Learn. Res..

[30]  Pascal Fua,et al.  SLIC Superpixels Compared to State-of-the-Art Superpixel Methods , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[31]  Alexei A. Efros,et al.  Photo clip art , 2007, ACM Trans. Graph..

[32]  Richard Szeliski,et al.  Efficient preconditioning of laplacian matrices for computer graphics , 2013, ACM Trans. Graph..

[33]  Seungryong Kim,et al.  Robust Stereo Matching Using Probabilistic Laplacian Surface Propagation , 2014, ACCV.

[34]  C. Lawrence Zitnick,et al.  Structured Forests for Fast Edge Detection , 2013, 2013 IEEE International Conference on Computer Vision.

[35]  Holly Peek The selfie in the digital age: from social media to sexting , 2014 .

[36]  T. Haruk,et al.  Video Camera System Using Fuzzy Logic , 1992, IEEE 1992 International Conference on Consumer Electronics Digest of Technical Papers.

[37]  Minh N. Do,et al.  Fast Global Image Smoothing Based on Weighted Least Squares , 2014, IEEE Transactions on Image Processing.