Task-based control of articulated human pose detection for OpenVL

Human pose detection is the foundation for many applications, particularly those using gestures as part of a natural user interface. We introduce a novel task-based control method for human pose detection, encoding specialist knowledge in a descriptive abstraction for application by non-experts, such as developers, artists and students. The abstraction hides the details of a set of algorithms which specialise either in different estimations of pose (e.g. articulated, body part) or under different conditions (e.g. occlusion, clutter). Users describe the conditions of their problem, which is used to select the most suitable algorithm (and automatically set up the parameters). The task-based control is evaluated with images described using the abstraction. Expected outcomes are compared to results and demonstrate that describing the conditions is sufficient to allow the abstraction to produce the required result.

[1]  David A. McAllester,et al.  Object Detection with Discriminatively Trained Part Based Models , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[2]  Giorgio Panin,et al.  Model-based Visual Tracking: The OpenTL Framework , 2011 .

[3]  Alan L. Yuille,et al.  An Approach to Pose-Based Action Recognition , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[4]  Rama Chellappa,et al.  Computationally Efficient Regression on a Dependency Graph for Human Pose Estimation , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[5]  Marinette Revenu,et al.  Borg: A Knowledge-Based System for Automatic Generation of Image Processing Programs , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[6]  Paul Hudak,et al.  FVision: A Declarative Language for Visual Tracking , 2001, PADL.

[7]  G. Miller,et al.  Developer-friendly segmentation using OpenVL, a high-level task-based abstraction , 2013, 2013 1st IEEE Workshop on User-Centered Computer Vision (UCCV).

[8]  Gary R. Bradski,et al.  Learning OpenCV - computer vision with the OpenCV library: software that sees , 2008 .

[9]  Francesc Moreno-Noguer,et al.  A Joint Model for 2D and 3D Pose Estimation from a Single Image , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[10]  Andrew Zisserman,et al.  Progressive search space reduction for human pose estimation , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[11]  Takashi Matsuyama,et al.  SIGMA: A Framework for Image Understanding - Integration of Bottom-Up and Top-Down Analysis , 1985, IJCAI.

[12]  Song-Chun Zhu,et al.  Integrating Grammar and Segmentation for Human Pose Estimation , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[13]  Gregor Miller,et al.  User oriented language model for face detection , 2011, 2011 IEEE Workshop on Person-Oriented Vision.

[14]  Deva Ramanan,et al.  Learning to parse images of articulated bodies , 2006, NIPS.

[15]  Joe L. Mundy,et al.  The development of the Image Understanding Environment , 1994, 1994 Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[16]  Tae-Kyun Kim,et al.  Unconstrained Monocular 3D Human Pose Estimation by Action Detection and Cross-Modality Regression Forest , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[17]  Deva Ramanan,et al.  Face detection, pose estimation, and landmark localization in the wild , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[18]  Sidney S. Fels,et al.  OpenVL: A task-based abstraction for developer-friendly computer vision , 2013, 2013 IEEE Workshop on Applications of Computer Vision (WACV).

[19]  Subhransu Maji,et al.  Action recognition from a distributed representation of pose and appearance , 2011, CVPR 2011.

[20]  Andrew Zisserman,et al.  Human Pose Estimation Using a Joint Pixel-wise and Part-wise Formulation , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[21]  Konstantinos Konstantinides,et al.  The Khoros software development environment for image and signal processing , 1994, IEEE Trans. Image Process..

[22]  Andrew W. Fitzgibbon,et al.  Real-time human pose recognition in parts from single depth images , 2011, CVPR 2011.

[23]  Mark Everingham,et al.  Clustered Pose and Nonlinear Appearance Models for Human Pose Estimation , 2010, BMVC.

[24]  Vittorio Ferrari,et al.  Human Pose Co-Estimation and Applications , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[25]  Bernt Schiele,et al.  Multi-view Pictorial Structures for 3D Human Pose Estimation , 2013, BMVC.

[26]  Yi Yang,et al.  Articulated pose estimation with flexible mixtures-of-parts , 2011, CVPR 2011.

[27]  Luc Van Gool,et al.  Real time head pose estimation with random regression forests , 2011, CVPR 2011.

[28]  Ramesh Raskar,et al.  Computer vision on tap , 2009, 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.