AN ANNOTATED DATA SET FOR POSE ESTIMATION OF SWIMMERS

In this work we present an annotated data set for two– dimensional pose estimation of swimmers. The data set contains fifteen cycles of swimmers swimming backstroke with more than 1200 annotated video frames. A wide variety of subjects was used to create this data set, ranging from adult to teenage swimmers, both, male and female. For each frame of a cycle, the absolute positions of fourteen points corresponding to human joints were manually labeled. The data set proves to be very challenging with respect to partial occlusions and high amounts of background noise, however, it does not contain any out–of–plane motions that would further complicate the task of full body pose estimation. It thus aims at pose estimation and pose tracking algorithms trying to advance the field of recovering human poses in videos with frequently missing parts and under difficult conditions. We explain in detail the creation of the data set, discuss the difficulties we faced, and finally demonstrate how it is used to create a training data set containing normalized cycles for action–specific pose tracking. Keywords— data set, pose estimation, pose tracking, human motion, swimmers

[1]  Michael J. Black,et al.  Learning and Tracking Cyclic Human Motion , 2000, NIPS.

[2]  Michael J. Black,et al.  HumanEva: Synchronized Video and Motion Capture Dataset for Evaluation of Articulated Human Motion , 2006 .

[3]  Deva Ramanan,et al.  Learning to parse images of articulated bodies , 2006, NIPS.

[4]  Xavier Varona,et al.  Action Spaces for Efficient Bayesian Tracking of Human Motion , 2006, 18th International Conference on Pattern Recognition (ICPR'06).

[5]  Philip H. S. Torr,et al.  Randomized trees for human pose detection , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[6]  Andrew Zisserman,et al.  Progressive search space reduction for human pose estimation , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[7]  Yihong Gong,et al.  Discriminative learning of visual words for 3D human pose estimation , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[8]  Bernt Schiele,et al.  Pictorial structures revisited: People detection and articulated pose estimation , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[9]  Rainer Lienhart,et al.  Automatic pose initialization of swimmers in videos , 2010, Electronic Imaging.

[10]  Rainer Lienhart,et al.  A kinematic model for Bayesian tracking of cyclic human motion , 2010, Electronic Imaging.