Motion fields to predict play evolution in dynamic sport scenes

Videos of multi-player team sports provide a challenging domain for dynamic scene analysis. Player actions and interactions are complex as they are driven by many factors, such as the short-term goals of the individual player, the overall team strategy, the rules of the sport, and the current context of the game. We show that constrained multi-agent events can be analyzed and even predicted from video. Such analysis requires estimating the global movements of all players in the scene at any time, and is needed for modeling and predicting how the multi-agent play evolves over time on the field. To this end, we propose a novel approach to detect the locations of where the play evolution will proceed, e.g. where interesting events will occur, by tracking player positions and movements over time. We start by extracting the ground level sparse movement of players in each time-step, and then generate a dense motion field. Using this field we detect locations where the motion converges, implying positions towards which the play is evolving. We evaluate our approach by analyzing videos of a variety of complex soccer plays.

[1]  Ramesh C. Jain,et al.  Computerized Flow Field Analysis: Oriented Texture Fields , 1992, IEEE Trans. Pattern Anal. Mach. Intell..

[2]  Yizong Cheng,et al.  Mean Shift, Mode Seeking, and Clustering , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[3]  Isabelle Herlin,et al.  Optical Flow and Phase Portrait Methods for Environmental Satellite Image Sequences , 1996, ECCV.

[4]  Claudio S. Pinhanez,et al.  Intelligent Studios Modeling Space and Action to Control TV Cameras , 1997, Appl. Artif. Intell..

[5]  Richard K. Beatson,et al.  Surface interpolation with radial basis functions for medical imaging , 1997, IEEE Transactions on Medical Imaging.

[6]  James F. O'Brien,et al.  Shape transformation using variational implicit functions , 1999, SIGGRAPH Courses.

[7]  Marcelo Gattass,et al.  Automatic Camera Calibration for Image Sequences of a Football Match , 2001, ICAPR.

[8]  Cor J. Veenman,et al.  Resolving Motion Correspondence for Densely Moving Points , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[9]  Santiago V. Lombeyda,et al.  Discrete multiscale vector field decomposition , 2003, ACM Trans. Graph..

[10]  Ian D. Reid,et al.  Single View Metrology , 2000, International Journal of Computer Vision.

[11]  Patrick Pérez,et al.  Extraction of Singular Points from Dense Motion Fields: An Analytic Approach , 2003, Journal of Mathematical Imaging and Vision.

[12]  Dar-Shyang Lee,et al.  Effective Gaussian mixture learning for video background subtraction , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[13]  Mubarak Shah,et al.  A Multiview Approach to Tracking People in Crowded Scenes Using a Planar Homography Constraint , 2006, ECCV.

[14]  Yasuo Ariki,et al.  Automatic Production System of Soccer Sports Video by Digital Camera Work Based on Situation Recognition , 2006, Eighth IEEE International Symposium on Multimedia (ISM'06).

[15]  Larry S. Davis,et al.  Multi-camera Tracking and Segmentation of Occluded People on Ground Plane Using Search-Guided Particle Filtering , 2006, ECCV.

[16]  Yael Moses,et al.  Homography based multiple camera detection and tracking of people in a dense crowd , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[17]  Martin D. Buhmann,et al.  Radial Basis Functions: Theory and Implementations: Preface , 2003 .

[18]  Fanglin Chen,et al.  A Novel Algorithm for Detecting Singular Points from Fingerprint Images , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.