Count on Me: Learning to Count on a Single Image

Individuating and locating repetitive patterns in still images is a fundamental task in image processing, typically achieved by means of correlation strategies. In this paper, we provide a solid solution to this task using a differential geometry approach, operating on Lie algebra, and exploiting a mixture of templates. The proposed method asks the user to locate a few instances of the target patterns (seeds) that become visual templates used to explore the image. We propose an iterative algorithm to locate patches similar to the seeds working in three steps: first, clustering the detected patches to generate templates of different classes, then looking for the affine transformations, living on a Lie algebra that best links the templates and the detected patches, and finally detecting new patches with a convolutional strategy. The process ends when no new patches are found. We will show how our method is able to process heterogeneous unstructured images with multiple visual motifs and extremely crowded scenarios with high precision and recall, outperforming all the state-of-the-art methods.

[1]  Andrew Zisserman,et al.  Learning To Count Objects in Images , 2010, NIPS.

[2]  David G. Lowe,et al.  Object recognition from local scale-invariant features , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[3]  Delbert Dueck,et al.  Clustering by Passing Messages Between Data Points , 2007, Science.

[4]  Jan-Michael Frahm,et al.  Detecting Large Repetitive Structures with Salient Boundaries , 2010, ECCV.

[5]  Kyoung Mu Lee,et al.  Visual tracking via geometric particle filtering on the affine group with optimal importance functions , 2009, CVPR.

[6]  Yanxi Liu,et al.  A Lattice-Based MRF Model for Dynamic Near-Regular Texture Tracking , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[7]  Visvanathan Ramesh,et al.  Fast Crowd Segmentation Using Shape Indexing , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[8]  Sridha Sridharan,et al.  Least squares congealing for unsupervised alignment of images , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[9]  George Baciu,et al.  Detection of repetitive patterns in near regular texture images , 2011, 2011 IEEE 10th IVMSP Workshop: Perception and Visual Signal Analysis.

[10]  A. Marana,et al.  Estimation of crowd density using image processing , 1997 .

[11]  Haroon Idrees,et al.  Multi-source Multi-scale Counting in Extremely Dense Crowd Images , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[12]  Charless C. Fowlkes,et al.  Do We Need More Training Data or Better Models for Object Detection? , 2012, BMVC.

[13]  Fatih Murat Porikli,et al.  Learning on Manifolds , 2010, SSPR/SPR.

[14]  Francesco Setti,et al.  Semantically-driven automatic creation of training sets for object recognition , 2015, Comput. Vis. Image Underst..

[15]  Andrew Zisserman,et al.  Geometric Grouping of Repeated Elements within Images , 1999, Shape, Contour and Grouping in Computer Vision.

[16]  Sridha Sridharan,et al.  Crowd Counting Using Multiple Local Features , 2009, 2009 Digital Image Computing: Techniques and Applications.

[17]  Tommy W. S. Chow,et al.  A neural-based crowd estimation by hybrid global learning algorithm , 1999, IEEE Trans. Syst. Man Cybern. Part B.

[18]  Alexei A. Efros,et al.  Discovering Texture Regularity as a Higher-Order Correspondence Problem , 2006, ECCV.

[19]  Nuno Vasconcelos,et al.  Privacy preserving crowd monitoring: Counting people without people models or tracking , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[20]  David A. McAllester,et al.  Object Detection with Discriminatively Trained Part Based Models , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[21]  Andrea Vedaldi,et al.  Vlfeat: an open and portable library of computer vision algorithms , 2010, ACM Multimedia.

[22]  Hai Tao,et al.  A Viewpoint Invariant Approach for Crowd Counting , 2006, 18th International Conference on Pattern Recognition (ICPR'06).

[23]  Pushmeet Kohli,et al.  On Detection of Multiple Object Instances Using Hough Transforms , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[24]  Ramakant Nevatia,et al.  Segmentation of multiple, partially occluded objects by grouping, merging, assigning part detection responses , 2008, CVPR.

[25]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[26]  Song-Chun Zhu,et al.  What are Textons? , 2005 .

[27]  Luc Van Gool,et al.  The Pascal Visual Object Classes (VOC) Challenge , 2010, International Journal of Computer Vision.

[28]  Xiaoming Liu,et al.  Automatic facial landmark labeling with minimal supervision , 2009, CVPR.

[29]  Roland Siegwart,et al.  Exploiting Repetitive Object Patterns for Model Compression and Completion , 2010, ECCV.

[30]  Andrew Zisserman,et al.  Interactive Object Counting , 2014, ECCV.

[31]  Bernt Schiele,et al.  Robust Object Detection with Interleaved Categorization and Segmentation , 2008, International Journal of Computer Vision.

[32]  George Baciu,et al.  Higher level segmentation: Detecting and grouping of invariant repetitive patterns , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[33]  Yanxi Liu,et al.  Deformed Lattice Detection in Real-World Images Using Mean-Shift Belief Propagation , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[34]  Jitendra Malik,et al.  Detecting, localizing and grouping repeated scene elements from an image , 1996, ECCV.

[35]  F. Park,et al.  Visual tracking via particle filtering on the affine group , 2008, ICIA 2008.

[36]  George Baciu,et al.  Detecting, Grouping, and Structure Inference for Invariant Repetitive Patterns in Images , 2013, IEEE Transactions on Image Processing.

[37]  Nicu Sebe,et al.  The S-HOCK dataset: Analyzing crowds at the stadium , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[38]  Erik G. Learned-Miller,et al.  Data driven image models through continuous joint alignment , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[39]  Stefano Soatto,et al.  A Complexity-Distortion Approach to Joint Pattern Alignment , 2006, NIPS.

[40]  B. Hall Lie Groups, Lie Algebras, and Representations: An Elementary Introduction , 2004 .