论文信息 - Learning Class-Specific Segmentation

Learning Class-Specific Segmentation

This report details the work undertaken this year towards class-specific segmentation. The aim is to take an image known to contain an object of a particular class, and return for each pixel a figure-ground segmentation value. A training corpus consisting of images and their ground-truth segmentation masks is used to learn shape and appearance models. Our shape model consists of local shape patches learned using a new translationallyinvariant clustering algorithm, together with learned adjacency statistics applied to enforce consistency between neighbouring patches. Our appearance model is a database of patches. Given a novel test image, hypotheses of underlying shape and appearance are constructed, and a final beliefpropagation algorithm enforces global consistency.

Jamie Shotton | J. Shotton

[1] John F. Canny,et al. A Computational Approach to Edge Detection , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[2] Tony Lindeberg,et al. Shape-adapted smoothing in estimation of 3-D shape cues from affine deformations of local 2-D brightness structure , 1997, Image Vis. Comput..

[3] Pietro Perona,et al. Recognition of planar object classes , 1996, Proceedings CVPR IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[4] Bernt Schiele,et al. Scale-Invariant Object Categorization Using a Scale-Adaptive Mean-Shift Search , 2004, DAGM-Symposium.

[5] Patrick Pérez,et al. Interactive Image Segmentation Using an Adaptive GMMRF Model , 2004, ECCV.

[6] Luc Van Gool,et al. Affine/ Photometric Invariants for Planar Intensity Patterns , 1996, ECCV.

[7] David G. Stork,et al. Pattern Classification , 1973 .

[8] Pietro Perona,et al. Unsupervised Learning of Models for Recognition , 2000, ECCV.

[9] Andrew Blake,et al. Probabilistic Tracking with Exemplars in a Metric Space , 2002, International Journal of Computer Vision.

[10] Cordelia Schmid,et al. A performance evaluation of local descriptors , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11] Björn Stenger,et al. Learning a Kinematic Prior for Tree-Based Filtering , 2003, BMVC.

[12] Shimon Ullman,et al. Combining Top-Down and Bottom-Up Segmentation , 2004, 2004 Conference on Computer Vision and Pattern Recognition Workshop.

[13] Luc Van Gool,et al. Edinburgh Research Explorer Simultaneous Object Recognition and Segmentation by Image Exploration , 2022 .

[14] Michel Vidal-Naquet,et al. A Fragment-Based Approach to Object Representation and Classification , 2001, IWVF.

[15] Olga Veksler,et al. Fast approximate energy minimization via graph cuts , 2001, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[16] Cordelia Schmid,et al. 3D object modeling and recognition using affine-invariant patches and multi-view spatial constraints , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[17] Adam Baumberg,et al. Reliable feature matching across widely separated views , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[18] B. Schiele,et al. Interleaved Object Categorization and Segmentation , 2003, BMVC.

[19] Jitendra Malik,et al. Contour and Texture Analysis for Image Segmentation , 2001, International Journal of Computer Vision.

[20] Daniel P. Huttenlocher,et al. Efficient Belief Propagation for Early Vision , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[21] William T. Freeman,et al. Correctness of Belief Propagation in Gaussian Graphical Models of Arbitrary Topology , 1999, Neural Computation.

[22] Luc Van Gool,et al. Wide Baseline Stereo Matching based on Local, Affinely Invariant Regions , 2000, BMVC.

[23] S. Beucher,et al. Watersheds of functions and picture segmentation , 1982, ICASSP.

[24] B. Schiele,et al. Combined Object Categorization and Segmentation With an Implicit Shape Model , 2004 .

[25] Cordelia Schmid,et al. Shape recognition with edge-based features , 2003, BMVC.

[26] Tony Lindeberg,et al. Scale-Space Theory in Computer Vision , 1993, Lecture Notes in Computer Science.

[27] Tony Lindeberg,et al. Shape-Adapted Smoothing in Estimation of 3-D Depth Cues from Affine Distortions of Local 2-D Brightness Structure , 1994, ECCV.

[28] Pietro Perona,et al. A Visual Category Filter for Google Images , 2004, ECCV.

[29] B. Frey,et al. Transformation-Invariant Clustering Using the EM Algorithm , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[30] Antonio Torralba,et al. Context-based vision system for place and object recognition , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[31] Michael Brady,et al. An analysis of the Scale Saliency algorithm , 2003 .

[32] Roberto Cipolla,et al. Likelihood Models For Template Matching using the PDF Projection Theorem , 2004, BMVC.

[33] Andrew Zisserman,et al. Multi-view Matching for Unordered Image Sets, or "How Do I Organize My Holiday Snaps?" , 2002, ECCV.

[34] Brendan J. Frey,et al. Epitomic analysis of appearance and shape , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[35] Michael Brady,et al. Saliency, Scale and Image Description , 2001, International Journal of Computer Vision.

[36] Michael I. Jordan,et al. An Introduction to Variational Methods for Graphical Models , 1999, Machine Learning.

[37] Shimon Ullman,et al. Learning to Segment , 2004, ECCV.

[38] Christopher G. Harris,et al. A Combined Corner and Edge Detector , 1988, Alvey Vision Conference.

[39] Heekuck Oh,et al. Neural Networks for Pattern Recognition , 1993, Adv. Comput..

[40] Cordelia Schmid,et al. An Affine Invariant Interest Point Detector , 2002, ECCV.

[41] Takeo Kanade,et al. Object Detection Using the Statistics of Parts , 2004, International Journal of Computer Vision.

[42] Wolfgang Förstner,et al. A Framework for Low Level Feature Extraction , 1994, ECCV.

[43] Carlo Tomasi,et al. Alpha estimation in natural images , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[44] Marie-Pierre Jolly,et al. Interactive graph cuts for optimal boundary & region segmentation of objects in N-D images , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[45] William T. Freeman,et al. Understanding belief propagation and its generalizations , 2003 .

[46] R. Gregory. Eye and Brain: The Psychology of Seeing , 1966 .

[47] Björn Stenger,et al. Shape context and chamfer matching in cluttered scenes , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[48] D. Rubin,et al. Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[49] Pietro Perona,et al. A Probabilistic Approach to Object Recognition Using Local Photometry and Global Geometry , 1998, ECCV.

[50] Pietro Perona,et al. A Bayesian approach to unsupervised one-shot learning of object categories , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[51] Judea Pearl,et al. Probabilistic reasoning in intelligent systems , 1988 .

[52] Andrew Zisserman,et al. Video Google: a text retrieval approach to object matching in videos , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[53] M. Tribus,et al. Probability theory: the logic of science , 2003 .

[54] Pietro Perona,et al. Object class recognition by unsupervised scale-invariant learning , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[55] Donald Geman,et al. Stochastic Relaxation, Gibbs Distributions, and the Bayesian Restoration of Images , 1984, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[56] William T. Freeman,et al. Learning Low-Level Vision , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[57] Michael I. Jordan,et al. Loopy Belief Propagation for Approximate Inference: An Empirical Study , 1999, UAI.

[58] Andrew Blake,et al. Gaze manipulation for one-to-one teleconferencing , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[59] J. Aloimonos. Shape from texture , 1988, Biological cybernetics.

[60] S. C. Sahasrabudhe,et al. A fresh look at the Hough transform , 1996, Pattern Recognit. Lett..

[61] Jitendra Malik,et al. Normalized cuts and image segmentation , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[62] Shigeo Abe DrEng. Pattern Classification , 2001, Springer London.

[63] Ivan Laptev,et al. On Space-Time Interest Points , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[64] Pedro F. Felzenszwalb. Object recognition with pictorial structures , 2001 .

[65] Marie-Pierre Jolly,et al. Interactive Graph Cuts for Optimal Boundary and Region Segmentation of Objects in N-D Images , 2001, ICCV.

[66] David G. Lowe,et al. Object recognition from local scale-invariant features , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[67] D. Scharstein,et al. A Taxonomy and Evaluation of Dense Two-Frame Stereo Correspondence Algorithms , 2001, Proceedings IEEE Workshop on Stereo and Multi-Baseline Vision (SMBV 2001).

[68] W. Freeman,et al. Generalized Belief Propagation , 2000, NIPS.

[69] Dan Roth,et al. Learning a Sparse Representation for Object Detection , 2002, ECCV.

[70] G LoweDavid,et al. Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[71] David J. C. MacKay,et al. Information Theory, Inference, and Learning Algorithms , 2004, IEEE Transactions on Information Theory.

[72] Cordelia Schmid,et al. Local Grayvalue Invariants for Image Retrieval , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[73] Andrew Blake,et al. "GrabCut" , 2004, ACM Trans. Graph..

[74] Zhuowen Tu,et al. Image Parsing: Unifying Segmentation, Detection, and Recognition , 2005, International Journal of Computer Vision.

[75] Andrew W. Fitzgibbon,et al. On Affine Invariant Clustering and Automatic Cast Listing in Movies , 2002, ECCV.

[76] Andrew Zisserman,et al. Automated Scene Matching in Movies , 2002, CIVR.

[77] Andrew Zisserman,et al. An Affine Invariant Salient Region Detector , 2004, ECCV.

[78] David Salesin,et al. A Bayesian approach to digital matting , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[79] Shimon Ullman,et al. Class-Specific, Top-Down Segmentation , 2002, ECCV.

[80] Cordelia Schmid,et al. Indexing Based on Scale Invariant Interest Points , 2001, ICCV.

[81] Andrew Blake,et al. Shape from Texture: Estimation, Isotropy and Moments , 1990, Artif. Intell..

[82] Jiri Matas,et al. Robust wide-baseline stereo from maximally stable extremal regions , 2004, Image Vis. Comput..