Learning spatiotemporal T-junctions for occlusion detection

The goal of motion segmentation and layer extraction can be viewed as the detection and localization of occluding surfaces. A feature that has been shown to be a particularly strong indicator of occlusion, in both computer vision and neuroscience, is the T-junction; however, little progress has been made in T-junction detection. One reason for this is the difficulty in distinguishing false T-junctions (i.e. those not on an occluding edge) and real T-junctions in cluttered images. In addition to this, their photometric profile alone is not enough for reliable detection. This paper overcomes the first problem by searching for T-junctions not in space, but in space-time. This removes many false T-junctions and creates a simpler image structure to explore. The second problem is mitigated by learning the appearance of T-junctions in these spatiotemporal images. An RVM T-junction classifier is learnt from hand-labelled data using SIFT to capture its redundancy. This detector is then demonstrated in a novel occlusion detector that fuses Canny edges and T-junctions in the spatiotemporal domain to detect occluding edges in the spatial domain.

[1]  Edward H. Adelson,et al.  Spatio-temporal segmentation of video data , 1994, Electronic Imaging.

[2]  Song-Chun Zhu,et al.  Prior Learning and Gibbs Reaction-Diffusion , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[3]  Michael J. Black,et al.  Robust dynamic motion estimation over time , 1991, Proceedings. 1991 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[4]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[5]  Eero P. Simoncelli,et al.  Steerable wedge filters for local orientation analysis , 1996, IEEE Trans. Image Process..

[6]  Robert C. Bolles,et al.  Epipolar-plane image analysis: An approach to determining structure from motion , 1987, International Journal of Computer Vision.

[7]  Michal Irani,et al.  Multi-frame optical flow estimation using subspace constraints , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[8]  Michael E. Tipping The Relevance Vector Machine , 1999, NIPS.

[9]  Ivan Laptev,et al.  Local spatio-temporal image features for motion interpretation , 2004 .

[10]  David J. Fleet,et al.  Probabilistic detection and tracking of motion discontinuities , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[11]  Berthold K. P. Horn,et al.  Determining Optical Flow , 1981, Other Conferences.

[12]  E H Adelson,et al.  Spatiotemporal energy models for the perception of motion. , 1985, Journal of the Optical Society of America. A, Optics and image science.

[13]  Michal Irani,et al.  Computing occluding and transparent motions , 1994, International Journal of Computer Vision.

[14]  Chuan Yi Tang,et al.  A 2.|E|-Bit Distributed Algorithm for the Directed Euler Trail Problem , 1993, Inf. Process. Lett..

[15]  Stefano Soatto,et al.  On exploiting occlusions in multiple-view geometry , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[16]  D J Field,et al.  Relations between the statistics of natural images and the response properties of cortical cells. , 1987, Journal of the Optical Society of America. A, Optics and image science.

[17]  Robert C. Bolles,et al.  Generalizing Epipolar-Plane Image Analysis on the spatiotemporal surface , 2004, International Journal of Computer Vision.

[18]  P. Anandan,et al.  A computational framework and an algorithm for the measurement of visual motion , 1987, International Journal of Computer Vision.

[19]  Sourabh A. Niyogi,et al.  Detecting kinetic occlusion , 1995, Proceedings of IEEE International Conference on Computer Vision.

[20]  Li Du,et al.  Edge Detection at Junctions , 1989, Alvey Vision Conference.

[21]  Edward H. Adelson,et al.  The Design and Use of Steerable Filters , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[22]  Jon Louis Bentley,et al.  Data Structures for Range Searching , 1979, CSUR.

[23]  David J. Beymer,et al.  Finding junctions using the image gradient , 1991, Proceedings. 1991 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[24]  Edward H. Adelson,et al.  Analyzing and recognizing walking figures in XYT , 1994, 1994 Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[25]  Robert C. Bolles,et al.  3DPO: A Three- Dimensional Part Orientation System , 1986, IJCAI.

[26]  Peter Eisert,et al.  Extension of epipolar image analysis to circular camera movements , 2003, Proceedings 2003 International Conference on Image Processing (Cat. No.03CH37429).

[27]  Roberto Cipolla,et al.  The Applications of Uncalibrated Occlusion Junctions , 1999, BMVC.

[28]  E. Adelson,et al.  Analyzing gait with spatiotemporal surfaces , 1994, Proceedings of 1994 IEEE Workshop on Motion of Non-rigid and Articulated Objects.

[29]  William Bialek,et al.  Statistics of Natural Images: Scaling in the Woods , 1993, NIPS.

[30]  Pietro Perona Steerable-scalable kernels for edge detection and junction analysis , 1992, Image Vis. Comput..

[31]  Sourabh A. Niyogi,et al.  Spatiotemporal junction analysis for motion boundary detection , 1995, Proceedings., International Conference on Image Processing.

[32]  Richard Szeliski,et al.  Extracting layers and analyzing their specular properties using epipolar-plane-image analysis , 2005, Comput. Vis. Image Underst..

[33]  Laxmi Parida,et al.  Junctions: Detection, Classification, and Reconstruction , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[34]  G. J. Burton,et al.  Color and spatial structure in natural scenes. , 1987, Applied optics.

[35]  Hans-Hellmut Nagel,et al.  Displacement vectors derived from second-order intensity variations in image sequences , 1983, Comput. Vis. Graph. Image Process..