Seeing Tree Structure from Vibration

Humans recognize object structure from both their appearance and motion; often, motion helps to resolve ambiguities in object structure that arise when we observe object appearance only. There are particular scenarios, however, where neither appearance nor spatial-temporal motion signals are informative: occluding twigs may look connected and have almost identical movements, though they belong to different, possibly disconnected branches. We propose to tackle this problem through spectrum analysis of motion signals, because vibrations of disconnected branches, though visually similar, often have distinctive natural frequencies. We propose a novel formulation of tree structure based on a physics-based link model, and validate its effectiveness by theoretical analysis, numerical simulation, and empirical experiments. With this formulation, we use nonparametric Bayesian inference to reconstruct tree structure from both spectral vibration signals and appearance cues. Our model performs well in recognizing hierarchical tree structure from real-world videos of trees and vessels.

[1]  Donald Geman,et al.  Stochastic relaxation, Gibbs distributions, and the Bayesian restoration of images , 1984 .

[2]  Bunyarit Uyyanonvara,et al.  Blood vessel segmentation methodologies in retinal images - A survey , 2012, Comput. Methods Programs Biomed..

[3]  Ce Liu,et al.  Exploring new representations and applications for motion analysis , 2009 .

[4]  E. Adelson,et al.  Slow and Smooth: A Bayesian theory for the combination of local motion signals in human vision , 1998 .

[5]  Ce Liu,et al.  Towards Longer Long-Range Motion Trajectories , 2012, BMVC.

[6]  Badrinath Roysam,et al.  Novel 4-D Open-Curve Active Contour and curve completion approach for automated tree structure extraction , 2011, CVPR 2011.

[7]  W. Richards,et al.  Perception as Bayesian Inference , 2008 .

[8]  O. Braddick Segmentation versus integration in visual motion processing , 1993, Trends in Neurosciences.

[9]  E. Spelke,et al.  Origins of knowledge. , 1992, Psychological review.

[10]  Sabine Himmel,et al.  Partial Differential Equations For Scientists And Engineers , 2016 .

[11]  D. Knill,et al.  Bayesian sampling in visual perception , 2011, Proceedings of the National Academy of Sciences.

[12]  Norbert Wiener,et al.  Extrapolation, Interpolation, and Smoothing of Stationary Time Series, with Engineering Applications , 1949 .

[13]  Patrick R. Green,et al.  Perception and Motor Control in Birds: An Ecological Approach , 2011 .

[14]  Mei Han,et al.  Efficient hierarchical graph-based video segmentation , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[15]  John F. Canny,et al.  A Computational Approach to Edge Detection , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[16]  Rui Caseiro,et al.  High-Speed Tracking with Kernelized Correlation Filters , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[17]  Takanobu Nishiura,et al.  Detection for Lombard speech with second-order mel-frequency cepstral coefficient and spectral envelope in beginning of talking-speech , 2013 .

[18]  Frédo Durand,et al.  Visual vibrometry: Estimating material properties from small motions in video , 2015, CVPR.

[19]  David J. Fleet,et al.  A Layered Motion Representation with Occlusion and Compact Spatial Support , 2002, ECCV.

[20]  Takeo Kanade,et al.  An Iterative Image Registration Technique with an Application to Stereo Vision , 1981, IJCAI.

[21]  Andrew Blake,et al.  Motion Deblurring and Super-resolution from an Image Sequence , 1996, ECCV.

[22]  Trevor Darrell,et al.  Learning Features by Watching Objects Move , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[23]  N. Haritos,et al.  Mechanical stability of trees under dynamic loads. , 2006, American journal of botany.

[24]  Bolei Zhou,et al.  A Phase Discrepancy Analysis of Object Motion , 2010, ACCV.

[25]  Laura A Miller,et al.  Structural dynamics and resonance in plants with nonlinear stiffness. , 2005, Journal of theoretical biology.

[26]  Edward H. Adelson,et al.  Layered representation for motion analysis , 1993, Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[27]  Mark Rudnicki,et al.  A physics-based link model for tree vibrations. , 2012, American journal of botany.

[28]  Marc M. Van Hulle,et al.  A phase-based approach to the estimation of the optical flow field using spatial filtering , 2002, IEEE Trans. Neural Networks.

[29]  Rangasami L. Kashyap,et al.  Building Skeleton Models via 3-D Medial Surface/Axis Thinning Algorithms , 1994, CVGIP Graph. Model. Image Process..

[30]  Luc Van Gool,et al.  Deep Retinal Image Understanding , 2016, MICCAI.

[31]  Samuel J. Gershman,et al.  Discovering hierarchical motion structure , 2016, Vision Research.

[32]  William T. Freeman,et al.  A computational approach for obstruction-free photography , 2015, ACM Trans. Graph..

[33]  Norbert Wiener,et al.  Extrapolation, Interpolation, and Smoothing of Stationary Time Series , 1964 .

[34]  N Haritos,et al.  Branches and damping on trees in winds , 2014 .

[35]  Takanobu Nishiura,et al.  Suppression of clipping noise in observed speech based on spectral compensation with Gaussian mixture models and reference of clean speech , 2013 .

[36]  Pascal Fua,et al.  Automated Reconstruction of Dendritic and Axonal Trees by Global Optimization with Geometric Priors , 2011, Neuroinformatics.

[37]  Gregory A. Dahle,et al.  Tree Biomechanics Literature Review: Dynamics , 2014, Arboriculture & Urban Forestry.

[38]  Michael Rubinstein,et al.  Analysis and visualization of temporal variations in video , 2014 .

[39]  William T. Freeman,et al.  Estimating the Material Properties of Fabric from Video , 2013, 2013 IEEE International Conference on Computer Vision.

[40]  Frédo Durand,et al.  Eulerian video magnification for revealing subtle changes in the world , 2012, ACM Trans. Graph..

[41]  Deqing Sun,et al.  Local Layering for Joint Motion Estimation and Occlusion Detection , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[42]  Tai Sing Lee,et al.  Hierarchical Bayesian inference in the visual cortex. , 2003, Journal of the Optical Society of America. A, Optics, image science, and vision.

[43]  David J. Fleet,et al.  Computation of component image velocity from local phase information , 1990, International Journal of Computer Vision.

[44]  A. P. French,et al.  Vibrations and Waves , 1971 .

[45]  Edsger W. Dijkstra,et al.  A note on two problems in connexion with graphs , 1959, Numerische Mathematik.

[46]  Pascal Fua,et al.  Reconstructing Curvilinear Networks Using Path Classifiers and Integer Programming , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[47]  Thomas L. Griffiths,et al.  The nested chinese restaurant process and bayesian nonparametric inference of topic hierarchies , 2007, JACM.

[48]  Douglas A. Maguire,et al.  Natural sway frequencies and damping ratios of trees: influence of crown structure , 2005, Trees.

[49]  Michael J. Black,et al.  Layered segmentation and optical flow estimation over time , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.