Go with the Flow: Perception-refined Physics Simulation

For many of the physical phenomena around us, we have developed sophisticated models explaining their behavior. Nevertheless, inferring specifics from visual observations is challenging due to the high number of causally underlying physical parameters -- including material properties and external forces. This paper addresses the problem of inferring such latent physical properties from observations. Our solution is an iterative refinement procedure with simulation at its core. The algorithm gradually updates the physical model parameters by running a simulation of the observed phenomenon and comparing the current simulation to a real-world observation. The physical similarity is computed using an embedding function that maps physically similar examples to nearby points. As a tangible example, we concentrate on flags curling in the wind -- a seemingly simple phenomenon but physically highly involved. Based on its underlying physical model and visual manifestation, we propose an instantiation of the embedding function. For this mapping, modeled as a deep network, we introduce a spectral decomposition layer that decomposes a video volume into its temporal spectral power and corresponding frequencies. In experiments, we demonstrate our method's ability to recover intrinsic and extrinsic physical parameters from both simulated and real-world video.

[1]  Jennifer L Cardona,et al.  Seeing the Wind: Visual Wind Speed Prediction with a Coupled Convolutional and Recurrent Neural Network , 2019, NeurIPS.

[2]  Arnold W. M. Smeulders,et al.  Repetition Estimation , 2018, International Journal of Computer Vision.

[3]  Ke Wang,et al.  Physics-Inspired Garment Recovery from a Single-View Image , 2018, ACM Trans. Graph..

[4]  Jiajun Wu,et al.  Seeing Tree Structure from Vibration , 2018, ECCV.

[5]  Max Welling,et al.  BOCK : Bayesian Optimization with Cylindrical Kernels , 2018, ICML.

[6]  Hans-Peter Seidel,et al.  LIME: Live Intrinsic Material Estimation , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[7]  Tony Lindeberg,et al.  Dense scale selection over space, time and space-time , 2017, SIAM J. Imaging Sci..

[8]  Luca Antiga,et al.  Automatic differentiation in PyTorch , 2017 .

[9]  Ming C. Lin,et al.  Learning-Based Cloth Material Recovery from Video , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[10]  Frédo Durand,et al.  Visual vibrometry: Estimating material properties from small motions in video , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[11]  Lucas Beyer,et al.  In Defense of the Triplet Loss for Person Re-Identification , 2017, ArXiv.

[12]  Serge J. Belongie,et al.  Conditional Similarity Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[13]  Mario Fritz,et al.  To Fall Or Not To Fall: A Visual Approach to Physical Stability Prediction , 2016, ArXiv.

[14]  Ali Farhadi,et al.  "What Happens If..." Learning to Predict the Effect of Forces in Images , 2016, ECCV.

[15]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[16]  Ali Farhadi,et al.  Newtonian Image Understanding: Unfolding the Dynamics of Objects in Static Images , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[17]  Jiajun Wu,et al.  Physics 101: Learning Physical Object Properties from Unlabeled Videos , 2016, BMVC.

[18]  Jiajun Wu,et al.  Galileo: Perceiving Physical Object Properties by Integrating a Physics Engine with Deep Learning , 2015, NIPS.

[19]  Liang Lin,et al.  Deep feature learning with relative distance comparison for person re-identification , 2015, Pattern Recognit..

[20]  Justin G. Chen,et al.  Visual vibrometry: Estimating material properties from small motions in video , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[21]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[22]  Fei-Fei Li,et al.  Large-Scale Video Classification with Convolutional Neural Networks , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[23]  Laurens van der Maaten,et al.  Accelerating t-SNE using tree-based algorithms , 2014, J. Mach. Learn. Res..

[24]  William T. Freeman,et al.  Estimating the Material Properties of Fabric from Video , 2013, 2013 IEEE International Conference on Computer Vision.

[25]  F. Tian Role of mass on the stability of flag/flags in uniform flow , 2013 .

[26]  James F. O'Brien,et al.  Adaptive anisotropic remeshing for cloth simulation , 2012, ACM Trans. Graph..

[27]  Jasper Snoek,et al.  Practical Bayesian Optimization of Machine Learning Algorithms , 2012, NIPS.

[28]  Huamin Wang,et al.  Data-driven elastic models for cloth: modeling and measurement , 2011, ACM Trans. Graph..

[29]  Jun Zhang,et al.  Flapping and Bending Bodies Interacting with Fluid Flows , 2011 .

[30]  Krista A. Ehinger,et al.  SUN database: Large-scale scene recognition from abbey to zoo , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[31]  Hidetomo Sakaino Fluid motion estimation method based on physical properties of waves , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[32]  C. Eloy,et al.  Aeroelastic instability of cantilevered flexible plates in uniform flow , 2008, Journal of Fluid Mechanics.

[33]  D. Forsyth,et al.  Capturing and animating occluded cloth , 2007, SIGGRAPH '07.

[34]  Yann LeCun,et al.  Dimensionality Reduction by Learning an Invariant Mapping , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[35]  Mubarak Shah,et al.  Water video analysis , 2004, 2004 International Conference on Image Processing, 2004. ICIP '04..

[36]  M. Hegarty Mechanical reasoning by mental simulation , 2004, Trends in Cognitive Sciences.

[37]  Fabrice Labeau,et al.  Discrete Time Signal Processing , 2004 .

[38]  Meng Sun,et al.  Video input driven animation (VIDA) , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[39]  Ronald Fedkiw,et al.  Simulation of clothing with folds and wrinkles , 2003, SCA '03.

[40]  Jessica K. Hodgins,et al.  Estimating cloth simulation parameters from video , 2003, SCA '03.

[41]  Andrew P. Witkin,et al.  Untangling cloth , 2003, ACM Trans. Graph..

[42]  Andrew P. Witkin,et al.  Large steps in cloth simulation , 1998, SIGGRAPH.

[43]  David A. Forsyth,et al.  Shading primitives: finding folds and shallow grooves , 1998, Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271).

[44]  Xavier Provot,et al.  Deformation Constraints in a Mass-Spring Model to Describe Rigid Cloth Behavior , 1995 .

[45]  Yann LeCun,et al.  Signature Verification Using A "Siamese" Time Delay Neural Network , 1993, Int. J. Pattern Recognit. Artif. Intell..

[46]  Jakub Wejchert,et al.  Animation aerodynamics , 1991, SIGGRAPH.

[47]  J. Achenbach THE LINEARIZED THEORY OF ELASTICITY , 1973 .

[48]  Sadatoshi Taneda,et al.  Waving Motions of Flags , 1968 .

[49]  G. Batchelor,et al.  An Introduction to Fluid Dynamics , 1968 .

[50]  W. H. F. Barnes The Nature of Explanation , 1944, Nature.