Gravity as a Reference for Estimating a Person’s Height From Video

Estimating the metric height of a person from monocular imagery without additional assumptions is ill-posed. Existing solutions require manual calibration of the ground plane and camera geometry, special cameras, or reference objects of known size. We focus on motion cues and exploit Earth's gravity as an omnipresent reference 'object' to translate acceleration, and subsequently height, from image pixels to meters. As input, we require videos of motion in which gravity is the only external force. This requirement differs from those of existing solutions that recover a person's height, so our method opens up new application fields. We show theoretically and empirically that a simple motion trajectory analysis suffices to translate pixel measurements into the person's metric height, reaching a mean absolute error (MAE) of up to 3.9 cm on jumping motions, and that this works without camera or ground plane calibration.
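The core idea of using gravity as a scale reference can be illustrated with a minimal sketch. It is not the paper's implementation and rests on simplifying assumptions that are ours: the tracked point (for example, the center of mass) is in free fall, the motion is roughly parallel to the image plane, the image's vertical axis is aligned with gravity, and the frame rate is known. The function names, the value of `G`, and the synthetic track are hypothetical and chosen only for illustration.

```python
import numpy as np

G = 9.81  # m/s^2, assumed gravitational acceleration


def meters_per_pixel(y_px, fps):
    """Estimate the image scale (m/px) from a free-fall track.

    y_px: vertical pixel coordinates of the tracked point, one per frame.
    fps:  frame rate of the video in frames per second.
    """
    t = np.arange(len(y_px)) / fps
    # Fit y(t) = c2*t^2 + c1*t + c0; under free fall, 2*c2 is the
    # acceleration in px/s^2.
    c2, _c1, _c0 = np.polyfit(t, y_px, 2)
    accel_px = 2.0 * c2
    # The same acceleration is G in metric units, so their ratio is the scale.
    return G / abs(accel_px)


def metric_height(height_px, y_px, fps):
    """Convert a height measured in pixels to meters using the gravity scale."""
    return height_px * meters_per_pixel(y_px, fps)


# Synthetic, noiseless jump: 0.005 m per pixel, 60 fps (illustrative values).
fps = 60.0
true_scale = 0.005
t = np.arange(30) / fps
y_m = 1.0 + 2.5 * t - 0.5 * G * t**2   # height above ground in meters
y_px = 400.0 - y_m / true_scale        # image y axis points downward

est = metric_height(350.0, y_px, fps)  # person spans 350 px in the image
print(f"estimated height: {est:.2f} m")
```

On real footage the track would come from a pose or center-of-mass estimate and would be noisy, so a robust fit over many frames would likely be needed; the sketch only shows how a pixel-space acceleration fixes the pixel-to-meter scale.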
