mmMesh: towards 3D real-time dynamic human mesh construction using millimeter-wave

In this paper, we present mmMesh, the first real-time 3D human mesh estimation system using commercial portable millimeter-wave devices. mmMesh is built upon a novel deep learning framework that can dynamically locate the moving subject and capture his/her body shape and pose by analyzing the 3D point cloud generated from the mmWave signals that bounce off the human body. The proposed deep learning framework addresses a series of challenges. First, it encodes a 3D human body model, which enables mmMesh to estimate complex and realistic-looking 3D human meshes from sparse point clouds. Second, it can accurately align the 3D points with their corresponding body segments despite the influence of ambient points as well as the error-prone nature and the multi-path effect of the RF signals. Third, the proposed model can infer missing body parts from the information of the previous frames. Our evaluation results on a commercial mmWave sensing testbed show that our mmMesh system can accurately localize the vertices on the human mesh with an average error of 2.47 cm. The superior experimental results demonstrate the effectiveness of our proposed human mesh construction system.

[1]  H. M. Finn,et al.  Adaptive detection mode with threshold control as a function of spatially sampled clutter level estimates , 1968 .

[2]  Marcus A. Magnor,et al.  Video Based Reconstruction of 3D People Models , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[3]  Masaki Saito,et al.  Normal Estimation for Accurate 3D Mesh Reconstruction with Point Cloud Model Incorporating Spatial Structure , 2019, CVPR Workshops.

[4]  Prasant Mohapatra,et al.  mmSense: Multi-Person Detection and Identification via mmWave Sensing , 2019, mmNets@MobiCom.

[5]  Zhengxiong Li,et al.  VocalPrint: exploring a resilient and secure voice authentication via mmWave biometric interrogation , 2020, SenSys.

[6]  Meng Jin,et al.  mmVib: micrometer-level vibration measurement with mmwave radar , 2020, MobiCom.

[7]  Christian Theobalt,et al.  MonoPerfCap , 2017, ACM Trans. Graph..

[8]  Liu Wu,et al.  Human Mesh Recovery From Monocular Images via a Skeleton-Disentangled Representation , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[9]  Sen Wang,et al.  3D Human Shape Reconstruction from a Polarization Image , 2020, ECCV.

[10]  Dina Katabi,et al.  Through-Wall Human Mesh Recovery Using Radio Signals , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[11]  Frédo Durand,et al.  Capturing the human figure through a wall , 2015, ACM Trans. Graph..

[12]  Leonidas J. Guibas,et al.  PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[13]  Jin Zhang,et al.  Capturing Human Pose Using mmWave Radar , 2020, 2020 IEEE International Conference on Pervasive Computing and Communications Workshops (PerCom Workshops).

[14]  Yohan Dupuis,et al.  A Study of Vicon System Positioning Performance , 2017, Sensors.

[15]  Jianfei Cai,et al.  Skeleton-Aware 3D Human Shape Reconstruction From Point Clouds , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[16]  Karly A. Smith,et al.  Gesture Recognition Using mm-Wave Sensor for Human-Car Interface , 2018, IEEE Sensors Letters.

[17]  Fei Wang,et al.  Person-in-WiFi: Fine-Grained Person Perception Using WiFi , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[18]  Chenglin Miao,et al.  Towards 3D human pose construction using wifi , 2020, MobiCom.

[19]  Peter V. Gehler,et al.  Unite the People: Closing the Loop Between 3D and 2D Human Representations , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[20]  Siyang Cao,et al.  Real-Time Human Motion Behavior Detection via CNN Using mmWave Radar , 2019, IEEE Sensors Letters.

[21]  Tao Yu,et al.  BodyFusion: Real-Time Capture of Human Motion and Surface Geometry Using a Single Depth Camera , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[22]  Wei Wang,et al.  Real-time Arm Gesture Recognition in Smart Home Scenarios via Millimeter Wave Sensing , 2020, Proc. ACM Interact. Mob. Wearable Ubiquitous Technol..

[23]  Xiaowei Zhou,et al.  Learning to Estimate 3D Human Pose and Shape from a Single Color Image , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[24]  Alexander J. Smola,et al.  Deep Sets , 2017, 1703.06114.

[25]  Gerard Pons-Moll,et al.  Implicit Functions in Feature Space for 3D Shape Reconstruction and Completion , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[26]  Christian Theobalt,et al.  LiveCap , 2018, ACM Trans. Graph..

[27]  Yi Zhou,et al.  On the Continuity of Rotation Representations in Neural Networks , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[28]  Yong Wang,et al.  Remote Monitoring of Human Vital Signs Based on 77-GHz mm-Wave FMCW Radar , 2020, Sensors.

[29]  Parth H. Pathak,et al.  Human tracking and activity monitoring using 60 GHz mmWave: poster abstract , 2016, IPSN 2016.

[30]  Peter V. Gehler,et al.  Keep It SMPL: Automatic Estimation of 3D Human Pose and Shape from a Single Image , 2016, ECCV.

[31]  Feng Jin,et al.  mm-Pose: Real-Time Human Skeletal Posture Estimation Using mmWave Radars and CNNs , 2020, IEEE Sensors Journal.

[32]  Jitendra Malik,et al.  End-to-End Recovery of Human Shape and Pose , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[33]  Jian Liu,et al.  Demo: Hands-Free Human Activity Recognition Using Millimeter-Wave Sensors , 2019, 2019 IEEE International Symposium on Dynamic Spectrum Access Networks (DySPAN).

[34]  Fei Wang,et al.  Can WiFi Estimate Person Pose? , 2019, ArXiv.

[35]  Guofeng Zhang,et al.  Sequential 3D Human Pose and Shape Estimation From Point Clouds , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[36]  Xiaonan Guo,et al.  MU-ID: Multi-user Identification Through Gaits Using Millimeter Wave Radios , 2020, IEEE INFOCOM 2020 - IEEE Conference on Computer Communications.

[37]  Leonidas J. Guibas,et al.  PointNet++: Deep Hierarchical Feature Learning on Point Sets in a Metric Space , 2017, NIPS.

[38]  Siyang Cao,et al.  NLP based Skeletal Pose Estimation using mmWave Radar Point-Cloud: A Simulation Approach , 2020, 2020 IEEE Radar Conference (RadarConf20).

[39]  George Shaker,et al.  Remote Monitoring of Human Vital Signs Using mm-Wave FMCW Radar , 2019, IEEE Access.

[40]  Zhengxiong Li,et al.  WaveEar: Exploring a mmWave-based Noise-resistant Speech Sensing for Voice-User Interface , 2019, MobiSys.

[41]  Qingmin Liao,et al.  Estimating Human Shape Under Clothing from Single Frontal View Point Cloud of a Dressed Human , 2019, 2019 IEEE International Conference on Image Processing (ICIP).

[42]  Jitendra Malik,et al.  Predicting 3D Human Dynamics From Video , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[43]  Michael J. Black,et al.  SMPL: A Skinned Multi-Person Linear Model , 2023 .

[44]  Jitendra Malik,et al.  Learning 3D Human Dynamics From Video , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[45]  Suraj Jog,et al.  Through Fog High-Resolution Imaging Using Millimeter Wave Radar , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[46]  Marcus A. Magnor,et al.  Detailed Human Avatars from Monocular Video , 2018, 2018 International Conference on 3D Vision (3DV).

[47]  Antonio Torralba,et al.  Through-Wall Human Pose Estimation Using Radio Signals , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.