DensePose From WiFi

Advances in computer vision and machine learning techniques have led to significant development in 2D and 3D human pose estimation from RGB cameras, LiDAR, and radars. However, human pose estimation from images is adversely affected by occlusion and lighting, which are common in many scenarios of interest. Radar and LiDAR technologies, on the other hand, need specialized hardware that is expensive and power-intensive. Furthermore, placing these sensors in non-public areas raises significant privacy concerns. To address these limitations, recent research has explored the use of WiFi antennas (1D sensors) for body segmentation and key-point body detection. This paper further expands on the use of the WiFi signal in combination with deep learning architectures, commonly used in computer vision, to estimate dense human pose correspondence. We developed a deep neural network that maps the phase and amplitude of WiFi signals to UV coordinates within 24 human regions. The results of the study reveal that our model can estimate the dense pose of multiple subjects, with comparable performance to image-based approaches, by utilizing WiFi signals as the only input. This paves the way for low-cost, broadly accessible, and privacy-preserving algorithms for human sensing.

[1]  Zhenan Sun,et al.  Learning 3D Human Shape and Pose From Dense Body Parts , 2019, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[2]  Andrea Vedaldi,et al.  DensePose 3D: Lifting Canonical Surface Maps of Articulated Objects to the Third Dimension , 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).

[3]  Chenglin Miao,et al.  mmMesh: towards 3D real-time dynamic human mesh construction using millimeter-wave , 2021, MobiSys.

[4]  Michael J. Black,et al.  Monocular, One-stage, Regression of Multiple 3D People , 2020, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).

[5]  Yaser Sheikh,et al.  OpenPose: Realtime Multi-Person 2D Pose Estimation Using Part Affinity Fields , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[6]  Andrea Vedaldi,et al.  Continuous Surface Embeddings , 2020, NeurIPS.

[7]  Fu Xiao,et al.  Deep Spatial–Temporal Model Based Cross-Scene Action Recognition Using Commodity WiFi , 2020, IEEE Internet of Things Journal.

[8]  Hassan Aghaeinia,et al.  Wi2Vi: Generating Video Frames From WiFi CSI Samples , 2019, IEEE Sensors Journal.

[9]  Michael J. Black,et al.  VIBE: Video Inference for Human Body Pose and Shape Estimation , 2019, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[10]  Raffaello D'Andrea,et al.  Ultra-Wideband Angle of Arrival Estimation Based on Angle-Dependent Antenna Transfer Function , 2019, Sensors.

[11]  Song-Chun Zhu,et al.  DenseRaC: Joint 3D Pose and Shape Estimation by Dense Render-and-Compare , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[12]  Hao Li,et al.  PIFu: Pixel-Aligned Implicit Function for High-Resolution Clothed Human Digitization , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[13]  Fei Wang,et al.  Temporal Unet: Sample Level Human Action Recognition using WiFi , 2019, ArXiv.

[14]  Marcus A. Magnor,et al.  Tex2Shape: Detailed Full Human Body Geometry From a Single Image , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[15]  Fei Wang,et al.  Person-in-WiFi: Fine-Grained Person Perception Using WiFi , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[16]  Zheng Fang,et al.  DenseBody: Directly Regressing Dense 3D Human Pose and Shape From a Single Color Image , 2019, ArXiv.

[17]  Kamin Whitehouse,et al.  Multipath Triangulation: Decimeter-level WiFi Localization and Orientation with a Single Unaided Receiver , 2018, MobiSys.

[18]  Antonio Torralba,et al.  Through-Wall Human Pose Estimation Using Radio Signals , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[19]  Yichen Wei,et al.  Simple Baselines for Human Pose Estimation and Tracking , 2018, ECCV.

[20]  Ali Farhadi,et al.  YOLOv3: An Incremental Improvement , 2018, ArXiv.

[21]  Iasonas Kokkinos,et al.  DensePose: Dense Human Pose Estimation in the Wild , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[22]  Bernt Schiele,et al.  PoseTrack: A Benchmark for Human Pose Estimation and Tracking , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[23]  Lujia Wang,et al.  Characterization of a RS-LiDAR for 3D Perception , 2017, 2018 IEEE 8th Annual International Conference on CYBER Technology in Automation, Control, and Intelligent Systems (CYBER).

[24]  Yun Lei,et al.  For Better CSI Fingerprinting Based Localization: A Novel Phase Sanitization Method and a Distance Metric , 2017, 2017 IEEE 85th Vehicular Technology Conference (VTC Spring).

[25]  Kaiming He,et al.  Feature Pyramid Networks for Object Detection , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[26]  Ahmed El Khadimi,et al.  Survey on indoor localization system and recent advances of WIFI fingerprinting technique , 2016, 2016 5th International Conference on Multimedia Computing and Systems (ICMCS).

[27]  Alexei A. Efros,et al.  Learning Dense Correspondence via 3D-Guided Cycle Consistency , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[28]  Varun Ramakrishna,et al.  Convolutional Pose Machines , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[29]  Frédo Durand,et al.  Capturing the human figure through a wall , 2015, ACM Trans. Graph..

[30]  Sebastian Scherer,et al.  VoxNet: A 3D Convolutional Neural Network for real-time object recognition , 2015, 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[31]  Kaiming He,et al.  Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[32]  Simon Lucey,et al.  Dense Semantic Correspondence Where Every Pixel is a Classifier , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[33]  Trevor Darrell,et al.  Fully Convolutional Networks for Semantic Segmentation , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[34]  Pietro Perona,et al.  Microsoft COCO: Common Objects in Context , 2014, ECCV.

[35]  Rob Miller,et al.  3D Tracking via Body Radio Reflections , 2014, NSDI.

[36]  Tom Minka,et al.  You are facing the Mona Lisa: spot localization using PHY layer information , 2012, MobiSys '12.