End-to-End Feature Pyramid Network for Real-Time Multi-Person Pose Estimation

In computer vision, pose estimation system is widely used to construct human body transformation. However, it is hard to achieve these targets together: stable real-time speed, variance human number and high accuracy. This paper proposes an end-to-end pose estimation network. It contains a neural network friendly representation of human pose. Then it proposes a correspond real-time end-to-end pose estimation network based on feature pyramid network structure with attention-based detection modules. This network can detect multiple humans in more than 60 fps with 384 x 384 resolution on GTX 1070 with affordable accuracy. This work shows the potential of this network structure can perform both faster and better compared with state-of-the-art results.

[1]  Thomas B. Moeslund,et al.  A Survey of Computer Vision-Based Human Motion Capture , 2001, Comput. Vis. Image Underst..

[2]  Kaiming He,et al.  Mask R-CNN , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[3]  Ali Farhadi,et al.  YOLO9000: Better, Faster, Stronger , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[4]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[5]  Yaser Sheikh,et al.  OpenPose: Realtime Multi-Person 2D Pose Estimation Using Part Affinity Fields , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[6]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[7]  Chris Button,et al.  A Review of Vision-Based Motion Analysis in Sport , 2008, Sports medicine.

[8]  Dieter Schmalstieg,et al.  Global pose estimation using multi-sensor fusion for outdoor Augmented Reality , 2009, 2009 8th IEEE International Symposium on Mixed and Augmented Reality.

[9]  In-So Kweon,et al.  CBAM: Convolutional Block Attention Module , 2018, ECCV.

[10]  Pietro Perona,et al.  Microsoft COCO: Common Objects in Context , 2014, ECCV.

[11]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.