论文信息 - Flexible top-view human pose estimation for detection system via CNN

Flexible top-view human pose estimation for detection system via CNN

We propose the DeepPose-based pose estimation system that is flexible with the change of bounding-box range for top-view images. Our purpose is to link person detection system and pose estimation system. We introduce Bounding-box Curriculum Learning (BCL) and Recurrent Pose Estimation (RPE). BCL is a learning technique of CNN inspired from Curriculum Learning. RPE is a recurrent process of pose estimation that fixes the bounding-box range in response to the estimated results. We show the effect of proposed methods compared to normal learned CNN-based pose estimator on our original top-view dataset.

Yoshimitsu Aoki | Ryuji Go

[1] Kang Zheng,et al. Combining local appearance and holistic view: Dual-Source Deep Neural Networks for human pose estimation , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[2] Ross B. Girshick,et al. Fast R-CNN , 2015, 1504.08083.

[3] Bill Triggs,et al. Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[4] Jonathan Tompson,et al. Joint Training of a Convolutional Network and a Graphical Model for Human Pose Estimation , 2014, NIPS.

[5] Matthijs C. Dorst. Distinctive Image Features from Scale-Invariant Keypoints , 2011 .

[6] G LoweDavid,et al. Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[7] Trevor Darrell,et al. Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation , 2013, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[8] Jason Weston,et al. Curriculum learning , 2009, ICML '09.

[9] Kaiming He,et al. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[10] Christian Szegedy,et al. DeepPose: Human Pose Estimation via Deep Neural Networks , 2013, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[11] Yoram Singer,et al. Adaptive Subgradient Methods for Online Learning and Stochastic Optimization , 2011, J. Mach. Learn. Res..

[12] Andrew Zisserman,et al. Flowing ConvNets for Human Pose Estimation in Videos , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).