Pedestrian Occlusion Level Classification using Keypoint Detection and 2D Body Surface Area Estimation

Effective and reliable pedestrian detection is among the most safety-critical features of semi-autonomous and autonomous vehicles. One of the most complex detection challenges is partial occlusion, where a target object is only partially visible to the sensor because another foreground object obstructs it. Several current pedestrian detection benchmarks annotate partial occlusion so that algorithm performance can be assessed in these scenarios; however, the benchmarks vary greatly in how they define the occurrence and severity of occlusion. In addition, current occlusion level annotation methods depend heavily on the subjective judgment of the human annotator. As a result, the reported detection performance of an algorithm for partially occluded pedestrians can be inaccurate or inconsistent, depending on which benchmark is used. This research presents a novel, objective method of pedestrian occlusion level classification for ground truth annotation. Occlusion level classification is achieved by identifying visible pedestrian keypoints and applying a novel method of 2D body surface area estimation. Experimental results demonstrate that the proposed method reflects the pixel-wise occlusion level of pedestrians in images and is effective for all forms of occlusion, including challenging edge cases such as self-occlusion, truncation and inter-occluding pedestrians.
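The idea of combining keypoint visibility with a 2D surface area estimate can be illustrated with a minimal sketch. Everything specific in it is an assumption made for illustration and is not taken from the paper: the body part names, the keypoint pairs that define each part, the per-part area percentages, and the classification thresholds are all hypothetical. A body part is counted as visible only when both of its defining keypoints are visible, and the occluded percentage is the remaining share of the 2D body area.

```python
from typing import Dict, Set, Tuple

# Hypothetical share of the 2D body area per part, in percent (sums to 100).
# These are illustrative values, not the figures used in the paper.
PART_AREA_PERCENT: Dict[str, int] = {
    "head": 10,
    "torso": 30,
    "left_arm": 10,
    "right_arm": 10,
    "left_leg": 20,
    "right_leg": 20,
}

# Hypothetical keypoint pairs that must both be visible for a part to count.
PART_KEYPOINTS: Dict[str, Tuple[str, str]] = {
    "head": ("nose", "neck"),
    "torso": ("neck", "mid_hip"),
    "left_arm": ("left_shoulder", "left_wrist"),
    "right_arm": ("right_shoulder", "right_wrist"),
    "left_leg": ("left_hip", "left_ankle"),
    "right_leg": ("right_hip", "right_ankle"),
}


def occlusion_percent(visible_keypoints: Set[str]) -> int:
    """Return the estimated occluded share of the 2D body area, in percent."""
    visible_area = sum(
        area
        for part, area in PART_AREA_PERCENT.items()
        if all(kp in visible_keypoints for kp in PART_KEYPOINTS[part])
    )
    return 100 - visible_area


def occlusion_level(percent: int, partial: int = 10, heavy: int = 35) -> str:
    """Map an occluded percentage to a level; thresholds are assumed values."""
    if percent < partial:
        return "none"
    if percent < heavy:
        return "partial"
    return "heavy"


if __name__ == "__main__":
    # A pedestrian whose legs are hidden behind a parked car (made-up input).
    upper_body = {
        "nose", "neck", "mid_hip",
        "left_shoulder", "left_wrist",
        "right_shoulder", "right_wrist",
    }
    pct = occlusion_percent(upper_body)
    print(pct, occlusion_level(pct))
```

In practice the set of visible keypoints would come from a keypoint detector, and the area shares and thresholds would need to be chosen to match the benchmark definition being targeted.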
