Weakly Supervised Deep Learning Method for Vulnerable Road User Detection in FMCW Radar

Millimeter-wave radar is currently the most effective automotive sensor capable of all-weather perception. In order to detect Vulnerable Road Users (VRUs) in cluttered radar data, it is necessary to model the time-frequency signal patterns of human motion, i.e. the micro-Doppler signature. In this paper we propose a spatio-temporal Convolutional Neural Network (CNN) capable of detecting VRUs in cluttered radar data. The main contribution is a weakly supervised training method which uses abundant, automatically generated labels from camera and lidar for training the model. The input to the network is a tensor of temporally concatenated range-azimuth-Doppler arrays, while the ground truth is an occupancy grid formed by objects detected jointly in-camera images and lidar. Lidar provides accurate ranging ground truth, while camera information helps distinguish between VRUs and background. Experimental evaluation shows that the CNN model has superior detection performance compared to classical techniques. Moreover, the model trained with imperfect, weak supervision labels outperforms the one trained with a limited number of perfect, hand-annotated labels. Finally, the proposed method has excellent scalability due to the low cost of automatic annotation.

[1]  Sergey Ioffe,et al.  Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.

[2]  Kaiming He,et al.  Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[3]  Phillipp Meister,et al.  Statistical Signal Processing Detection Estimation And Time Series Analysis , 2016 .

[4]  Hermann Rohling,et al.  Pedestrian recognition in automotive radar sensors , 2013, 2013 14th International Radar Symposium (IRS).

[5]  Thomas Brox,et al.  U-Net: Convolutional Networks for Biomedical Image Segmentation , 2015, MICCAI.

[6]  H. Rohling,et al.  SOME RADAR TOPICS: WAVEFORM DESIGN, RANGE CFAR AND TARGET RECOGNITION , 2006 .

[7]  Amin Ansari,et al.  Vehicle Detection With Automotive Radar Using Deep Learning on Range-Azimuth-Doppler Tensors , 2019, 2019 IEEE/CVF International Conference on Computer Vision Workshop (ICCVW).

[8]  Jürgen Dickmann,et al.  Semantic radar grids , 2017, 2017 IEEE Intelligent Vehicles Symposium (IV).

[9]  Dit-Yan Yeung,et al.  Convolutional LSTM Network: A Machine Learning Approach for Precipitation Nowcasting , 2015, NIPS.

[10]  Wilfried Philips,et al.  Semantically aware multilateral filter for depth upsampling in automotive LiDAR point clouds , 2017, 2017 IEEE Intelligent Vehicles Symposium (IV).

[11]  Jürgen Dickmann,et al.  Comparison of random forest and long short-term memory network performances in classification tasks using radar , 2017, 2017 Sensor Data Fusion: Trends, Solutions, Applications (SDF).

[12]  Dariu M. Gavrila,et al.  CNN Based Road User Detection Using the 3D Radar Cube , 2020, IEEE Robotics and Automation Letters.

[13]  Raja Giryes,et al.  Deep Radar Detector , 2019, 2019 IEEE Radar Conference (RadarConf).

[14]  Sebastian Thrun,et al.  Learning Occupancy Grid Maps with Forward Sensor Models , 2003, Auton. Robots.

[15]  Haopeng Li,et al.  Object Detection and 3d Estimation Via an FMCW Radar Using a Fully Convolutional Network , 2020, ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[16]  Nils Appenrodt,et al.  "Automotive radar the key technology for autonomous driving: From detection and ranging to environmental understanding" , 2016, 2016 IEEE Radar Conference (RadarConf).

[17]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.