Novel On-Road Vehicle Detection System Using Multi-Stage Convolutional Neural Network

General object-detection methods based on deep learning have received considerable attention in the field of computer vision. However, when they are applied to vehicle detection (VD) in a straightforward manner to realize an intelligent vehicle (IV), a graphics processing unit (GPU) is required for their real-time implementation. The use of GPUs is unacceptable in commercial VD systems. A novel on-road VD method comprising the use of a multi-stage convolutional neural network (MSCNN) is proposed to solve this problem. In the MSCNN, the properties of the vehicles are exploited, and an efficient region proposal specialized for vehicles is developed. The proposed MSCNN comprises four stages: vehicle lower-boundary detection, vehicle upper-boundary detection, region proposal network (RPN), and vehicle classification. Effective anchor boxes are generated in the lower- and upper-boundary-detection stages with appropriate sizes for vehicles of all scales. The bounding box of the vehicle within the anchor box is determined in the RPN stage. In the last stage, the predicted bounding boxes are classified as vehicles or non-vehicles. Finally, the proposed method is applied to the KITTI, CrowdAI, and AUTTI datasets, and its advantages are demonstrated by comparing its performance with those presented in previous studies. The proposed MSCNN realizes an average precision (72.1%) on the KITTI dataset while running on a central processing unit (CPU).