Hierarchical Sliding Slice Regression for Vehicle Viewing Angle Estimation

We propose a novel hierarchical sliding slice regression which in a coarse-to-fine manner represents global circular target space with a number of ordinally localized and overlapping subspaces. Our method is particularly suitable for visual regression problems where the regression target is circular (e.g., car viewing angle) and visual similarity inconsistent over the target space (e.g., repetitive appearance). A good application example is the camera-based car viewing angle estimation problem, where visual similarity of different views is highly inconsistent—front and back views and left and right side views are pair-wise similar, but appear at the far ends of the circular view angle space. In practice, the problem is even more complicated due to large visual variation of objects (e.g., different car models). We perform extensive experiments on the Lausanne Federal of Institute of Technology Multi-view Car and KITTI Data Sets as well as the Technische Universitat Darmstadt Multi-view Pedestrians Data Set and achieve superior performance as compared to the state-of-the-art algorithms.

[1]  Zhi-Hua Zhou,et al.  Automatic Age Estimation Based on Facial Aging Patterns , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[2]  Xiaoou Tang,et al.  Object Detection and Viewpoint Estimation with Auto-masking Neural Network , 2014, ECCV.

[3]  Alfred DeMaris,et al.  A Tutorial in Logistic Regression , 1995 .

[4]  Bodo Rosenhahn,et al.  Class Generative Models Based on Feature Regression for Pose Estimation of Object Categories , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[5]  Jonas Fredriksson,et al.  Reliable Vehicle Pose Estimation Using Vision and a Single-Track Model , 2014, IEEE Transactions on Intelligent Transportation Systems.

[6]  Alexandre Heili,et al.  Combined estimation of location and body pose in surveillance video , 2011, 2011 8th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS).

[7]  Kun He,et al.  Parameterizing Object Detectors in the Continuous Pose Space , 2014, ECCV.

[8]  Jörn Ostermann,et al.  Continuous Pose Estimation with a Spatial Ensemble of Fisher Regressors , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[9]  Shaogang Gong,et al.  Human pose estimation using structural support vector machines , 2011, 2011 IEEE International Conference on Computer Vision Workshops (ICCV Workshops).

[10]  Bernt Schiele,et al.  Monocular 3D pose estimation and tracking by detection , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[11]  Luc Van Gool,et al.  Random Forests for Real Time 3D Face Analysis , 2012, International Journal of Computer Vision.

[12]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[13]  R. Kashyap A Bayesian comparison of different classes of dynamic models using empirical data , 1977 .

[14]  Jindong Tan,et al.  Recognition of Car Makes and Models From a Single Traffic-Camera Image , 2015, IEEE Transactions on Intelligent Transportation Systems.

[15]  Roman Rosipal,et al.  Kernel Partial Least Squares Regression in Reproducing Kernel Hilbert Space , 2002, J. Mach. Learn. Res..

[16]  Rama Chellappa,et al.  Growing Regression Tree Forests by Classification for Continuous Object Pose Estimation , 2017, International Journal of Computer Vision.

[17]  Ahmed M. Elgammal,et al.  Joint Object and Pose Recognition Using Homeomorphic Manifold Analysis , 2013, AAAI.

[18]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[19]  Andreas Geiger,et al.  Are we ready for autonomous driving? The KITTI vision benchmark suite , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[20]  Justus H. Piater,et al.  Multiview feature distributions for object detection and continuous pose estimation , 2014, Comput. Vis. Image Underst..

[21]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[22]  Luc Van Gool,et al.  Body Parts Dependent Joint Regressors for Human Pose Estimation in Still Images , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[23]  Bin Dai,et al.  Likelihood-Field-Model-Based Dynamic Vehicle Detection and Tracking for Self-Driving , 2016, IEEE Transactions on Intelligent Transportation Systems.

[24]  Geoffrey E. Hinton,et al.  Visualizing Data using t-SNE , 2008 .

[25]  P. Fua,et al.  Pose estimation for category specific multiview object localization , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[26]  Chi Fang,et al.  Head Pose Estimation Based on Random Forests for Multiclass Classification , 2010, 2010 20th International Conference on Pattern Recognition.

[27]  Shuicheng Yan,et al.  Age Estimation via Grouping and Decision Fusion , 2015, IEEE Transactions on Information Forensics and Security.

[28]  Rama Chellappa,et al.  Growing Regression Forests by Classification: Applications to Object Pose Estimation , 2013, ECCV.

[29]  Tinne Tuytelaars,et al.  All together now: Simultaneous Detection and Continuous Pose Estimation using a Hough Forest with Probabilistic Locally Enhanced Voting , 2014, BMVC.

[30]  Ahmed M. Elgammal,et al.  Regression from local features for viewpoint and pose estimation , 2011, 2011 International Conference on Computer Vision.

[31]  Luc Van Gool,et al.  Real-time facial feature detection using conditional regression forests , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[32]  M.M. Trivedi,et al.  Image based estimation of pedestrian orientation for improving path prediction , 2008, 2008 IEEE Intelligent Vehicles Symposium.

[33]  Vijayan K. Asari,et al.  A Two-Layer Framework for Piecewise Linear Manifold-Based Head Pose Estimation , 2012, International Journal of Computer Vision.

[34]  Rita Cucchiara,et al.  People Orientation Recognition by Mixtures of Wrapped Distributions on Random Trees , 2012, ECCV.

[35]  Pietro Perona,et al.  Self-Tuning Spectral Clustering , 2004, NIPS.

[36]  Jayan Eledath,et al.  Collision sensing by stereo vision and radar sensor fusion , 2008 .

[37]  Jörn Ostermann,et al.  Embedding Geometry in Generative Models for Pose Estimation of Object Categories , 2014, BMVC.

[38]  G. Schwarz Estimating the Dimension of a Model , 1978 .

[39]  Rafael Muñoz-Salinas,et al.  Multi-camera head pose estimation , 2012, Machine Vision and Applications.

[40]  Ali Farhadi,et al.  You Only Look Once: Unified, Real-Time Object Detection , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[41]  Shaogang Gong,et al.  Head Pose Classification in Crowded Scenes , 2009, BMVC.

[42]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[43]  Jenn-Jier James Lien,et al.  Automatic Vehicle Detection Using Local Features—A Statistical Approach , 2008, IEEE Transactions on Intelligent Transportation Systems.

[44]  Fadi Dornaika,et al.  An Efficient Approach to Onboard Stereo Vision System Pose Estimation , 2008, IEEE Transactions on Intelligent Transportation Systems.