Descriptor Matching for a Discrete Spherical Image With a Convolutional Neural Network

In this paper, we propose a method of extracting feature descriptors from discrete spherical images using convolutional neural networks (CNNs). First, a captured full-view image is mapped to a discrete spherical image. Second, the features-from-accelerated-segment test algorithm is used to extract feature points in the discrete spherical image. Finally, an unsupervised CNN is used to obtain the descriptors of patches around each feature point. In the experiments, we compare these descriptors’ performance to the closest existing state-of-the-art feature descriptors of discrete spherical images, spherical oriented FAST and rotated BRIEF (SPHORB), for image pairs having different camera rotation, noise levels, and general motions. The experimental results demonstrate that our proposed CNN-based discrete spherical image feature descriptors clearly outperform SPHORB both in accuracy and robustness.

[1]  Juan Song,et al.  Multimodal Gesture Recognition Using 3-D Convolution and Convolutional LSTM , 2017, IEEE Access.

[2]  Shigang Li,et al.  Discrete spherical Harris corner detector , 2016, 2016 IEEE International Conference on Robotics and Biomimetics (ROBIO).

[3]  Yinda Zhang,et al.  PanoContext: A Whole-Room 3D Context Model for Panoramic Scene Understanding , 2014, ECCV.

[4]  Shin-Jye Lee,et al.  Image Classification Based on the Boost Convolutional Neural Network , 2018, IEEE Access.

[5]  Kristen Grauman,et al.  Making 360° Video Watchable in 2D: Learning Videography for Click Free Viewing , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[6]  Berthold K. P. Horn Extended Gaussian images , 1984, Proceedings of the IEEE.

[7]  Krista A. Ehinger,et al.  Recognizing scene viewpoint using panoramic place representation , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[8]  Rongrong Ni,et al.  Facial Expression Recognition Using Weighted Mixture Deep Neural Network Based on Double-Channel Facial Images , 2018, IEEE Access.

[9]  Thomas Brox,et al.  Descriptor Matching with Convolutional Neural Networks: a Comparison to SIFT , 2014, ArXiv.

[10]  Hao Guan,et al.  BRISKS: Binary Features for Spherical Images on a Geodesic Grid , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[11]  Shigang Li Spherical gradient operator , 2013 .

[12]  Bolei Zhou,et al.  Learning Deep Features for Scene Recognition using Places Database , 2014, NIPS.

[13]  Thomas Brox,et al.  Discriminative Unsupervised Feature Learning with Exemplar Convolutional Neural Networks , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[14]  Andrew Zisserman,et al.  Convolutional Two-Stream Network Fusion for Video Action Recognition , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[15]  Xuebin Qin,et al.  Finding scale-invariant corner feature from full-view image based on discrete spherical model , 2012, 2012 International Conference on Systems and Informatics (ICSAI2012).

[16]  Shigang Li,et al.  A Full-View Spherical Image Format , 2010, 2010 20th International Conference on Pattern Recognition.

[17]  Sung Wook Baik,et al.  Action Recognition in Video Sequences using Deep Bi-Directional LSTM With CNN Features , 2018, IEEE Access.

[18]  Kristen Grauman,et al.  Flat2Sphere: Learning Spherical Convolution for Fast Features from 360° Imagery , 2017, NIPS 2017.

[19]  Katsushi Ikeuchi,et al.  Generating an interpretation tree from a CAD model for 3D-object recognition in bin-picking tasks , 1987, International Journal of Computer Vision.

[20]  Avinash C. Kak,et al.  A robot vision system for recognizing 3D objects in low-order polynomial time , 1989, IEEE Trans. Syst. Man Cybern..

[21]  Ming-Yu Liu,et al.  Deep 360 Pilot: Learning a Deep Agent for Piloting through 360° Sports Videos , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[22]  Wei Feng,et al.  SPHORB: A Fast and Robust Binary Feature on the Sphere , 2014, International Journal of Computer Vision.

[23]  Shigang Li,et al.  Discrete Spherical Laplacian Operator , 2016, IEICE Trans. Inf. Syst..

[24]  Ming-Hsuan Yang,et al.  Semantic-driven Generation of Hyperlapse from 360° Video , 2017, ArXiv.

[25]  Ming-Hsuan Yang,et al.  Semantic-driven Generation of Hyperlapse from $360^\circ$ Video , 2017 .

[26]  Trevor Darrell,et al.  Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation , 2013, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[27]  Isao Nakanishi,et al.  Spherical FAST corner detector , 2015, 2015 IEEE International Conference on Mechatronics and Automation (ICMA).

[28]  Kaiming He,et al.  Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[29]  Isao Nakanishi,et al.  Computing optical flow from bio-inspired spherical retina , 2014, 2014 IEEE International Conference on Mechatronics and Automation.