Learning Optical Flow Using Deep Dilated Residual Networks

Nowadays, convolutional neural networks achieve remarkable performance on optical flow estimation because of its strong non-linear fitting ability. Most of them adopt the U-Net architecture, which contains an encoder part and a decoder part. In the encoder part, the resolution of the feature map is reduced with the deepening of the network layer. In the decoder part, the feature map is enlarged by the deconvolution layer to recover the estimated flow as full resolution. However, the motion details are usually lost with the contracting and expanding operations. Moreover, learning methods, especially supervised networks, always ignore the advantages of many well-proven constraints used in the variational model. In this paper, we introduce a novel architecture named dilated residual networks for learning optical flow, which can avoid the loss of details of the U-Net architecture and can directly learn the residual functions rather than the unreferenced functions to enhance the learning ability of the network. Furthermore, inspired by variational methods, the traditional prior assumptions, such as brightness constancy, gradient constancy, and smoothness assumption, are used in the supervised network as extra auxiliary terms to guide the training of network. Our method is tested on several benchmarks, such as MPI-Sintel, KITTI2012, and KITTI2015. The experimental results show that the dilated residual network is suitable for dense optical flow estimation due to the capability of preserving motion details and can boost the accuracy of optical flow estimation.

[1]  Thomas Brox,et al.  FlowNet: Learning Optical Flow with Convolutional Networks , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[2]  Xiaoou Tang,et al.  LiteFlowNet: A Lightweight Convolutional Neural Network for Optical Flow Estimation , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[3]  Wee Hong Ong,et al.  Object detection via convolutional neural network , 2018 .

[4]  Daniel Cremers,et al.  Anisotropic Huber-L1 Optical Flow , 2009, BMVC.

[5]  Jianjun Lei,et al.  Color-Guided Depth Map Super Resolution Using Convolutional Neural Network , 2017, IEEE Access.

[6]  Stefan Roth,et al.  UnFlow: Unsupervised Learning of Optical Flow with a Bidirectional Census Loss , 2017, AAAI.

[7]  Yi Yang,et al.  Occlusion Aware Unsupervised Learning of Optical Flow , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[8]  Konstantinos G. Derpanis,et al.  Back to Basics: Unsupervised Learning of Optical Flow via Brightness Constancy and Motion Smoothness , 2016, ECCV Workshops.

[9]  Andreas Geiger,et al.  Object scene flow for autonomous vehicles , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[10]  Andrew Zisserman,et al.  Spatial Transformer Networks , 2015, NIPS.

[11]  Wuzhen Shi,et al.  Single image super-resolution with dilated convolution based multi-scale information learning inception module , 2017, 2017 IEEE International Conference on Image Processing (ICIP).

[12]  Andreas Geiger,et al.  Are we ready for autonomous driving? The KITTI vision benchmark suite , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[13]  Trevor Darrell,et al.  Caffe: Convolutional Architecture for Fast Feature Embedding , 2014, ACM Multimedia.

[14]  Iasonas Kokkinos,et al.  DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[15]  Thomas Brox,et al.  FlowNet 2.0: Evolution of Optical Flow Estimation with Deep Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[16]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[17]  Michael J. Black,et al.  Secrets of optical flow estimation and their principles , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[18]  Bo Du,et al.  Robust Graph-Based Semisupervised Learning for Noisy Labeled Data via Maximum Correntropy Criterion , 2019, IEEE Transactions on Cybernetics.

[19]  Michael J. Black,et al.  Optical Flow Estimation Using a Spatial Pyramid Network , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[20]  Sergey Ioffe,et al.  Rethinking the Inception Architecture for Computer Vision , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[21]  George Papandreou,et al.  Rethinking Atrous Convolution for Semantic Image Segmentation , 2017, ArXiv.

[22]  Francesc Moreno-Noguer,et al.  Joint coarse-and-fine reasoning for deep optical flow , 2017, 2017 IEEE International Conference on Image Processing (ICIP).

[23]  Xuezhi Xiang,et al.  Deep Optical Flow Supervised Learning With Prior Assumptions , 2018, IEEE Access.

[24]  Jitendra Malik,et al.  Large Displacement Optical Flow: Descriptor Matching in Variational Motion Estimation , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[25]  Wei Xiong,et al.  Stacked Convolutional Denoising Auto-Encoders for Feature Representation , 2017, IEEE Transactions on Cybernetics.

[26]  Nengcheng Chen,et al.  GDMN: Group Decision-Making Network for Person Re-Identification , 2018, IEEE Access.

[27]  Xuelong Li,et al.  VCF: Velocity Correlation Filter, Towards Space-Borne Satellite Video Tracking , 2018, 2018 IEEE International Conference on Multimedia and Expo (ICME).

[28]  Michael J. Black,et al.  Optical Flow Estimation with Channel Constancy , 2014, ECCV.

[29]  Thomas Brox,et al.  High Accuracy Optical Flow Estimation Based on a Theory for Warping , 2004, ECCV.

[30]  Ioannis Patras,et al.  Unsupervised convolutional neural networks for motion estimation , 2016, 2016 IEEE International Conference on Image Processing (ICIP).

[31]  Michael J. Black,et al.  A Naturalistic Open Source Movie for Optical Flow Evaluation , 2012, ECCV.

[32]  Kilian Q. Weinberger,et al.  Densely Connected Convolutional Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[33]  Didier Stricker,et al.  Flow Fields: Dense Correspondence Fields for Highly Accurate Large Displacement Optical Flow Estimation , 2019, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[34]  Gang Wang,et al.  Recurrent Spatial Pyramid CNN for Optical Flow Estimation , 2018, IEEE Transactions on Multimedia.

[35]  Berthold K. P. Horn,et al.  Determining Optical Flow , 1981, Other Conferences.

[36]  Aidong Men,et al.  G-CNN: Object Detection via Grid Convolutional Neural Network , 2017, IEEE Access.

[37]  Bo Du,et al.  Saliency-Guided Unsupervised Feature Learning for Scene Classification , 2015, IEEE Transactions on Geoscience and Remote Sensing.

[38]  Yi Zhu,et al.  DenseNet for dense flow , 2017, 2017 IEEE International Conference on Image Processing (ICIP).

[39]  Yi Li,et al.  Deformable Convolutional Networks , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[40]  Wei Liu,et al.  SSD: Single Shot MultiBox Detector , 2015, ECCV.

[41]  Thomas A. Funkhouser,et al.  Dilated Residual Networks , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[42]  Bingbing Ni,et al.  Unsupervised Deep Learning for Optical Flow Estimation , 2017, AAAI.