Onboard Detection-Tracking-Localization

This article investigates long-term positioning of moving objects by monocular vision of a miniature fixed-wing unmanned aerial vehicle. It is challenging to perform a real-time onboard vision processing task, due to the strict payload capacity and power budget limitations of microflying vehicles. We propose a parallel onboard architecture that explicitly decouples the long-term positioning task into iteratively operated detection, tracking, and localization. The proposed approach is eventually called onboard detection-tracking-localization, namely oDTL. The detector automatically extracts and identifies the object from image frames captured at in-flight durations. A learning-based network is constructed to improve detection accuracy and robustness against ever-changing outdoor illumination conditions and flying viewpoints. The tracker follows the object within specified region-of-interest from frame to frame with lower computing consumption. To further reduce target-losing rate, a concept of blind zone is proposed and applied, and its boundaries in sequential images are also theoretically inferred. The position estimator maps the flying vehicle pose, the image coordinates, and calibration specifications into real-world positions of the moving target. An extended Kalman filter is developed for rough position estimation, and a smooth module is introduced for the refinement of the position. Three offline comparative experiments and three online experiments have been conducted respectively to testify the real-time capability of our approach. The collected experimental results also demonstrate the feasible accuracy and robustness of the overall solution within the specified flying onboard scenarios.

[1]  Dumitru Erhan,et al.  Going deeper with convolutions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[2]  Hyondong Oh,et al.  Communication-aware convoy following guidance for UAVs in a complex urban environment , 2016, 2016 24th Mediterranean Conference on Control and Automation (MED).

[3]  J.K. Hedrick,et al.  A mode-switching path planner for UAV-assisted search and rescue , 2005, Proceedings of the 44th IEEE Conference on Decision and Control.

[4]  Fang Deng,et al.  Vision-Based Target Three-Dimensional Geolocation Using Unmanned Aerial Vehicles , 2018, IEEE Transactions on Industrial Electronics.

[5]  Xi Liu,et al.  Motion-Estimation-Based Visual Servoing of Nonholonomic Mobile Robots , 2011, IEEE Transactions on Robotics.

[6]  Shuicheng Yan,et al.  Robust Object Tracking with Online Multi-lifespan Dictionary Learning , 2013, 2013 IEEE International Conference on Computer Vision.

[7]  Zdenek Kalal,et al.  Tracking-Learning-Detection , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  Pascual Campoy Cervera,et al.  Computer vision based general object following for GPS-denied multirotor unmanned vehicles , 2014, 2014 American Control Conference.

[9]  Yong Liu,et al.  Robust visual SLAM with point and line features , 2017, 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[10]  Eric W. Frew,et al.  Cooperative Stand-off Tracking of Moving Targets by a Team of Autonomous Aircraft , 2005 .

[11]  Alberto Del Bimbo,et al.  Object Tracking by Oversampling Local Features , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[12]  W. Cleveland,et al.  Locally Weighted Regression: An Approach to Regression Analysis by Local Fitting , 1988 .

[13]  Ali Farhadi,et al.  You Only Look Once: Unified, Real-Time Object Detection , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[14]  Timothy W. McLain,et al.  Vision-based Target Geo-location using a Fixed-wing Miniature Air Vehicle , 2006, J. Intell. Robotic Syst..

[15]  François Chollet,et al.  Xception: Deep Learning with Depthwise Separable Convolutions , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[16]  W. Cleveland LOWESS: A Program for Smoothing Scatterplots by Robust Locally Weighted Regression , 1981 .

[17]  Éric Marchand,et al.  Chasing a moving target from a flying UAV , 2011, 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[18]  Wei Liu,et al.  SSD: Single Shot MultiBox Detector , 2015, ECCV.

[19]  Yong Liu,et al.  Accurate and Real-Time 3-D Tracking for the Following Robots by Fusing Vision and Ultrasonar Information , 2018, IEEE/ASME Transactions on Mechatronics.

[20]  Fei Wang,et al.  A Robust Real-Time Vision System for Autonomous Cargo Transfer by an Unmanned Helicopter , 2015, IEEE Transactions on Industrial Electronics.

[21]  Xuebo Zhang,et al.  RGB-D SLAM in Indoor Environments With STING-Based Plane Feature Extraction , 2018, IEEE/ASME Transactions on Mechatronics.

[22]  Lincheng Shen,et al.  Online Camera-Gimbal-Odometry System Extrinsic Calibration for Fixed-Wing UAV Swarms , 2019, IEEE Access.

[23]  Ming-Hsuan Yang,et al.  Long-term correlation tracking , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[24]  Xiang Chen,et al.  Observer-Based Stabilizing Controllers for Discrete-Time Systems with Quantized Signal and Multiplicative Random Noise , 2016, SIAM J. Control. Optim..

[25]  Yi Wu,et al.  Online Object Tracking: A Benchmark , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[26]  Peiyao Shen,et al.  Essential Properties of Numerical Integration for Time-Optimal Path-Constrained Trajectory Planning , 2017, IEEE Robotics and Automation Letters.

[27]  Yong Liu,et al.  LOIND: An illumination and scale invariant RGB-D descriptor , 2015, 2015 IEEE International Conference on Robotics and Automation (ICRA).

[28]  Vibhav Vineet,et al.  Struck: Structured Output Tracking with Kernels , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[29]  Rong Xiong,et al.  Stereo Visual-Inertial Odometry With Multiple Kalman Filters Ensemble , 2016, IEEE Transactions on Industrial Electronics.

[30]  Ross B. Girshick,et al.  Mask R-CNN , 2017, 1703.06870.

[31]  Kaiming He,et al.  Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[32]  Peiyao Shen,et al.  Complete and Time-Optimal Path-Constrained Trajectory Planning With Torque and Velocity Constraints: Theory and Applications , 2018, IEEE/ASME Transactions on Mechatronics.

[33]  Lei Shi,et al.  Understand scene categories by objects: A semantic regularized scene classifier using Convolutional Neural Networks , 2015, 2016 IEEE International Conference on Robotics and Automation (ICRA).

[34]  Davide Scaramuzza,et al.  SVO: Fast semi-direct monocular visual odometry , 2014, 2014 IEEE International Conference on Robotics and Automation (ICRA).

[35]  Farid Melgani,et al.  Detecting Cars in UAV Images With a Catalog-Based Approach , 2014, IEEE Transactions on Geoscience and Remote Sensing.

[36]  Jose-Ernesto Gomez-Balderas,et al.  Tracking a Ground Moving Target with a Quadrotor Using Switching Control , 2012, Journal of Intelligent & Robotic Systems.

[37]  Rui Caseiro,et al.  High-Speed Tracking with Kernelized Correlation Filters , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[38]  Kilian Q. Weinberger,et al.  CondenseNet: An Efficient DenseNet Using Learned Group Convolutions , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[39]  Liqing Zhang,et al.  Saliency Detection: A Spectral Residual Approach , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.