Limited Resource Feature Detection, Description, and Matching

Limited-Resource Feature Detection, Description, and Matching Spencer G Fowers Department of Electrical and Computer Engineering Doctor of Philosophy The aims of this research work are to develop a feature detection, description, and matching system for low-resource applications. This work was motivated by the need for a vision sensor to assist the flight of a quad-rotor UAV. This application presented a realworld challenge of autonomous drift stabilization using vision sensors. The initial solution implemented a basic feature detector and matching system on an FPGA. The research then pursued ways to improve the vision system. Research began with color feature detection, and the Color Difference of Gaussians feature detector was developed. CDoG provides better results than gray scale DoG and does not require any additional processing than gray scale if implemented in a parallel architecture. The CDoG Scale-Invariant Feature Transform modification was developed which provided color feature detection and description to the gray scale SIFT descriptor. To demonstrate the benefits of color information, the CDSIFT algorithm was applied to a real application: library book inventory. While color provides added benefit to the CDSIFT descriptor, CDSIFT descriptors are still computationally intractable for a low-resource hardware implementation. Because of these shortcomings, this research focused on developing a new feature descriptor. The BAsis Sparse-coding Inspired Similarity (BASIS) descriptor was developed with low-resource systems in mind. BASIS utilizes sparse coding to provide a generic description of feature characterstics. The BASIS descriptor provided improved accuracy over SIFT, and similar accuracy to SURF on the task of aerial UAV frame-to-frame feature matching. However, basis dictionaries are non-orthogonal and can contain redundant information. In addition to a feature descriptor, an FPGA-based feature correlation (or matching) system needed to be developed. TreeBASIS was developed to answer this need and address the redundancy issues of BASIS. TreeBASIS utilizes a vocabulary tree to drastically reduce descriptor computation time and descriptor size. TreeBASIS also obtains a higher level of accuracy than SIFT, SURF, and BASIS on the UAV aerial imagery task. Both BASIS and TreeBASIS were implemented in VHDL and are well suited for low-resource FPGA applications. TreeBASIS provides a complete feature detection, description, and correlation system-on-a-chip for low-resource FPGA vision systems.

[1]  Erkki Oja,et al.  Image feature extraction by sparse coding and independent component analysis , 1998, Proceedings. Fourteenth International Conference on Pattern Recognition (Cat. No.98EX170).

[2]  In-So Kweon,et al.  System-on-Chip Solution of Video Stabilization for CMOS Image Sensors in Hand-Held Devices , 2011, IEEE Transactions on Circuits and Systems for Video Technology.

[3]  Andrew Zisserman,et al.  Scene Classification Using a Hybrid Generative/Discriminative Approach , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[4]  Jonathan P. How Multi-Vehicle Flight Experiments: Recent Results and Future Directions , 2007 .

[5]  M. Tarbouchi,et al.  Neural network based control of a four rotor helicopter , 2004, 2004 IEEE International Conference on Industrial Technology, 2004. IEEE ICIT '04..

[6]  James K. Archibald,et al.  Reconfigurable On-Board Vision Processing for Small Autonomous Vehicles , 2007, EURASIP J. Embed. Syst..

[7]  Cordelia Schmid,et al.  A Performance Evaluation of Local Descriptors , 2005, IEEE Trans. Pattern Anal. Mach. Intell..

[8]  Mubarak Shah,et al.  Optimal corner detector , 1989, Comput. Vis. Graph. Image Process..

[9]  Cordelia Schmid,et al.  Coloring Local Feature Extraction , 2006, ECCV.

[10]  Koen E. A. van de Sande,et al.  Evaluating Color Descriptors for Object and Scene Recognition , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11]  Paul L. Rosin Measuring Corner Properties , 1999, Comput. Vis. Image Underst..

[12]  Hichem Maaref,et al.  Control of an under-actuated system: application a four rotors rotorcraft , 2005, 2005 IEEE International Conference on Robotics and Biomimetics - ROBIO.

[13]  Kazunobu Ishii,et al.  Field information system using an agricultural helicopter towards precision farming , 2003, Proceedings 2003 IEEE/ASME International Conference on Advanced Intelligent Mechatronics (AIM 2003).

[14]  Dah-Jye Lee,et al.  Dense Disparity Real-Time Stereo Vision Algorithm for Resource-Limited Systems , 2011, IEEE Transactions on Circuits and Systems for Video Technology.

[15]  I. Biederman Recognition-by-components: a theory of human image understanding. , 1987, Psychological review.

[16]  Christopher G. Harris,et al.  A Combined Corner and Edge Detector , 1988, Alvey Vision Conference.

[17]  Spencer G. Fowers Stabilization and Control of a Quad-Rotor Micro-UAV Using Vision Sensors , 2008 .

[18]  Holger Voos Nonlinear state-dependent Riccati equation control of a quadrotor UAV , 2006, 2006 IEEE Conference on Computer Aided Control System Design, 2006 IEEE International Conference on Control Applications, 2006 IEEE International Symposium on Intelligent Control.

[19]  André Dias,et al.  Autonomous Surface Vehicle Docking Manoeuvre with Visual Information , 2007, Proceedings 2007 IEEE International Conference on Robotics and Automation.

[20]  Mario J. Valenti,et al.  Estimation and Control of a Quadrotor Vehicle Using Monocular Vision and Moire Patterns , 2006 .

[21]  R. D'Andrea,et al.  Real-time attitude estimation techniques applied to a four rotor helicopter , 2004, 2004 43rd IEEE Conference on Decision and Control (CDC) (IEEE Cat. No.04CH37601).

[22]  George A. Constantinides,et al.  A Parallel Hardware Architecture for Scale and Rotation Invariant Feature Detection , 2008, IEEE Transactions on Circuits and Systems for Video Technology.

[23]  Arnold W. M. Smeulders,et al.  Color Invariance , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[24]  José Miguel Almeida,et al.  A Real Time Vision System for Autonomous Systems: Characterization during a Middle Size Match , 2007, RoboCup.

[25]  Joost van de Weijer,et al.  Edge and corner detection by photometric quasi-invariants , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[26]  Joost van de Weijer,et al.  Color edge detection by photometric quasi-invariants , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[27]  Carl-Henrik Oertel,et al.  Computer vision sensor for autonomous helicopter hover stabilization , 1997, Defense, Security, and Sensing.

[28]  Paul L. Rosin Augmenting Corner Descriptors , 1996, CVGIP Graph. Model. Image Process..

[29]  Rogelio Lozano,et al.  DYNAMIC MODELLING AND CONFIGURATION STABILIZATION FOR AN X4-FLYER. , 2002 .

[30]  David J. Field,et al.  Emergence of simple-cell receptive field properties by learning a sparse code for natural images , 1996, Nature.

[31]  Graham D. Finlayson,et al.  Color in Perspective , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[32]  Robert Mahony,et al.  Design of a four-rotor aerial robot , 2002 .

[33]  Ivan Laptev,et al.  Improving object detection with boosted histograms , 2009, Image Vis. Comput..

[34]  Michael Elad,et al.  Sparse and Redundant Representations - From Theory to Applications in Signal and Image Processing , 2010 .

[35]  Matthijs C. Dorst Distinctive Image Features from Scale-Invariant Keypoints , 2011 .

[36]  Jurandy Almeida,et al.  Making colors worth more than a thousand words , 2008, SAC '08.

[37]  Norbert Krüger,et al.  A Real-Time Embedded System for Stereo Vision Preprocessing Using an FPGA , 2008, 2008 International Conference on Reconfigurable Computing and FPGAs.

[38]  Robert Mahony,et al.  Modelling and control of a quad-rotor robot , 2006 .

[39]  Cordelia Schmid,et al.  A Comparison of Affine Region Detectors , 2005, International Journal of Computer Vision.

[40]  Dieter Schmalstieg,et al.  Pose tracking from natural features on mobile phones , 2008, 2008 7th IEEE/ACM International Symposium on Mixed and Augmented Reality.

[41]  Peter I. Corke,et al.  Low-cost flight control system for a small autonomous helicopter , 2003, 2003 IEEE International Conference on Robotics and Automation (Cat. No.03CH37422).

[42]  Pau-Choo Chung,et al.  Contrast Context Histogram - A Discriminating Local Descriptor for Image Matching , 2006, 18th International Conference on Pattern Recognition (ICPR'06).

[43]  Roland Siegwart,et al.  Full control of a quadrotor , 2007, 2007 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[44]  Cordelia Schmid,et al.  Scale & Affine Invariant Interest Point Detectors , 2004, International Journal of Computer Vision.

[45]  Josef Kittler,et al.  Local ordered grey levels as an aid to corner detection , 1984, Pattern Recognit..

[46]  A. Tayebi,et al.  Attitude stabilization of a four-rotor aerial robot , 2004, 2004 43rd IEEE Conference on Decision and Control (CDC) (IEEE Cat. No.04CH37601).

[47]  Dah-Jye Lee,et al.  An Effective Color Addition to Feature Detection and Description for Book Spine Image Matching , 2012 .

[48]  Dah-Jye Lee,et al.  A Hardware-Friendly Adaptive Tensor Based Optical Flow Algorithm , 2007, ISVC.

[49]  Shinichi Hirai,et al.  Realtime FPGA-Based Vision System , 2005, J. Robotics Mechatronics.

[50]  Aleksandra Mojsilovic,et al.  Color Quantization and Processing by Fibonacci Lattices , 2022 .

[51]  Alexander Verl,et al.  A rotation invariant feature descriptor O-DAISY and its FPGA implementation , 2011, 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[52]  David J. Field,et al.  What Is the Goal of Sensory Coding? , 1994, Neural Computation.

[53]  Eric W. Frew,et al.  Combined CDGPS and vision-based control of a small autonomous helicopter , 1998, Proceedings of the 1998 American Control Conference. ACC (IEEE Cat. No.98CH36207).

[54]  James Archibald,et al.  A simple, inexpensive, and effective implementation of a vision-guided autonomous robot , 2006, SPIE Optics East.

[55]  Matthew A. Brown,et al.  Invariant Features from Interest Point Groups , 2002, BMVC.

[56]  Paul Beaudet,et al.  Rotationally invariant image operators , 1978 .

[57]  D.M. Dawson,et al.  Velocity control for a quad-rotor uav fly-by-camera interface , 2007, Proceedings 2007 IEEE SoutheastCon.

[58]  S. Shankar Sastry,et al.  A vision system for landing an unmanned aerial vehicle , 2001, Proceedings 2001 ICRA. IEEE International Conference on Robotics and Automation (Cat. No.01CH37164).

[59]  Rachid Deriche,et al.  A computational approach for corner and vertex detection , 1993, International Journal of Computer Vision.

[60]  Frank Archer,et al.  Introduction, overview, and status of the Microwave Autonomous Copter System (MACS) , 2004, IGARSS 2004. 2004 IEEE International Geoscience and Remote Sensing Symposium.

[61]  Franz Pernkopf,et al.  Detection of surface defects on raw steel blocks using Bayesian network classifiers , 2004, Pattern Analysis and Applications.

[62]  George C. Runger,et al.  An Automated Feature Selection Method for Visual Inspection Systems , 2006, IEEE Transactions on Automation Science and Engineering.

[63]  Luc Van Gool,et al.  SURF: Speeded Up Robust Features , 2006, ECCV.

[64]  G. Anitha,et al.  Vision Based Autonomous Landing of an Unmanned Aerial Vehicle , 2012 .

[65]  Li Shang Image Reconstruction Using a Modified Sparse Coding Technique , 2008, ICIC.

[66]  Yun-Ta Tsai,et al.  CDIKP: A highly-compact local feature descriptor , 2008, 2008 19th International Conference on Pattern Recognition.

[67]  Aly A. Farag,et al.  CSIFT: A SIFT Descriptor with Color Invariant Characteristics , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[68]  Gertjan J. Burghouts,et al.  Performance evaluation of local colour invariants , 2009, Comput. Vis. Image Underst..

[69]  Patrick Doherty,et al.  Vision for a UAV helicopter , 2002 .

[70]  Glenn Healey,et al.  The Illumination-Invariant Recognition of 3D Objects Using Local Color Invariants , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[71]  Mario A. Nascimento,et al.  A compact and efficient image retrieval approach based on border/interior pixel classification , 2002, CIKM '02.

[72]  Beau J. Tippetts Real-Time Implementation of Vision Algorithm for Control, Stabilization, and Target Tracking for a Hovering Micro-UAV , 2008 .

[73]  Masanao Koeda,et al.  Development of an immersive teleoperating system for unmanned helicopter , 2002, Proceedings. 11th IEEE International Workshop on Robot and Human Interactive Communication.

[74]  Peter G. Ifju,et al.  Vision-guided flight stability and control for micro air vehicles , 2002, IEEE/RSJ International Conference on Intelligent Robots and Systems.

[75]  Michael Elad,et al.  Sparse Representation for Color Image Restoration , 2008, IEEE Transactions on Image Processing.

[76]  Roland Siegwart,et al.  PID vs LQ control techniques applied to an indoor micro quadrotor , 2004, 2004 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) (IEEE Cat. No.04CH37566).

[77]  Michael Carsten Bosse,et al.  A VISION AUGMENTED NAVIGATION SYSTEM FOR AN AUTONOMOUS HELICOPTER , 1997 .

[78]  Steven L. Waslander,et al.  The Stanford testbed of autonomous rotorcraft for multi agent control (STARMAC) , 2004, The 23rd Digital Avionics Systems Conference (IEEE Cat. No.04CH37576).

[79]  Luo Jun,et al.  Subminiature unmanned surveillance aircraft and its ground control station for security , 2005, IEEE International Safety, Security and Rescue Rototics, Workshop, 2005..

[80]  Noel E. O'Connor,et al.  Learning Midlevel Image Features for Natural Scene and Texture Classification , 2007, IEEE Transactions on Circuits and Systems for Video Technology.

[81]  Gulzar A. Khuwaja,et al.  Bimodal breast cancer classification system , 2004, Pattern Analysis and Applications.

[82]  Michael J. Swain,et al.  Color indexing , 1991, International Journal of Computer Vision.

[83]  Dah-Jye Lee,et al.  FPGA Implementation of a Feature Detection and Tracking Algorithm for Real-time Applications , 2007, ISVC.

[84]  Wenquan Feng,et al.  An architecture of optimised SIFT feature detection for an FPGA implementation of an image matcher , 2009, 2009 International Conference on Field-Programmable Technology.

[85]  Libor Preucil,et al.  FPGA based Speeded Up Robust Features , 2009, 2009 IEEE International Conference on Technologies for Practical Robot Applications.

[86]  James Archibald,et al.  An embedded vision system for an unmanned four-rotor helicopter , 2006, SPIE Optics East.

[87]  Suchendra M. Bhandarkar,et al.  Automated Planning and Optimization of Lumber Production Using Machine Vision and Computed Tomography , 2008, IEEE Transactions on Automation Science and Engineering.

[88]  T. Lindeberg,et al.  Scale-Space Theory : A Basic Tool for Analysing Structures at Different Scales , 1994 .

[89]  Andrew J. Davison,et al.  Real-time simultaneous localisation and mapping with a single camera , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[90]  Ramin Zabih,et al.  Non-parametric Local Transforms for Computing Visual Correspondence , 1994, ECCV.

[91]  Yuchou Chang,et al.  Matching book-spine images for library shelf-reading process automation , 2008, 2008 IEEE International Conference on Automation Science and Engineering.

[92]  Han Wang,et al.  A practical solution to corner detection , 1994, Proceedings of 1st International Conference on Image Processing.

[93]  Matti Pietikäinen,et al.  Optimising Colour and Texture Features for Real-time Visual Inspection , 2002, Pattern Analysis & Applications.

[94]  Yuan Cheng,et al.  Vision-Based Online Process Control in Manufacturing Applications , 2008, IEEE Transactions on Automation Science and Engineering.

[95]  Doran Wilde,et al.  Color DoG: a three-channel color feature detector for embedded systems , 2010, Electronic Imaging.

[96]  Tom Drummond,et al.  Fusing points and lines for high performance tracking , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[97]  Ning Wu,et al.  Fast Facial Image Super-Resolution via Local Linear Transformations for Resource-Limited Applications , 2011, IEEE Transactions on Circuits and Systems for Video Technology.

[98]  Azriel Rosenfeld,et al.  Gray-level corner detection , 1982, Pattern Recognit. Lett..

[99]  Mario Vento,et al.  A multiple expert system for classifying fluorescent intensity in antinuclear autoantibodies analysis , 2009, Pattern Analysis and Applications.

[100]  Steven Lake Waslander,et al.  Multi-agent quadrotor testbed control design: integral sliding mode vs. reinforcement learning , 2005, 2005 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[101]  G. L. Barrows Future visual microsensors for mini/micro-UAV applications , 2002, Proceedings of the 2002 7th IEEE International Workshop on Cellular Neural Networks and Their Applications.

[102]  Michael Elad,et al.  Image Denoising Via Sparse and Redundant Representations Over Learned Dictionaries , 2006, IEEE Transactions on Image Processing.

[103]  Azgal Abichou,et al.  Smooth control of an X4 bidirectional rotors flying robot , 2005, Proceedings of the Fifth International Workshop on Robot Motion and Control, 2005. RoMoCo '05..

[104]  Arnold W. M. Smeulders,et al.  Color Based Object Recognition , 1997, ICIAP.

[105]  H. Bischof,et al.  Automatic analysis of collagen fiber orientation in the outermost layer of human arteries , 2004, Pattern Analysis and Applications.

[106]  Timothy W. McLain,et al.  Autonomous Vehicle Technologies for Small Fixed Wing UAVs , 2003 .

[107]  José Hernández Palancar,et al.  A Hardware Architecture for SIFT Candidate Keypoints Detection , 2009, CIARP.

[108]  Rogelio Lozano,et al.  Real-time stabilization and tracking of a four rotor mini-rotorcraft , 2003 .

[109]  James Ze Wang,et al.  Image retrieval: Ideas, influences, and trends of the new age , 2008, CSUR.