Rotation-Invariant Object Detection of Remotely Sensed Images Based on Texton Forest and Hough Voting

The Hough forest method is an effective method for object detection in ground-shot images that has received increasing research attention. However, this method lacks the ability to detect objects with arbitrary orientations. This largely constrains the method from being used in detecting geospatial objects from remotely sensed (RS) images since geospatial objects can have many different orientations. In order to achieve rotation invariance and compensate the associated loss of discriminative power, this paper presents a novel color-enhanced rotation-invariant Hough forest (CRIHF) method for detecting geospatial objects in RS images. In our method, we propose to train a Pose-Estimation-based Rotation-invariant Texton Forest (PE-RTF) which first uses dominant gradient orientations to align local image patches. The orientations are then jointly used with coordinates in Hough voting to detect object position. In order to increase discriminative power, Texton Forest is used in codebook generation. Moreover, theoretically sound color-invariant gradients are employed. By rotating split functions rather than image patches in the RTF and sparsely accumulating Hough votes on grid points, computational times can be reduced by two orders of magnitude. The evaluation of the CRIHF method on a data set containing 525 airplanes and a second data set containing 68 residential buildings shows that our method is rotation invariant and robust. The detector achieves around 90% recall rate on both data sets. Experiments also show that our method is noise resistant and can achieve a decent detection performance at a high level (30%) of “salt and pepper” impulsive noise.

[1]  Dana H. Ballard,et al.  Generalizing the Hough transform to detect arbitrary shapes , 1981, Pattern Recognit..

[2]  V. F. Leavers,et al.  Which Hough transform , 1993 .

[3]  Arnold W. M. Smeulders,et al.  Color and Scale: The Spatial Structure of Color Images , 2000, ECCV.

[4]  Arnold W. M. Smeulders,et al.  Color Invariance , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[5]  Dan Roth,et al.  Learning a Sparse Representation for Object Detection , 2002, ECCV.

[6]  Mark S. Nixon,et al.  Invariant characterisation of the Hough transform for pose estimation of arbitrary shapes , 2002, Pattern Recognit..

[7]  M. Pal,et al.  Random forests for land cover classification , 2003, IGARSS 2003. 2003 IEEE International Geoscience and Remote Sensing Symposium. Proceedings (IEEE Cat. No.03CH37477).

[8]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[9]  Song-Chun Zhu,et al.  What are Textons? , 2005, International Journal of Computer Vision.

[10]  Pietro Perona,et al.  A Visual Category Filter for Google Images , 2004, ECCV.

[11]  Jitendra Malik,et al.  Representing and Recognizing the Visual Appearance of Materials using Three-dimensional Textons , 2001, International Journal of Computer Vision.

[12]  Joydeep Ghosh,et al.  Investigation of the random forest framework for classification of hyperspectral data , 2005, IEEE Transactions on Geoscience and Remote Sensing.

[13]  Carlos López-Martínez,et al.  A novel algorithm for ship detection in SAR imagery based on the wavelet transform , 2005, IEEE Geoscience and Remote Sensing Letters.

[14]  Josiane Zerubia,et al.  Supervised segmentation of remote sensing images based on a tree-structured MRF model , 2005, IEEE Transactions on Geoscience and Remote Sensing.

[15]  Song-Chun Zhu,et al.  What are Textons? , 2005, Int. J. Comput. Vis..

[16]  B. S. Manjunath,et al.  Modeling and Detection of Geospatial Objects Using Texture Motifs , 2006, IEEE Transactions on Geoscience and Remote Sensing.

[17]  Cordelia Schmid,et al.  Human Detection Using Oriented Histograms of Flow and Appearance , 2006, ECCV.

[18]  Lorenzo Bruzzone,et al.  A Multilevel Context-Based System for Classification of Very High Spatial Resolution Images , 2006, IEEE Transactions on Geoscience and Remote Sensing.

[19]  Alberto Sanfeliu,et al.  Computation of Rotation Local Invariant Features using the Integral Image for Real Time Object Detection , 2006, 18th International Conference on Pattern Recognition (ICPR'06).

[20]  Antonio Criminisi,et al.  TextonBoost for Image Understanding: Multi-Class Object Recognition and Segmentation by Jointly Modeling Texture, Layout, and Context , 2007, International Journal of Computer Vision.

[21]  Bernt Schiele,et al.  Robust Object Detection with Interleaved Categorization and Segmentation , 2008, International Journal of Computer Vision.

[22]  Manik Varma,et al.  Learning The Discriminative Power-Invariance Trade-Off , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[23]  Jordi Inglada,et al.  Automatic recognition of man-made objects in high resolution optical remote sensing images by SVM classification of geometric image features , 2007 .

[24]  Hichem Sahli,et al.  A Stochastic Framework for the Identification of Building Rooftops Using a Single Remote Sensing Image , 2008, IEEE Transactions on Geoscience and Remote Sensing.

[25]  D. Geman,et al.  Stationary Features and Cat Detection , 2008 .

[26]  Huseyin Gokhan Akcay,et al.  Automatic Detection of Geospatial Objects Using Multiple Hierarchical Segmentations , 2008, IEEE Transactions on Geoscience and Remote Sensing.

[27]  Antonio Criminisi,et al.  Object Class Segmentation using Random Forests , 2008, BMVC.

[28]  Roberto Cipolla,et al.  Semantic texton forests for image categorization and segmentation , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[29]  Pascal Fua,et al.  Joint pose estimator and feature learning for object detection , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[30]  Nikos Paragios,et al.  Recognition-Driven Two-Dimensional Competing Priors Toward Automatic and Accurate Building Detection , 2009, IEEE Transactions on Geoscience and Remote Sensing.

[31]  Cem Ünsalan,et al.  Urban-Area and Building Detection Using SIFT Keypoints and Graph Theory , 2009, IEEE Transactions on Geoscience and Remote Sensing.

[32]  Juergen Gall,et al.  Class-specific Hough forests for object detection , 2009, CVPR.

[33]  Mohammad Mehdi Ebadzadeh,et al.  Fuzzy generalized hough transform invariant to rotation and scale in noisy environment , 2009, 2009 IEEE International Conference on Fuzzy Systems.

[34]  Deren Li,et al.  Histogram of oriented gradient detector with color-invariant gradients in Gaussian color space , 2010 .

[35]  Bo Du,et al.  Hybrid Detectors Based on Selective Endmembers , 2010, IEEE Transactions on Geoscience and Remote Sensing.

[36]  Hui Zhou,et al.  A Novel Hierarchical Method of Ship Detection from Spaceborne Optical Image Based on Shape and Texture Features , 2010, IEEE Transactions on Geoscience and Remote Sensing.

[37]  Nikos Paragios,et al.  Large-Scale Building Reconstruction Through Information Fusion and 3-D Priors , 2010, IEEE Transactions on Geoscience and Remote Sensing.

[38]  Zhen Lei,et al.  Land Cover Classification for Remote Sensing Imagery Using Conditional Texton Forest With Historical Land Cover Map , 2011, IEEE Geoscience and Remote Sensing Letters.

[39]  Stefania Matteoli,et al.  An Automatic Approach to Adaptive Local Background Estimation and Suppression in Hyperspectral Target Detection , 2011, IEEE Transactions on Geoscience and Remote Sensing.

[40]  Caroline Fossati,et al.  Improvement of Target-Detection Algorithms Based on Adaptive Three-Dimensional Filtering , 2011, IEEE Transactions on Geoscience and Remote Sensing.

[41]  Cem Ünsalan,et al.  A Probabilistic Framework to Detect Buildings in Aerial and Satellite Images , 2011, IEEE Transactions on Geoscience and Remote Sensing.

[42]  Stefania Matteoli,et al.  Operational and Performance Considerations of Radiative-Transfer Modeling in Hyperspectral Target Detection , 2011, IEEE Transactions on Geoscience and Remote Sensing.

[43]  Matthijs C. Dorst Distinctive Image Features from Scale-Invariant Keypoints , 2011 .

[44]  Pushmeet Kohli,et al.  On Detection of Multiple Object Instances Using Hough Transforms , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.