Improving Rooftop Detection in Aerial Images Through Machine Learning

Abstract : In this paper, we examine the use of machine learning to improve a rooftop detection process, which is one step in a vision system that recognizes buildings in overhead imagery. We review the problem of analyzing aerial images and describe an existing vision system that automates the recognition of buildings in such images. After this, we briefly review two well known learning algorithms, representing different inductive biases, that we selected to improve rooftop detection. An important aspect of this problem is that the data sets are highly skewed and the cost of mistakes differs for the two classes, so we evaluate the algorithms under varying misclassification costs using ROC analysis. We report three sets of experiments designed to illuminate facets of applying machine learning to the image analysis task. One set of studies focuses on within image learning, in which both training and testing data are derived from the same image. Another addresses between image learning, in which training and testing sets come from different images. A final set investigates learning using all available images in an effort to determine the best performing method. Experimental results demonstrate that useful generalization occurs when training and testing on data derived from images that differ in location and in aspect. Furthermore, they demonstrate that, under most conditions and across a range of misclassification costs, a trained naive Bayesian classifier exceeded, by as much as a factor of two, the predictive accuracy of nearest neighbor and a handcrafted linear classifier, the solution currently used in the building detection system. Analysis of learning curves reveals that naive Bayes achieved superiority using as little as 6% of the available training data.

[1]  D. M. Green,et al.  Signal detection theory and psychophysics , 1966 .

[2]  James P. Egan,et al.  Signal detection theory and ROC analysis , 1975 .

[3]  J. Hanley,et al.  The meaning and use of the area under a receiver operating characteristic (ROC) curve. , 1982, Radiology.

[4]  Leo Breiman,et al.  Classification and Regression Trees , 1984 .

[5]  B. A. Shepherd,et al.  An Appraisal of a Decision Tree Approach to Image Classification , 1983, IJCAI.

[6]  Nada Lavrac,et al.  The Multi-Purpose Incremental Learning System AQ15 and Its Testing Application to Three Medical Domains , 1986, AAAI.

[7]  Michael Brady,et al.  Generating and Generalizing Models of Visual Objects , 1987, Artif. Intell..

[8]  J A Swets,et al.  Measuring the accuracy of diagnostic systems. , 1988, Science.

[9]  Avinash C. Kak,et al.  Automatic Generation of Object Class Descriptions Using Symbolic Learning Techniques , 1991, AAAI.

[10]  P. Langley,et al.  An Analysis of Bayesian Classifiers , 1992, AAAI.

[11]  Pat Langley,et al.  An Analysis of Bayesian Classifiers , 1992, AAAI.

[12]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[13]  Jacek M. Zurada,et al.  Introduction to artificial neural systems , 1992 .

[14]  K. Boyer,et al.  Incremental Modelbase Updating : Learning new model sites , 1993 .

[15]  K. Bowyer,et al.  Learning combination of evidence functions in object recognition , 1993 .

[16]  Patrick GRos,et al.  Matching and Clustering: two Steps towards Automatic Object Model Generation in Computer Vision , 1993 .

[17]  Paul A. Viola Feature-Based Recognition of Objects , 1993 .

[18]  Darrell Conklin,et al.  Transformation-invariant indexing and machine discovery for computer vision , 1993 .

[19]  Wendy G. Lehnert,et al.  Corpus-Driven Knowledge Acquisition for Discourse Analysis , 1994, AAAI.

[20]  Bruce A. Draper,et al.  Goal-Directed Classification Using Linear Machine Decision Trees , 1994, IEEE Trans. Pattern Anal. Mach. Intell..

[21]  Michael J. Pazzani,et al.  Reducing Misclassification Costs , 1994, ICML.

[22]  David D. Lewis,et al.  Heterogeneous Uncertainty Sampling for Supervised Learning , 1994, ICML.

[23]  Peter D. Turney Cost-Sensitive Classification: Empirical Evaluation of a Hybrid Genetic Decision Tree Induction Algorithm , 1994, J. Artif. Intell. Res..

[24]  Herbert A. Simon,et al.  Applications of machine learning and rule induction , 1995, CACM.

[25]  Moninder Singh,et al.  Learning Goal Oriented Bayesian Networks for Telecommunications Risk Management , 1996, ICML.

[26]  Yoav Freund,et al.  Experiments with a New Boosting Algorithm , 1996, ICML.

[27]  Kevin W. Bowyer,et al.  Combination of multiple classifiers using local accuracy estimates , 1996, Proceedings CVPR IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[28]  Azriel Rosenfeld,et al.  Recognizing Blasting Caps in X-Ray Images , 1996 .

[29]  Tomaso Poggio,et al.  Image Representations for Visual Learning , 1996, Science.

[30]  S. Nayar,et al.  Early Visual Learning , 1996 .

[31]  RecognitionBruce A. DraperDept Learning Control Strategies for Object Recognition , 1996 .

[32]  Bruce A. Draper,et al.  Learning Grouping Strategies for 2D and 3D Object Recognition , 1996 .

[33]  Nasser M. Nasrabadi,et al.  Multi-stage target recognition using modular vector quantizers and multilayer perceptrons , 1996, Proceedings CVPR IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[34]  Harry Wechsler,et al.  Face and hand gesture recognition using hybrid classifiers , 1996, Proceedings of the Second International Conference on Automatic Face and Gesture Recognition.

[35]  Bruce A. Draper,et al.  Learning control strategies for object recognition , 1997 .

[36]  Richard Maclin,et al.  Feature Engineering and Classifier Selection: A Case Study in Venusian Volcano Detection , 1997, ICML.

[37]  Tom Fawcett,et al.  Analysis and Visualization of Classifier Performance: Comparison under Imprecise Class and Cost Distributions , 1997, KDD.

[38]  Kevin W. Bowyer,et al.  Combination of Multiple Classifiers Using Local Accuracy Estimates , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[39]  Andrew P. Bradley,et al.  The use of the area under the ROC curve in the evaluation of machine learning algorithms , 1997, Pattern Recognit..

[40]  Ryszard S. Michalski,et al.  Learning symbolic descriptions of shape for object recognition in X-ray images , 1997 .

[41]  Federico Girosi,et al.  Training support vector machines: an application to face detection , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[42]  Claire Cardie,et al.  Improving Minority Class Prediction Using Case-Specific Feature Weights , 1997, ICML.

[43]  Dean A. Pomerleau,et al.  Neural Network Vision for Robot Driving , 1997 .

[44]  Stan Matwin,et al.  Addressing the Curse of Imbalanced Training Sets: One-Sided Selection , 1997, ICML.

[45]  Pat Langley,et al.  Learning to Detect Rooftops in Aerial Images , 1997 .

[46]  Astro Teller,et al.  PADO: a new learning architecture for object recognition , 1997 .

[47]  Takeo Kanade,et al.  Neural Network-Based Face Detection , 1998, IEEE Trans. Pattern Anal. Mach. Intell..