Statistical Approach to the Identification of Separation Surface for Spatial Data

In spatial clustering, spatial objects are grouped into clusters according to their similarities. In terms of learning or pattern recognition, it belongs to the identification of structures/classes through an unsupervised process. In terms of data mining, it is the discovery of intrinsic classes, particularly new classes, in spatial data. It formulates class structures and determines the number of classes. I have examined in Chap. 2 the importance of clustering as a means for unraveling interesting, useful and natural patterns in spatial data. The process generally does not involve how to separate predetermined classes, or how to determine whether classes are significantly different from each other, or how to assign new objects to given classes. Another fundamental issue of spatial knowledge discovery involves spatial classification. It essentially deals with the separation of pre-specified classes and the assignment of new spatial objects to these classes on the basis of some measurements (with respect to selected features) about them. In terms of learning or pattern recognition, it is actually a supervised learning process which searches for the decision surface separating appropriately various classes. In terms of data mining, it often involves the discovery of classification rules from the training/learning data set that can separate distinct/genuine classes of spatial objects and the assignment of new spatial objects to these labeled classes. Whether the pre-specified classes are significantly different is usually not the main concern in classification. It can be determined by procedures such as the analysis of variance in statistics.

[1]  Pat Langley,et al.  Induction of Selective Bayesian Classifiers , 1994, UAI.

[2]  Peter E. Hart,et al.  The condensed nearest neighbor rule (Corresp.) , 1968, IEEE Trans. Inf. Theory.

[3]  R. Fisher THE USE OF MULTIPLE MEASUREMENTS IN TAXONOMIC PROBLEMS , 1936 .

[4]  Judea Pearl,et al.  Probabilistic reasoning in intelligent systems - networks of plausible inference , 1991, Morgan Kaufmann series in representation and reasoning.

[5]  Bernhard Schölkopf,et al.  Improving the Accuracy and Speed of Support Vector Machines , 1996, NIPS.

[6]  Bernhard Schölkopf,et al.  Improving the accuracy and speed of support vector learning machines , 1997, NIPS 1997.

[7]  M. GuadagniPeter,et al.  A Logit Model of Brand Choice Calibrated on Scanner Data , 1983 .

[8]  J. A. Anderson,et al.  7 Logistic discrimination , 1982, Classification, Pattern Recognition and Reduction of Dimensionality.

[9]  David J. Hand,et al.  Kernel Discriminant Analysis , 1983 .

[10]  John D. C. Little,et al.  A Logit Model of Brand Choice Calibrated on Scanner Data , 2011, Mark. Sci..

[11]  Geoffrey J. McLachlan,et al.  Discriminant Analysis and Statistical Pattern Recognition: McLachlan/Discriminant Analysis & Pattern Recog , 2005 .

[12]  S. Sathiya Keerthi,et al.  A fast iterative nearest point algorithm for support vector machine classifier design , 2000, IEEE Trans. Neural Networks Learn. Syst..

[13]  Trevor J. Hastie,et al.  Discriminative vs Informative Learning , 1997, KDD.

[14]  Marti A. Hearst Trends & Controversies: Support Vector Machines , 1998, IEEE Intell. Syst..

[15]  M. J. D. Powell,et al.  Radial basis functions for multivariable interpolation: a review , 1987 .

[16]  Andreu Català,et al.  K-SVCR. A Multi-class Support Vector Machine , 2000, ECML.

[17]  David J. Hand,et al.  Statistical Classification Methods in Consumer Credit Scoring: a Review , 1997 .

[18]  R. Tibshirani,et al.  Discriminant Analysis by Gaussian Mixtures , 1996 .

[19]  Martin Brown,et al.  Support vector machines for spectral unmixing , 1999, IEEE 1999 International Geoscience and Remote Sensing Symposium. IGARSS'99 (Cat. No.99CH36293).

[20]  David W. Hosmer,et al.  Applied Logistic Regression , 1991 .

[21]  D. Collett Modelling Binary Data , 1991 .

[22]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[23]  Bernhard Schölkopf,et al.  Comparing support vector machines with Gaussian kernels to radial basis function classifiers , 1997, IEEE Trans. Signal Process..

[24]  David L. Waltz,et al.  Toward memory-based reasoning , 1986, CACM.

[25]  Christopher J. C. Burges,et al.  A Tutorial on Support Vector Machines for Pattern Recognition , 1998, Data Mining and Knowledge Discovery.

[26]  B. Efron The Efficiency of Logistic Regression Compared to Normal Discriminant Analysis , 1975 .

[27]  Jason Weston,et al.  Multi-Class Support Vector Machines , 1998 .

[28]  David W. Aha,et al.  Instance-Based Learning Algorithms , 1991, Machine Learning.

[29]  J. Berkson Application of the Logistic Function to Bio-Assay , 1944 .

[30]  Joachim M. Buhmann,et al.  Support vector machines for land usage classification in Landsat TM imagery , 1999, IEEE 1999 International Geoscience and Remote Sensing Symposium. IGARSS'99 (Cat. No.99CH36293).

[31]  G. McLachlan Discriminant Analysis and Statistical Pattern Recognition , 1992 .

[32]  Stephen Grossberg,et al.  ARTMAP: supervised real-time learning and classification of nonstationary data by a self-organizing neural network , 1991, [1991 Proceedings] IEEE Conference on Neural Networks for Ocean Engineering.

[33]  Marti A. Hearst Intelligent Connections: Battling with GA-Joe. , 1998 .

[34]  Paola Sebastiani,et al.  c ○ 2001 Kluwer Academic Publishers. Manufactured in The Netherlands. Robust Learning with Missing Data , 2022 .

[35]  Pat Langley,et al.  Estimating Continuous Distributions in Bayesian Classifiers , 1995, UAI.

[36]  Greg M. Allenby,et al.  Modeling Household Purchase Behavior with Logistic Normal Regression , 1994 .

[37]  S. Menard Applied Logistic Regression Analysis , 1996 .

[38]  J. L. Hodges,et al.  Discriminatory Analysis - Nonparametric Discrimination: Consistency Properties , 1989 .

[39]  C. A. Smith Some examples of discrimination. , 1947, Annals of eugenics.

[40]  Selwyn Piramuthu Feature Selection for Financial Credit-Risk Evaluation Decisions , 1999, INFORMS J. Comput..