Principal Curve Clustering With Noise

Clustering on principal curves combines parametric modeling of noise with nonparametric modeling of feature shape This is useful for detecting curvilinear features in spatial point patterns with or without background noise Applications of this include the detection of curvilinear mine elds from reconnaissance images some of the points in which represent false detections and the detection of seismic faults from earthquake catalogs Our algorithm for principal curve clustering is in two steps the rst is hierarchical and agglomerative HPCC and the second consists of iterative relocation based on the Clas si cation EM algorithm CEM PCC HPCC is used to combine potential feature clusters while CEM PCC re nes the results and deals with background noise It is important to have a good starting point for the algorithm this can be found manually or automatically using for example nearest neighbor clutter removal or model based clustering We choose the number of features and the amount of smoothing simultaneously using approximate Bayes factors

[1]  O. William Journal Of The American Statistical Association V-28 , 1932 .

[2]  R. Prim Shortest connection networks and some generalizations , 1957 .

[3]  Charles T. Zahn,et al.  Graph-Theoretical Methods for Detecting and Describing Gestalt Clusters , 1971, IEEE Transactions on Computers.

[4]  P. Bloomfield,et al.  Spline Functions in Data Analysis. , 1974 .

[5]  G. Schwarz Estimating the Dimension of a Model , 1978 .

[6]  R. Tibshirani,et al.  Local Likelihood Estimation , 1987 .

[7]  J. D. Wilson,et al.  A smoothed EM approach to indirect estimation problems, with particular reference to stereology and emission tomography , 1990 .

[8]  D. Nychka Some properties of adding a smoothing step to the EM algorithm , 1990 .

[9]  P. Green On Use of the EM Algorithm for Penalized Likelihood Estimation , 1990 .

[10]  R. Tibshirani Principal curves revisited , 1992 .

[11]  A. Raftery,et al.  Ice Floe Identification in Satellite Images Using Mathematical Morphology and Clustering about Principal Curves , 1992 .

[12]  A. Raftery,et al.  Model-based Gaussian and non-Gaussian clustering , 1993 .

[13]  R. Tibshirani,et al.  Adaptive Principal Surfaces , 1994 .

[14]  L. Wasserman,et al.  A Reference Bayesian Test for Nested Hypotheses and its Relationship to the Schwarz Criterion , 1995 .

[15]  Geoff A. Latham,et al.  Existence of EMS Solutions and a Priori Estimates , 1995, SIAM J. Matrix Anal. Appl..

[16]  C. Fraley,et al.  Nonparametric Maximum Likelihood Estimation of Features in Spatial Point Processes Using Voronoï Tessellation , 1997 .

[17]  A. Raftery,et al.  Nearest-Neighbor Clutter Removal for Estimating Features in Spatial Point Processes , 1998 .

[18]  A. Raftery,et al.  Detecting features in spatial point processes with clutter via model-based clustering , 1998 .