Learning and Design of Principal Curves

Principal curves have been defined as "self-consistent" smooth curves which pass through the "middle" of a d-dimensional probability distribution or data cloud. They give a summary of the data and also serve as an efficient feature extraction tool. We take a new approach by defining principal curves as continuous curves of a given length which minimize the expected squared distance between the curve and points of the space randomly chosen according to a given distribution. The new definition makes it possible to theoretically analyze principal curve learning from training data and it also leads to a new practical construction. Our theoretical learning scheme chooses a curve from a class of polygonal lines with k segments and with a given total length to minimize the average squared distance over n training points drawn independently. Convergence properties of this learning scheme are analyzed and a practical version of this theoretical algorithm is implemented. In each iteration of the algorithm, a new vertex is added to the polygonal line and the positions of the vertices are updated so that they minimize a penalized squared distance criterion. Simulation results demonstrate that the new algorithm compares favorably with previous methods, both in terms of performance and computational complexity, and is more robust to varying data models.

[1]  A. Kolmogorov,et al.  Entropy and "-capacity of sets in func-tional spaces , 1961 .

[2]  Werner Stuetzle,et al.  Geometric Properties of Principal Curves in the Plane , 1996 .

[3]  Robert M. Gray,et al.  An Algorithm for Vector Quantizer Design , 1980, IEEE Trans. Commun..

[4]  Vincent Kanade,et al.  Clustering Algorithms , 2021, Wireless RF Energy Transfer in the Massive IoT Era.

[5]  IItevor Hattie Principal Curves and Surfaces , 1984 .

[6]  Aaas News,et al.  Book Reviews , 1893, Buffalo Medical and Surgical Journal.

[7]  László Györfi,et al.  A Probabilistic Theory of Pattern Recognition , 1996, Stochastic Modelling and Applied Probability.

[8]  T. Hastie Principal Curves and Surfaces , 1984 .

[9]  M. Niranjan,et al.  SUBSPACE MODELS FOR SPEECH TRANSITIONS USING PRINCIPAL , 1998 .

[10]  P. Delicado Another Look at Principal Curves and Surfaces , 2001 .

[11]  KéglBalázs,et al.  Learning and Design of Principal Curves , 2000 .

[12]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[13]  A. Raftery,et al.  Ice Floe Identification in Satellite Images Using Mathematical Morphology and Clustering about Principal Curves , 1992 .

[14]  Joydeep Ghosh,et al.  Principal curves for nonlinear feature extraction and classification , 1998, Electronic Imaging.

[15]  R. Tibshirani Principal curves revisited , 1992 .

[16]  C. I. Watson,et al.  NIST Special Database 10 , 1993 .

[17]  Dudley,et al.  Real Analysis and Probability: Measurability: Borel Isomorphism and Analytic Sets , 2002 .

[18]  Vladimir Cherkassky,et al.  Self-Organization as an Iterative Kernel Smoothing Process , 1995, Neural Computation.

[19]  B. Kégl,et al.  Principal curves: learning, design, and applications , 2000 .

[20]  W. Andrew LO, . Finance: Survey.. Journal of the American Statistical Association, , . , 2000 .

[21]  P. Delicado Principal curves and principal oriented points , 1998 .

[22]  Bernard D. Flury,et al.  Principal Points and Self-Consistent Points of Elliptical Distributions , 1995 .

[23]  R. A. Silverman,et al.  Introductory Real Analysis , 1972 .

[24]  W. Cleveland Robust Locally Weighted Regression and Smoothing Scatterplots , 1979 .

[25]  Joydeep Ghosh,et al.  Principal curve classifier-a nonlinear approach to pattern classification , 1998, 1998 IEEE International Joint Conference on Neural Networks Proceedings. IEEE World Congress on Computational Intelligence (Cat. No.98CH36227).

[26]  Allen Gersho,et al.  Vector quantization and signal compression , 1991, The Kluwer international series in engineering and computer science.

[27]  W. Hoeffding Probability Inequalities for sums of Bounded Random Variables , 1963 .

[28]  Nikolaos Papanikolopoulos,et al.  Letter-level shape description by skeletonization in faded documents , 1998, Proceedings Fourth IEEE Workshop on Applications of Computer Vision. WACV'98 (Cat. No.98EX201).

[29]  D. Luenberger Optimization by Vector Space Methods , 1968 .