Adaptive splines and genetic algorithms for optimal statistical modeling

In many statistical applications, a modeling technique is needed which can capture a relationship between two variables x and y that is more complex than a simple linear relationship. In the case where little is known about the function f which relates x and y, the modeling technique should be flexible or adaptive, i.e., able to handle a wide variety of functional shapes and behaviors. Nonparametric modeling is one such technique which has been successful in characterizing features of datasets that could not be obtainable by other means. Spatially adaptive smoothing methods involving regression splines have become a popular and rapidly developing class of such modeling techniques. Most of these methods are based on nonlinear optimization and/or stepwise selection of basis functions. Although computationally fast and spatially adaptive, stepwise knot selection is necessarily suboptimal while determining the best model over the space of adaptive splines is a very poorly behaved nonlinear optimization problem (Wahba 1988). A possible alternative is to use a genetic algorithm to perform knot selection. Genetic algorithms are stochastic search methods which, under certain design conditions, have been shown theoretically to converge to the global optimum of the evaluation function of an optimization problem (Banyopadhyay, Murthy, and Pal 1999). Hence, given a variable selection criterion and a search space of possible knot locations, a genetic algorithm has the potential to find models that are more appropriate in comparison to models selected using stepwise or nonlinear optimization techniques. In this work we explore the use of genetic algorithms for adaptive spline modeling in low dimensional settings. Chapters 3 and 4 concern only linear splines while Chapter 5 involves the optimal fitting of polynomial splines. Experimental results are compared to those of Luo and Wahba (1997), Donoho and Johnstone (1995), Friedman (1991), and Stone, Kooperberg, Hansen and Troung (1997), among others. Future research is focused on comparisons with the recent Bayesian spline methods (e.g., Denison, Mallick and Smith 1998), extensions to higher dimensions, and explorations into other areas of study such as neural networks and fractals.

[1]  David Draper,et al.  Assessment and Propagation of Model Uncertainty , 2011 .

[2]  C. Biller Adaptive Bayesian Regression Splines in Semiparametric Generalized Linear Models , 2000 .

[3]  Thomas S. Shively,et al.  Variable Selection and Function Estimation in Additive Nonparametric Regression Using a Data-Based Prior , 1999 .

[4]  J. Spall,et al.  Theoretical comparisons of evolutionary computation and other optimization approaches , 1999, Proceedings of the 1999 Congress on Evolutionary Computation-CEC99 (Cat. No. 99TH8406).

[5]  Mary J. Lindstrom,et al.  Penalized Estimation of Free-Knot Splines , 1999 .

[6]  Sanghamitra Bandyopadhyay,et al.  Theoretical performance of genetic pattern classifier , 1999 .

[7]  Sanghamitra Bandyopadhyay,et al.  Genetic algorithms for generation of class boundaries , 1998, IEEE Trans. Syst. Man Cybern. Part B.

[8]  Sanghamitra Bandyopadhyay,et al.  Pattern classification using genetic algorithms: Determination of H , 1998, Pattern Recognit. Lett..

[9]  B. Mallick Bayesian curve estimation by polynomial of random order , 1998 .

[10]  J. Simonoff Smoothing Methods in Statistics , 1998 .

[11]  Young K. Truong,et al.  Polynomial splines and their tensor products in extended linear modeling: 1994 Wald memorial lecture , 1997 .

[12]  C. A. Murthy,et al.  Optimal Line Fitting Using Genetic Algorithms. , 1997 .

[13]  J. Wille,et al.  On the structure of the Hessian matrix in feedforward networks and second derivative methods , 1997, Proceedings of International Conference on Neural Networks (ICNN'97).

[14]  K. Burnham,et al.  Model selection: An integral part of inference , 1997 .

[15]  M. J. Baines,et al.  Analysis of an algorithm for generating locally optimal meshes for L2 approximation by discontinuous piecewise polynomials , 1997, Math. Comput..

[16]  G. Wahba,et al.  Hybrid Adaptive Splines , 1997 .

[17]  D. Madigan,et al.  Bayesian Model Averaging for Linear Regression Models , 1997 .

[18]  Kim Fung Man,et al.  Genetic algorithms for control and signal processing , 1997, Proceedings of the IECON'97 23rd International Conference on Industrial Electronics, Control, and Instrumentation (Cat. No.97CH36066).

[19]  Kevin Warwick,et al.  An introduction to radial basis functions for system identification. A comparison with other neural network methods , 1996, Proceedings of 35th IEEE Conference on Decision and Control.

[20]  R. Kohn,et al.  Nonparametric regression using Bayesian variable selection , 1996 .

[21]  S. Chatterjee,et al.  Genetic algorithms and their statistical applications: an introduction , 1996 .

[22]  C. A. Murthy,et al.  Genetic Algorithm with Elitist Model and Its Convergence , 1996, Int. J. Pattern Recognit. Artif. Intell..

[23]  V. Leute,et al.  Least squares splines approximation applied to multicomponent diffusion data , 1996 .

[24]  J. Berger,et al.  The Intrinsic Bayes Factor for Model Selection and Prediction , 1996 .

[25]  I. Johnstone,et al.  Adapting to Unknown Smoothness via Wavelet Shrinkage , 1995 .

[26]  P. Green Reversible jump Markov chain Monte Carlo computation and Bayesian model determination , 1995 .

[27]  C. Posse Projection pursuit exploratory data analysis , 1995 .

[28]  M. J. Bayarri,et al.  The relation between theory and application in statistics , 1995 .

[29]  Hubert Schwetlick,et al.  Least squares approximation by splines with free knots , 1995 .

[30]  C. Geyer,et al.  Annealing Markov chain Monte Carlo with applications to ancestral inference , 1995 .

[31]  Dan Boneh,et al.  On genetic algorithms , 1995, COLT '95.

[32]  Jianqing Fan,et al.  Data‐Driven Bandwidth Selection in Local Polynomial Fitting: Variable Bandwidth and Spatial Adaptation , 1995 .

[33]  Desire L. Massart,et al.  Robust regression and outlier detection for non-linear models using genetic algorithms , 1995 .

[34]  Hans-Georg Beyer,et al.  Toward a Theory of Evolution Strategies: On the Benefits of Sex the (/, ) Theory , 1995, Evolutionary Computation.

[35]  Michael Werman,et al.  Fitting a Second Degree Curve in the Presence of Error , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[36]  I. Johnstone,et al.  Ideal spatial adaptation by wavelet shrinkage , 1994 .

[37]  Paul Dierckx,et al.  Curve and surface fitting with splines , 1994, Monographs on numerical analysis.

[38]  Bernard W. Silverman,et al.  The discrete wavelet transform in S , 1994 .

[39]  Ramesh C. Jain,et al.  A robust backpropagation learning algorithm for function approximation , 1994, IEEE Trans. Neural Networks.

[40]  M. J. Baines,et al.  Algorithms for optimal discontinuous piecewise linear and constant L 2 fits to continuous functions with adjustable nodes in one and two dimensions , 1994 .

[41]  D. M. Titterington,et al.  Neural Networks: A Review from a Statistical Perspective , 1994 .

[42]  Yingkang Hu,et al.  An algorithm for data reduction using splines with free knots , 1993 .

[43]  Gerrit Kateman,et al.  Optimization of calibration data with the dynamic genetic algorithm , 1992 .

[44]  Chris Bishop,et al.  Exact Calculation of the Hessian Matrix for the Multilayer Perceptron , 1992, Neural Computation.

[45]  Roberto Battiti,et al.  First- and Second-Order Methods for Learning: Between Steepest Descent and Newton's Method , 1992, Neural Computation.

[46]  Michael D. Vose,et al.  Generalizing the Notion of Schema in Genetic Algorithms , 1991, Artif. Intell..

[47]  A. Wathen,et al.  On the Best Least Squares Approximation of Continuous Functions using Linear Splines with Free Knots , 1991 .

[48]  David Rogers,et al.  G/SPLINES: A Hybrid of Friedman's Multivariate Adaptive Regression Splines (MARS) Algorithm with Holland's Genetic Algorithm , 1991, ICGA.

[49]  Hung Chen,et al.  ESTIMATION OF A PROJECTION-PURSUIT TYPE REGRESSION MODEL' , 1991 .

[50]  Reinhard Männer,et al.  Towards an Optimal Mutation Probability for Genetic Algorithms , 1990, PPSN.

[51]  Karl Heinz Hoffmann,et al.  Optimizing Complex Problems by Nature's Algorithms: Simulated Annealing and Evolution Strategy - A Comparative Study , 1990, PPSN.

[52]  Thomas Bäck,et al.  Genetic Algorithms and Evolution Strategies - Similarities and Differences , 1990, PPSN.

[53]  Nobue Adachi,et al.  Ecological Dynamics under Different Selection Rules in Distributed and Iterated Prisoner's Dilemma Games , 1990, PPSN.

[54]  Kenneth A. De Jong,et al.  An Analysis of the Interacting Roles of Population Size and Crossover in Genetic Algorithms , 1990, PPSN.

[55]  Paul L. Speckman,et al.  Curve fitting by polynomial-trigonometric regression , 1990 .

[56]  B. Yandell Spline smoothing and nonparametric regression , 1989 .

[57]  Dudley,et al.  Real Analysis and Probability: Integration , 2002 .

[58]  Trevor Hastie,et al.  [Flexible Parsimonious Smoothing and Additive Modeling]: Discussion , 1989 .

[59]  W. Cleveland,et al.  Locally Weighted Regression: An Approach to Regression Analysis by Local Fitting , 1988 .

[60]  T. Lyche,et al.  A Data-Reduction Strategy for Splines with Applications to the Approximation of Functions and Data , 1988 .

[61]  Emile H. L. Aarts,et al.  Simulated Annealing: Theory and Applications , 1987, Mathematics and Its Applications.

[62]  J. Friedman,et al.  Estimating Optimal Transformations for Multiple Regression and Correlation. , 1985 .

[63]  J. E. Baker Adaptive Selection Methods for Genetic Algorithms , 1985, ICGA.

[64]  I. W. Wright Splines in Statistics , 1983 .

[65]  C. J. Stone,et al.  Optimal Global Rates of Convergence for Nonparametric Regression , 1982 .

[66]  Patricia L. Smith,et al.  Curve fitting and modeling with splines using statistical variable selection techniques , 1982 .

[67]  W. J. Studden,et al.  Asymptotic Integrated Mean Square Error Using Least Squares and Bias Minimizing Splines , 1980 .

[68]  Peter Craven,et al.  Smoothing noisy data with spline functions , 1978 .

[69]  D. Jupp Approximation to Data by Splines with Free Knots , 1978 .

[70]  G. Schwarz Estimating the Dimension of a Model , 1978 .

[71]  Charles K. Chui,et al.  Unicity of best mean approximation by second order splines with variable knots , 1978 .

[72]  D. F. Andrews,et al.  A Robust Method for Multiple Linear Regression , 1974 .

[73]  Douglas M. Hawkins,et al.  On the Choice of Segments in Piecewise Approximation , 1972 .

[74]  N. Metropolis,et al.  Equation of State Calculations by Fast Computing Machines , 1953, Resonance.

[75]  I. J. Schoenberg,et al.  ON POLYA FREQUENCY FUNCTIONS. III. THE POSITIVITY OF TRANSLATION DETERMINANTS WITH AN APPLICATION TO THE INTERPOLATION PROBLEM BY SPLINE CURVES( , 1953 .

[76]  Stefan Sperlich,et al.  Generalized Additive Models , 2014 .

[77]  M. Hansen,et al.  Spline Adaptation in Extended Linear Models , 1998 .

[78]  Adrian F. M. Smith,et al.  Automatic Bayesian curve fitting , 1998 .

[79]  Clifford M. Hurvich,et al.  Smoothing parameter selection in nonparametric regression using an improved Akaike information criterion , 1998 .

[80]  G. Unter Rudolph Convergence Rates of Evolutionary Algorithms for a Class of Convex Objective Functions , 1997 .

[81]  Zbigniew Michalewicz,et al.  Genetic Algorithms + Data Structures = Evolution Programs , 1996, Springer Berlin Heidelberg.

[82]  C. J. Stone,et al.  Polychotomous Regression , 1995 .

[83]  Yoshua Bengio,et al.  Pattern Recognition and Neural Networks , 1995 .

[84]  Francesco Palmieri,et al.  Theoretical analysis of evolutionary algorithms with an infinite population size in continuous space. Part I: Basic properties of selection and mutation , 1994, IEEE Trans. Neural Networks.

[85]  Günter Rudolph,et al.  Convergence analysis of canonical genetic algorithms , 1994, IEEE Trans. Neural Networks.

[86]  Heekuck Oh,et al.  Neural Networks for Pattern Recognition , 1993, Adv. Comput..

[87]  H. J. Larson Least squares estimation of linear splines with unknown knot locations , 1992 .

[88]  Gunar E. Liepins,et al.  Schema Disruption , 1991, ICGA.

[89]  J. Freidman,et al.  Multivariate adaptive regression splines , 1991 .

[90]  Thomas Bäck,et al.  A Survey of Evolution Strategies , 1991, ICGA.

[91]  Zbigniew Michalewicz,et al.  Handling Constraints in Genetic Algorithms , 1991, ICGA.

[92]  Charles L. Karr,et al.  Genetic algorithm applied to least squares curve fitting , 1991 .

[93]  José Carlos Príncipe,et al.  A Simulated Annealing Like Convergence Theory for the Simple Genetic Algorithm , 1991, ICGA.

[94]  Michael F. Barnsley,et al.  Fractals everywhere , 1988 .

[95]  D. E. Goldberg,et al.  Simple Genetic Algorithms and the Minimal, Deceptive Problem , 1987 .

[96]  L. Schumaker Spline Functions: Basic Theory , 1981 .

[97]  C. R. Deboor,et al.  A practical guide to splines , 1978 .

[98]  Girdhar G. Agarwal,et al.  Asymptotic design and estimation using linear splines , 1978 .

[99]  J C Pezzack,et al.  An assessment of derivative determining techniques used for motion analysis. , 1977, Journal of biomechanics.

[100]  John H. Holland,et al.  Adaptation in Natural and Artificial Systems: An Introductory Analysis with Applications to Biology, Control, and Artificial Intelligence , 1992 .

[101]  K. Dejong,et al.  An analysis of the behavior of a class of genetic adaptive systems , 1975 .

[102]  C. L. Mallows Some comments on C_p , 1973 .

[103]  H. Akaike,et al.  Information Theory and an Extension of the Maximum Likelihood Principle , 1973 .

[104]  C. D. Boor,et al.  Least Squares Cubic Spline Approximation, II - Variable Knots , 1968 .

[105]  H. Piaggio Mathematical Analysis , 1955, Nature.

[106]  Herbert K. H. Lee A Framework for Nonparametric Regression Using Neural Networks , 2022 .