Polynomial regression is still widely used in engineering and economics where polynomials of low order (usually less than tenth order) are being fitted to experimental data. However, the fundamental problem of selecting the optimal order of the polynomial to be fitted to experimental data is not a straightforward problem. This paper investigates the performance of automated methods for predicting the order of the polynomial that can be fitted on the decision boundary formed between two classes in a pattern recognition problem. We have investigated statistical methods and proposed a method of predicting the order of the polynomial. Our proposed machine learning method is computing a number of measurements on the input data which are used by a classifier trained offline to predict the order of the polynomial that should be fitted to the decision boundary. We have considered two matching scenarios. One scenario is where we have counted only the exact matches as being correct and another scenario in which we count as correct an exact match and higher polynomial orders. Experimental results on synthetic data show that our proposed method predicts the exact order of the polynomial with 31.90% accuracy as opposed to 13.22% of the best statistical method, but it also under-estimates the true order almost twice as often when compared to statistical methods of predicting the order of the polynomial to be fitted to the same data points.
[1]
M. Rosenlicht.
Introduction to Analysis
,
1970
.
[2]
Colin L. Mallows,et al.
Some Comments on Cp
,
2000,
Technometrics.
[3]
Grace W. Rumantir.
Comparison of Second-order Polynomial Model Selection Methods: an Experimental Survey
,
1999,
AAAI/IAAI.
[4]
Shigeo Abe DrEng.
Pattern Classification
,
2001,
Springer London.
[5]
T. Ho,et al.
Data Complexity in Pattern Recognition
,
2006
.
[6]
David J. Spiegelhalter,et al.
Machine Learning, Neural and Statistical Classification
,
2009
.
[7]
H. Bozdogan.
Model selection and Akaike's Information Criterion (AIC): The general theory and its analytical extensions
,
1987
.
[8]
G. Schwarz.
Estimating the Dimension of a Model
,
1978
.
[9]
Kenji Matsuura,et al.
Vector Correlation: Review, Exposition, and Geographic Application
,
1992
.
[10]
Michael Golay,et al.
Survey of Model Selection and Model Combination
,
2008
.
[11]
Aki Vehtari,et al.
A survey of Bayesian predictive methods for model assessment, selection and comparison
,
2012
.
[12]
M. Stone.
The Generalized Weierstrass Approximation Theorem
,
1948
.
[13]
David G. Stork,et al.
Pattern Classification
,
1973
.