Cluster Analysis as an Experimental Design Generator, With Application to Gasoline Blend ing Experiments

This article concerns the selection of experimental design points from existing series of candidates when the design variables are too interrelated to be manipulated independently. Designs with an even spread of points are shown to estimate the parameters of an assumed linear or polynomial model reasonably efficiently while providing good tests of lack of fit. Furthestneighbor cluster analysis can be used to select the points of such a design under either the Euclidean or the Mahalanobis measure of distance. The technique is used to select the base fuels in actual series of experiments to measure the effect of blending a particular alcohol into gasolines. A new blending model parameterization is proposed, which relates the blending octane number of this alcohol to both its concentration and to the properties of the base fuel. An analagous generalized least squares model is discussed, which gives a simple expression for the expected mean squares in different error strata.

[1]  P. Hoel Minimax Designs in Two Dimensional Regression , 1965 .

[2]  H. Wynn The Sequential Generation of $D$-Optimum Experimental Designs , 1970 .

[3]  Vladimir J. Lumelsky,et al.  A combined algorithm for weighting the variables and clustering in the clustering problem , 1982, Pattern Recognit..

[4]  Stephen M. Stigler,et al.  Optimal Experimental Design for Polynomial Regression , 1971 .

[5]  O. Dykstra The Augmentation of Experimental Data to Maximize [X′X] , 1971 .

[6]  Julie Scoltock,et al.  A Survey of the Literature of Cluster Analysis , 1982, Comput. J..

[7]  J. Kiefer Optimum Experimental Designs , 1959 .

[8]  S. R. Searle Linear Models , 1971 .

[9]  L. A. Stone,et al.  Computer Aided Design of Experiments , 1969 .

[10]  H. Scheffé Experiments with Mixtures , 1958 .

[11]  M. Kendall,et al.  The advanced theory of statistics , 1945 .

[12]  M. J. R. Healy,et al.  Fitting Equations to Data, 2Nd Ed , 1980 .

[13]  N. Draper,et al.  Applied Regression Analysis , 1966 .

[14]  Otto Dykstra,et al.  The Augmentation of Experimental Data to Maximize |X′X|@@@The Augmentation of Experimental Data to Maximize |X prime X| , 1971 .

[15]  K. Smith ON THE STANDARD DEVIATIONS OF ADJUSTED AND INTERPOLATED VALUES OF AN OBSERVED POLYNOMIAL FUNCTION AND ITS CONSTANTS AND THE GUIDANCE THEY GIVE TOWARDS A PROPER CHOICE OF THE DISTRIBUTION OF OBSERVATIONS , 1918 .

[16]  Brian Everitt,et al.  Cluster analysis , 1974 .

[17]  J. W. Gorman,et al.  Fitting Equations to Data. , 1973 .

[18]  B. Everitt Unresolved Problems in Cluster Analysis , 1979 .

[19]  Herman Chernoff,et al.  Metric considerations in cluster analysis , 1972 .

[20]  Virgil L. Anderson,et al.  Extreme Vertices Design of Mixture Experiments , 1966 .

[21]  Ronald D. Snee,et al.  Developing Blending Models for Gasoline and Other Mixtures , 1981 .