Regression metamodels for simulation with common random numbers: comparison of validation tests and confidence intervals

Linear regression analysis is important in many fields. In the analysis of simulation results, a regression metamodel can be applied, even when common pseudorandom numbers are used. To test the validity of the specified regression model, Rao 1959 generalized the F statistic for lack of fit, whereas Kleijnen 1983 proposed a cross-validation procedure using a Student's t statistic combined with Bonferroni's inequality. This paper reports on an extensive Monte Carlo experiment designed to compare these two methods. Under the normality assumption, cross-validation is conservative, whereas Rao's test realizes its nominal type I error and has high power. Robustness is investigated through lognormal and uniform distributions. When simulation responses are distributed lognormally, then cross-validation using Ordinary Least Squares is the only technique that has acceptable type I error. Uniform distributions give results similar to the normal case. Once the regression model is validated, confidence intervals for the individual regression parameters are computed. The Monte Carlo experiment compares several confidence interval procedures. Under normality, Rao's procedure is preferred since it has good coverage probability and acceptable half-length. Under lognormality, Ordinary Least Squares achieves nominal coverage probability. Uniform distributions again give results similar to the normal case.

[1]  Calyampudi R. Rao SOME PROBLEMS INVOLVING LINEAR HYPOTHESES IN MULTIVARIATE ANALYSIS , 1959 .

[2]  T. W. Anderson An Introduction to Multivariate Statistical Analysis , 1959 .

[3]  Peter M. Neely Comparison of several algorithms for computation of means, standard deviations and correlation coefficients , 1966, CACM.

[4]  H. Scheffé Practical Solutions of the Behrens-Fisher Problem , 1970 .

[5]  R. Dykstra Establishing the Positive Definiteness of the Sample Covariance Matrix , 1970 .

[6]  W. R. Buckland,et al.  Distributions in Statistics: Continuous Multivariate Distributions , 1973 .

[7]  J. Neter,et al.  Applied linear statistical models : regression, analysis of variance, and experimental designs , 1974 .

[8]  Lee W. Schruben,et al.  Pseudorandom Number Assignment in Statistically Designed Simulation and Distribution Sampling Experiments , 1978 .

[9]  S. R. Searle,et al.  The theory of linear models and multivariate analysis , 1981 .

[10]  Jack P. C. Kleijnen,et al.  Cross-validation using the t statistic☆ , 1983 .

[11]  The asymptotic behaviour of the estimated generalized least squares method in the linear regression model , 1983 .

[12]  J.P.C. Kleijnen,et al.  The power of weighted and ordinary least squares with estimated unequal variances in experimental design , 1984 .

[13]  J. Kleijnen Statistical tools for simulation practitioners , 1986 .

[14]  Claude Dennis Pegden,et al.  Statistical Analysis for Use with the Schruben and Margolin Correlation Induction Strategy , 1987, Oper. Res..

[15]  Jack P. C. Kleijnen,et al.  Measurement scales and resolution IV designs : A note (Version 3) , 1990 .

[16]  J. Kleijnen Analyzing simulation experiments with common random numbers , 1988 .

[17]  James R. Wilson,et al.  Restricted Subset Selection Procedures for Simulation , 1989, Oper. Res..

[18]  Jack P. C. Kleijnen,et al.  Regression metamodels for simulation with common random numbers : Comparison of techniques , 1990 .

[19]  Jack P. C. Kleijnen,et al.  Analysis and comparison of two strategies for multi-item inventory systems with joint replenishment costs , 1992 .

[20]  Jack P. C. Kleijnen,et al.  Techniques for sensitivity analysis of simulation models: A case study of the CO2 greenhouse effect , 1992, Simul..

[21]  James R. Wilson,et al.  Validation of Simulation Analysis Methods for the Schruben-Margolin Correlation-Induction Strategy , 1992, Oper. Res..