Effective Design-Based Model Selection for Definitive Screening Designs

ABSTRACT Since their introduction by Jones and Nachtsheim in 2011, definitive screening designs (DSDs) have seen application in fields as diverse as bio-manufacturing, green energy production, and laser etching. One barrier to their routine adoption for screening is due to the difficulties practitioners experience in model selection when both main effects and second-order effects are active. Jones and Nachtsheim showed that for six or more factors, DSDs project to designs in any three factors that can fit a full quadratic model. In addition, they showed that DSDs have high power for detecting all the main effects as well as one two-factor interaction or one quadratic effect as long as the true effects are much larger than the error standard deviation. However, simulation studies of model selection strategies applied to DSDs can disappoint by failing to identify the correct set of active second-order effects when there are more than a few such effects. Standard model selection strategies such as stepwise regression, all-subsets regression, and the Dantzig selector are general tools that do not make use of any structural information about the design. It seems reasonable that a modeling approach that makes use of the known structure of a designed experiment could perform better than more general purpose strategies. This article shows how to take advantage of the special structure of the DSD to obtain the most clear-cut analytical results possible.

[1]  Christopher J. Nachtsheim,et al.  Blocking Schemes for Definitive Screening Designs , 2016, Technometrics.

[2]  Fengshan Bai,et al.  Constructing Definitive Screening Designs Using Conference Matrices , 2012 .

[3]  Kenny Q. Ye,et al.  Model-robust supersaturated and partially supersaturated designs , 2009 .

[4]  Christopher J. Nachtsheim,et al.  A Class of Three-Level Designs for Definitive Screening in the Presence of Second-Order Effects , 2011 .

[5]  R. Tibshirani,et al.  A LASSO FOR HIERARCHICAL INTERACTIONS. , 2012, Annals of statistics.

[6]  Ji Zhu,et al.  Variable Selection With the Strong Heredity Constraint and Its Oracle Property , 2010 .

[7]  Christopher J. Nachtsheim,et al.  Definitive Screening Designs with Added Two-Level Categorical Factors* , 2013 .

[8]  Hugh A. Chipman,et al.  Prior Distributions for Bayesian Analysis of Screening Experiments , 2006 .

[9]  William Li,et al.  Using Definitive Screening Designs to Identify Active First- and Second-Order Factor Effects , 2017 .

[10]  L. Stefanski,et al.  Controlling Variable Selection by the Addition of Pseudovariables , 2007 .

[11]  A Miller,et al.  Using Folded-Over Nonorthogonal Designs , 2005, Technometrics.

[12]  Bradley Jones,et al.  21st century screening experiments: What, why, and how , 2016 .

[13]  H. Zou,et al.  Regularization and variable selection via the elastic net , 2005 .

[14]  R. Lenth Quick and easy analysis of unreplicated factorials , 1989 .

[15]  Christopher J. Marley,et al.  A comparison of design and model selection methods for supersaturated experiments , 2010, Comput. Stat. Data Anal..

[16]  Felix Famoye,et al.  Plane Answers to Complex Questions: Theory of Linear Models , 2003, Technometrics.