Statistical vs. Stochastic experimental design: An experimental comparison on the example of protein refolding

Optimization of experimental problems is a challenging task in both engineering and science. In principle, two different design of experiments (DOE) strategies exist: statistical and stochastic methods. Both aim to efficiently and precisely identify optimal solutions inside the problem‐specific search space. Here, we evaluate and compare both strategies on the same experimental problem, the optimization of the refolding conditions of the lipase from Thermomyces lanuginosus with 26 variables under study. Protein refolding is one of the main bottlenecks in the process development for recombinant proteins. Despite intensive effort, the prediction of refolding from sequence information alone is still not applicable today. Instead, suitable refolding conditions are typically derived empirically in large screening experiments. Thus, protein refolding should constitute a good performance test for DOE strategies. We compared an iterative stochastic optimization applying a genetic algorithm and a standard statistical design consisting of a D‐optimal screening step followed by an optimization via response surface methodology. Our results revealed that only the stochastic optimization was able to identify optimal refolding conditions (∼1.400 U g−1 refolded activity), which were 3.4‐fold higher than the standard. Additionally, the stochastic optimization proved quite robust, as three independent optimizations performed similar. In contrast, the statistical DOE resulted in a suboptimal solution and failed to identify comparable activities. Interactions between process variables proved to be pivotal for this optimization. Hence, the linear screening model was not able to identify the most important process variables correctly. Thereby, this study highlighted the limits of the classic two‐step statistical DOE. © 2012 American Institute of Chemical Engineers Biotechnol. Prog., 2012

[1]  T. Arakawa,et al.  Effect of additives on protein aggregation. , 2009, Current pharmaceutical biotechnology.

[2]  M. Haas,et al.  Conformational lability of lipases observed in the absence of an oil-water interface: crystallographic studies of enzymes from the fungi Humicola lanuginosa and Rhizopus delemar. , 1995, Journal of lipid research.

[3]  D Weuster-Botz,et al.  Experimental design for fermentation media development: statistical design or global random search? , 2000, Journal of bioscience and bioengineering.

[4]  C. A. Coello Coello,et al.  Evolutionary multi-objective optimization: a historical view of the field , 2006, IEEE Computational Intelligence Magazine.

[5]  F. Schüth,et al.  On the suitability of different representations of solid catalysts for combinatorial library design by genetic algorithms. , 2008, Journal of combinatorial chemistry.

[6]  D. Weuster‐Botz,et al.  Experimental optimization of protein refolding with a genetic algorithm , 2010, Protein science : a publication of the Protein Society.

[7]  Rui Nian,et al.  Recent advances in biomolecular process intensification , 2006 .

[8]  J. Ahn,et al.  Investigation of refolding condition for Pseudomonas fluorescens lipase by response surface methodology. , 1997, Journal of biotechnology.

[9]  Alois Jungbauer,et al.  Current status of technical protein refolding. , 2007, Journal of biotechnology.

[10]  Margaret J. Robertson,et al.  Design and Analysis of Experiments , 2006, Handbook of statistics.

[11]  Luca Maria Gambardella,et al.  A survey on metaheuristics for stochastic combinatorial optimization , 2009, Natural Computing.

[12]  F D Lasky,et al.  Designing in quality through design control: a manufacturer's perspective. , 1997, Clinical chemistry.

[13]  T. Fox,et al.  Investigation of protein refolding using a fractional factorial screen: A study of reagent effects and interactions , 2005, Protein science : a publication of the Protein Society.

[14]  J. Buchner,et al.  Refolding of inclusion body proteins. , 2004, Methods in molecular medicine.

[15]  W. N. Chen,et al.  A rational design for hepatitis B virus X protein refolding and bioprocess development guided by second virial coefficient studies , 2011, Applied Microbiology and Biotechnology.

[16]  Carlos A. Coello Coello,et al.  Evolutionary multi-objective optimization: a historical view of the field , 2006, IEEE Comput. Intell. Mag..

[17]  Bieke Dejaegher,et al.  Experimental designs and their recent advances in set-up, data interpretation, and analytical applications. , 2011, Journal of pharmaceutical and biomedical analysis.

[18]  M. Reetz,et al.  Microbial lipases form versatile tools for biotechnology. , 1998, Trends in biotechnology.

[19]  R. Schmid,et al.  Functional expression of Candida antarctica lipase B in the Escherichia coli cytoplasm—a screening system for a frequently used biocatalyst , 2006, Applied Microbiology and Biotechnology.

[20]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[21]  Kouhei Tsumoto,et al.  Suppression of protein interactions by arginine: a proposed mechanism of the arginine effects. , 2007, Biophysical chemistry.

[22]  T. Borchert,et al.  In vitro selection of enzymatically active lipase variants from phage libraries using a mechanism-based inhibitor. , 2001, Gene.

[23]  R. Rudolph,et al.  In vitro folding of inclusion body proteins , 1996, FASEB journal : official publication of the Federation of American Societies for Experimental Biology.

[24]  Ferdi Schüth,et al.  Multi-objective optimization in combinatorial chemistry applied to the selective catalytic reduction of NO with C3H6 , 2007 .

[25]  Renaud Vincentelli,et al.  High‐throughput automated refolding screening of inclusion bodies , 2004, Protein science : a publication of the Protein Society.

[26]  P. Valax,et al.  Molecular Characterization of β‐Lactamase Inclusion Bodies Produced in Escherichia coli. 1. Composition , 1993, Biotechnology progress.

[27]  D. Lawson,et al.  Structural origins of the interfacial activation in Thermomyces (Humicola) lanuginosa lipase. , 2000, Biochemistry.

[28]  Marco Laumanns,et al.  SPEA2: Improving the Strength Pareto Evolutionary Algorithm For Multiobjective Optimization , 2002 .

[29]  R. Seckler,et al.  An automated in vitro protein folding screen applied to a human dynactin subunit , 2004, Protein science : a publication of the Protein Society.

[30]  P. Dalby,et al.  Ultra scale‐down of protein refold screening in microwells: Challenges, solutions and application , 2009, Biotechnology and bioengineering.

[31]  E D Clark,et al.  Protein refolding for industrial processes. , 2001, Current opinion in biotechnology.

[32]  Anindya Basu,et al.  Refolding of proteins from inclusion bodies: rational design and recipes , 2011, Applied Microbiology and Biotechnology.

[33]  A. Middelberg,et al.  Preparative protein refolding. , 2002, Trends in biotechnology.

[34]  Rachel A. Jodun,et al.  The matrix refolded , 2005, Nature Methods.

[35]  Michelle K. M. Chow,et al.  REFOLD: an analytical database of protein refolding methods. , 2006, Protein expression and purification.

[36]  Ezequiel Franco-Lara,et al.  Comparison of genetic algorithms for experimental multi-objective optimization on the example of medium design for cyanobacteria. , 2006, Biotechnology journal.

[37]  E Schwarz,et al.  Advances in refolding of proteins produced in E. coli. , 1998, Current opinion in biotechnology.

[38]  Andre Raw,et al.  Quality by Design: Concepts for ANDAs , 2008, The AAPS Journal.