Statistical approaches to maximize recombinant protein expression in Escherichia coli: a general review.

The supply of many valuable proteins with potential clinical or industrial use is often limited by their low natural availability. With modern advances in genomics, proteomics and bioinformatics, the number of proteins produced using recombinant techniques is increasing exponentially, which seems to promise an unlimited supply of recombinant proteins. Demand for recombinant proteins has grown as more applications in several fields become a commercial reality. Escherichia coli (E. coli) is the most widely used expression system for producing recombinant proteins for structural and functional studies. However, producing soluble proteins in E. coli remains a major bottleneck for structural biology projects. One of the most challenging steps in any such project is predicting which protein or protein fragment will express solubly and purify well enough for crystallographic studies. The production of soluble and active proteins is influenced by several factors, including expression host, fusion tag, induction temperature and induction time. Statistically designed experiments are gaining ground in recombinant protein production because they provide information on interactions between variables that escape the "one-factor-at-a-time" method. Here, we review the most important factors affecting the production of recombinant proteins in soluble form. We also describe how statistically designed experiments can increase protein yield and purity and help find conditions for crystal growth.
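
The point about variable interactions can be made concrete with a minimal sketch of a two-level full factorial design. The sketch assumes two of the factors named above (induction temperature and induction time), coded as -1 (low) and +1 (high). The yield values are hypothetical numbers chosen only for illustration, and the helper names `full_factorial` and `effect` are not taken from any study or library.

```python
from itertools import product


def full_factorial(n_factors):
    """Return every run of a two-level full factorial design in coded units."""
    return list(product((-1, 1), repeat=n_factors))


def effect(design, responses, columns):
    """Estimate a main effect (one column) or an interaction (several columns).

    The estimate is the mean response over runs where the product of the
    coded levels in `columns` is +1, minus the mean over runs where it is -1.
    """
    high, low = [], []
    for run, y in zip(design, responses):
        sign = 1
        for c in columns:
            sign *= run[c]
        (high if sign > 0 else low).append(y)
    return sum(high) / len(high) - sum(low) / len(low)


# Column 0: induction temperature, column 1: induction time (coded -1 / +1).
design = full_factorial(2)

# Hypothetical soluble yields (arbitrary units), one per run, in design order.
yields = [10.0, 30.0, 20.0, 12.0]

temperature_effect = effect(design, yields, [0])
time_effect = effect(design, yields, [1])
interaction_effect = effect(design, yields, [0, 1])

print("temperature:", temperature_effect)
print("time:", time_effect)
print("temperature x time:", interaction_effect)
```

With these made-up numbers, raising either factor alone from the low/low baseline improves the yield (10 to 20 for temperature, 10 to 30 for time), so a one-factor-at-a-time search would suggest setting both factors high. The fourth run shows that the high/high combination gives only 12, and the large negative interaction estimate captures exactly that behaviour. Only a design that varies both factors together can reveal it.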
