An efficient and generic strategy for producing soluble human proteins and domains in E. coli by screening construct libraries

The implementation of generic and efficient technologies for the production of recombinant eukaryotic proteins remains an outstanding challenge in structural genomics programs. We have recently developed a new method for rapid identification of soluble protein expression in E. coli, the colony filtration blot (CoFi blot). In this study, the CoFi blot was used to screen libraries where the N‐terminal translation start point was randomized. To investigate the efficiency of this strategy, we have attributed a large number of proteins to this process. In a set of 32 mammalian proteins, we were able to double the success rate (from 34 to 68%) of producing soluble and readily purifiable proteins in E. coli. Most of the selected constructs had their N‐termini close to predicted domain borders and the method therefore provides a mean for experimental “domain foot printing.” Surprisingly, for most of the targets, we also observed expressing constructs that were close to full‐length. In summary this strategy constitutes a generic and efficient method for producing mammalian proteins for structural and functional studies. Proteins 2006. © 2006 Wiley‐Liss, Inc.

[1]  Stephen K. Burley,et al.  High-throughput Limited Proteolysis/Mass Spectrometry for Protein Domain Elucidation , 2005, Journal of Structural and Functional Genomics.

[2]  Pär Nordlund,et al.  Colony filtration blot: a new screening method for soluble protein expression in Escherichia coli , 2005, Nature Methods.

[3]  T. Terwilliger,et al.  Protein tagging and detection with engineered self-assembling fragments of green fluorescent protein , 2005, Nature Biotechnology.

[4]  Scott A. Lesley,et al.  Protein Production and Crystallization at the Joint Center for Structural Genomics , 2005, Journal of Structural and Functional Genomics.

[5]  G. Phillips,et al.  High-throughput Purification and Quality Assurance of Arabidopsis thaliana Proteins for Eukaryotic Structural Genomics , 2005, Journal of Structural and Functional Genomics.

[6]  Virgil L. Woods,et al.  On the use of DXMS to produce more crystallizable proteins: Structures of the T. maritima proteins TM0160 and TM1171 , 2004, Protein science : a publication of the Protein Society.

[7]  F. Baneyx,et al.  Recombinant protein folding and misfolding in Escherichia coli , 2004, Nature Biotechnology.

[8]  Golan Yona,et al.  Automatic prediction of protein domains from sequence information using a hybrid learning system , 2004, Bioinform..

[9]  J. S. Sodhi,et al.  Prediction and functional analysis of native disorder in proteins from the three kingdoms of life. , 2004, Journal of molecular biology.

[10]  K. Büssow,et al.  Fast identification of folded human protein domains expressed in E. coli suitable for structural analysis , 2004, BMC Structural Biology.

[11]  Virgil L. Woods,et al.  Rapid refinement of crystallographic protein construct definition employing enhanced hydrogen/deuterium exchange MS. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[12]  D. Liang,et al.  Structural genomics efforts at the Chinese Academy of Sciences and Peking University , 2004, Journal of Structural and Functional Genomics.

[13]  Amos Bairoch,et al.  Recent improvements to the PROSITE database , 2004, Nucleic Acids Res..

[14]  Pascal Braun,et al.  High throughput protein production for functional proteomics. , 2003, Trends in biotechnology.

[15]  G. Waldo,et al.  Genetic screens and directed evolution for protein solubility. , 2003, Current opinion in chemical biology.

[16]  G. Rubin,et al.  Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[17]  Yanhui Hu,et al.  Proteome-scale purification of human proteins from bacteria , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[18]  I. Taylor,et al.  Identification of in vitro folding conditions for procathepsin S and cathepsin S using fractional factorial screens. , 2002, Protein expression and purification.

[19]  Martin Hammarström,et al.  Rapid screening for improved solubility of small human proteins produced as fusion proteins in Escherichia coli , 2002, Protein science : a publication of the Protein Society.

[20]  John F. Hunt,et al.  Protein solubility and folding monitored in vivo by structural complementation of a genetic marker protein , 2001, Nature Biotechnology.

[21]  F. Inagaki,et al.  Random PCR-based screening for soluble domains using green fluorescent protein. , 2001, Biochemical and biophysical research communications.

[22]  R D Klausner,et al.  The mammalian gene collection. , 1999, Science.

[23]  R. Raines,et al.  Hypersensitive substrate for ribonucleases. , 1999, Nucleic acids research.

[24]  T. Terwilliger,et al.  Rapid protein-folding assay using green fluorescent protein , 1999, Nature Biotechnology.

[25]  A. Mittermaier,et al.  A simple in vivo assay for increased protein solubility , 1999, Protein science : a publication of the Protein Society.

[26]  F. Baneyx,et al.  Expression of aggregation-prone recombinant proteins at low temperatures: a comparative study of the Escherichia coli cspA and tac promoter systems. , 1997, Protein expression and purification.

[27]  S. Cohen Domain elucidation by mass spectrometry. , 1996, Structure.

[28]  Steven L. Cohen,et al.  Probing the solution structure of the DNA‐binding protein Max by a combination of proteolysis and mass spectrometry , 1995, Protein science : a publication of the Protein Society.

[29]  W. Stemmer Rapid evolution of a protein in vitro by DNA shuffling , 1994, Nature.

[30]  C. Scorer,et al.  Foreign gene expression in yeast: a review , 1992, Yeast.

[31]  J. Beckwith,et al.  Escherichia coli alkaline phosphatase fails to acquire disulfide bonds when retained in the cytoplasm , 1991, Journal of bacteriology.

[32]  Robert C. Wolpert,et al.  A Review of the , 1985 .

[33]  S. Henikoff Unidirectional digestion with exonuclease III creates targeted breakpoints for DNA sequencing. , 1984, Gene.