Small-scale, semi-automated purification of eukaryotic proteins for structure determination

A simple approach is described that allows cost-effective, automated purification of recombinant proteins at levels sufficient for functional characterization or structural studies. Studies with four human stem cell proteins, an engineered version of green fluorescent protein, and other proteins are included. The method combines an expression vector (pVP62K) that provides in vivo cleavage of an initial fusion protein, a factorially designed auto-induction medium that improves the performance of small-scale production, and rapid, automated metal affinity purification of His8-tagged proteins. For initial small-scale production screening, single-colony transformants were grown overnight in 0.4 ml of auto-induction medium, the expressed proteins were purified using the Promega Maxwell 16, and the purification results were analyzed by Caliper LC90 capillary electrophoresis. The yield of purified [U-15N]-His8-Tcl-1 was 7.5 μg/ml of culture medium, that of purified [U-15N]-His8-GFP was 68 μg/ml, and that of purified selenomethionine-labeled AIA–GFP (His8 removed by treatment with TEV protease) was 172 μg/ml. The yield information obtained from a successful automated purification from 0.4 ml was used to inform the decision to scale up to a second, meso-scale (10–50 ml) cell growth and automated purification. 1H–15N NMR HSQC spectra of His8-Tcl-1 and of His8-GFP prepared from 50 ml cultures showed excellent chemical shift dispersion, consistent with well-folded states in solution suitable for structure determination. Moreover, AIA–GFP obtained by proteolytic removal of the His8 tag was subjected to crystallization screening and yielded crystals under several conditions. Single crystals were subsequently produced and optimized by the hanging drop method. The structure was solved by molecular replacement at a resolution of 1.7 Å.
This approach provides an efficient way to carry out several key target screening steps that are essential for the successful operation of proteomics pipelines with eukaryotic proteins: examination of total expression, determination of proteolysis of fusion tags, quantification of the yield of purified protein, and assessment of suitability for structure determination.
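
The way a small-scale yield can inform a scale-up decision may be illustrated with a short calculation. The sketch below takes the three yields quoted above (in μg per ml of culture) and projects the total mass of purified protein at the 0.4 ml screening volume and at the 10 and 50 ml meso-scale volumes. It assumes that the per-ml yield stays constant across culture volumes, which is a simplification introduced here for illustration and not a result reported in this work; the function and variable names are likewise hypothetical.

```python
# Yields quoted in the abstract, in micrograms of purified protein per ml of culture.
REPORTED_YIELDS_UG_PER_ML = {
    "His8-Tcl-1": 7.5,
    "His8-GFP": 68.0,
    "AIA-GFP": 172.0,
}


def projected_mass_ug(yield_ug_per_ml, culture_volume_ml):
    """Project total purified protein (in micrograms) for a culture volume.

    ASSUMPTION: the yield per ml is independent of culture volume.
    This linear scaling is an illustrative simplification only.
    """
    if yield_ug_per_ml < 0 or culture_volume_ml < 0:
        raise ValueError("yield and volume must be non-negative")
    return yield_ug_per_ml * culture_volume_ml


def projection_table(volumes_ml=(0.4, 10.0, 50.0)):
    """Return {protein: {volume_ml: projected_mass_ug}} for the quoted yields."""
    return {
        name: {v: projected_mass_ug(y, v) for v in volumes_ml}
        for name, y in REPORTED_YIELDS_UG_PER_ML.items()
    }


if __name__ == "__main__":
    for name, row in projection_table().items():
        cells = ", ".join(f"{v:g} ml: {m:.1f} ug" for v, m in row.items())
        print(f"{name}: {cells}")
```

Under this assumption, a target such as His8-Tcl-1 would be projected to give a few hundred micrograms from a 50 ml culture, whereas the higher-yielding GFP constructs would give milligram quantities; whether a given amount is enough depends on the downstream NMR or crystallization experiment.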
