Heterologous expression of proteins from Plasmodium falciparum: results from 1000 genes.

As part of a structural genomics initiative, 1000 open reading frames from Plasmodium falciparum, the causative agent of the most deadly form of malaria, were tested in an E. coli protein expression system. Three hundred and thirty-seven of these targets were observed to express, although typically the protein was insoluble. Sixty-three of the targets provided soluble protein in yields ranging from 0.9 to 406.6 mg from one liter of rich media. Higher molecular weight, greater protein disorder (segmental analysis, SEG), more basic isoelectric point (pI), and a lack of homology to E. coli proteins were all highly and independently correlated with difficulties in expression. Surprisingly, codon usage and the percentage of adenosines and thymidines (%AT) did not appear to play a significant role. Of those proteins which expressed, high pI and a hypothetical annotation were both strongly and independently correlated with insolubility. The overwhelmingly important role of pI in both expression and solubility appears to be a surprising and fundamental issue in the heterologous expression of P. falciparum proteins in E. coli. Twelve targets which did not express in E. coli from the native gene sequence were codon-optimized through whole gene synthesis, resulting in the (insoluble) expression of three of these proteins. Seventeen targets which were expressed insolubly in E. coli were moved into a baculovirus/Sf-21 system, resulting in the soluble expression of one protein at a high level and six others at a low level. A variety of factors conspire to make the heterologous expression of P. falciparum proteins challenging, and these observations lay the groundwork for a rational approach to prioritizing and, ultimately, eliminating these impediments.

[1]  D. Kaslow,et al.  A recombinant vaccine expressed in the milk of transgenic mice protects Aotus monkeys from a lethal challenge with Plasmodium falciparum , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[2]  J. Schug,et al.  The Plasmodium genome database , 2002, Nature.

[3]  P. Roepe,et al.  Analysis of the Antimalarial Drug Resistance Protein Pfcrt Expressed in Yeast* , 2002, The Journal of Biological Chemistry.

[4]  P. Gerold,et al.  Plasmodium falciparum: glycosylation status of Plasmodium falciparum circumsporozoite protein expressed in the baculovirus system. , 2002, Experimental parasitology.

[5]  Rebecca Page,et al.  Protein biophysical properties that correlate with crystallization success in Thermotoga maritima: maximum clustering strategy for structural genomics. , 2004, Journal of molecular biology.

[6]  T. Tsuboi,et al.  von Willebrand Factor A domain-related protein, a novel microneme protein of the malaria ookinete highly conserved throughout Plasmodium parasites. , 2001, Molecular and biochemical parasitology.

[7]  Li Li,et al.  PlasmoDB: the Plasmodium genome resource. A database integrating experimental and computational data , 2003, Nucleic Acids Res..

[8]  F. Studier,et al.  Protein production by auto-induction in high density shaking cultures. , 2005, Protein expression and purification.

[9]  L. Goh,et al.  Soluble expression of a functionally active Plasmodium falciparum falcipain-2 fused to maltose-binding protein in Escherichia coli. , 2003, Protein expression and purification.

[10]  K. Berndt,et al.  Codon optimization reveals critical factors for high level expression of two rare codon genes in Escherichia coli: RNA stability and secondary structure but not tRNA abundance. , 2004, Biochemical and biophysical research communications.

[11]  E. Pizzi,et al.  Low-complexity regions in Plasmodium falciparum proteins. , 2001, Genome research.

[12]  R. Schwartz,et al.  Whole proteome pI values correlate with subcellular localizations of proteins for organisms within the three domains of life. , 2001, Genome research.

[13]  C. Brady,et al.  High-Level Production and Purification of P30P2MSP119, an Important Vaccine Antigen for Malaria, Expressed in the Methylotropic Yeast Pichia pastoris , 2001 .

[14]  David E Hill,et al.  High-throughput expression of C. elegans proteins. , 2004, Genome research.

[15]  R. Doolittle,et al.  A simple method for displaying the hydropathic character of a protein. , 1982, Journal of molecular biology.

[16]  Zhiyong Zhou,et al.  Enhanced expression of a recombinant malaria candidate vaccine in Escherichia coli by codon optimization. , 2004, Protein expression and purification.

[17]  S. Gulnik,et al.  Utility of (His)6 tag for purification and refolding of proplasmepsin-2 and mutants with altered activation properties. , 2002, Protein expression and purification.

[18]  G. Singer,et al.  Nucleotide bias causes a genomewide bias in the amino acid composition of proteins. , 2000, Molecular biology and evolution.

[19]  P. D. de Jong,et al.  Ligation-independent cloning of PCR products (LIC-PCR). , 1990, Nucleic acids research.

[20]  A W Munro,et al.  The TB structural genomics consortium: a resource for Mycobacterium tuberculosis biology. , 2003, Tuberculosis.

[21]  P. Rosenthal,et al.  Systematic optimization of expression and refolding of the Plasmodium falciparum cysteine protease falcipain-2. , 2001, Protein expression and purification.

[22]  Jonathan E. Allen,et al.  Genome sequence of the human malaria parasite Plasmodium falciparum , 2002, Nature.

[23]  E. Katoh,et al.  Improving expression and solubility of rice proteins produced as fusion proteins in Escherichia coli. , 2005, Protein expression and purification.

[24]  M. Luo,et al.  Parallel cloning, expression, purification and crystallization of human proteins for structural genomics. , 2002, Acta crystallographica. Section D, Biological crystallography.

[25]  Steven E. Brenner,et al.  Target selection for structural genomics , 2000, Nature Structural Biology.

[26]  John C. Wootton,et al.  Non-globular Domains in Protein Sequences: Automated Segmentation Using Complexity Measures , 1994, Comput. Chem..

[27]  C. Ockenhouse,et al.  Effect of Codon Optimization on Expression Levels of a Functionally Folded Malaria Vaccine Candidate in Prokaryotic and Eukaryotic Expression Systems , 2003, Infection and Immunity.

[28]  W G Hol,et al.  International Journal for Parasitology 30 (2000) 113±118 Rapid communication , 2000 .

[29]  Shigeyuki Yokoyama,et al.  Protein expression systems for structural genomics and proteomics. , 2003, Current opinion in chemical biology.

[30]  D. Fidock,et al.  Structural Elucidation of the Specificity of the Antibacterial Agent Triclosan for Malarial Enoyl Acyl Carrier Protein Reductase* , 2002, The Journal of Biological Chemistry.

[31]  T. Mitamura,et al.  Characterization of proteases involved in the processing of Plasmodium falciparum serine repeat antigen (SERA). , 2002, Molecular and biochemical parasitology.

[32]  Mark Gerstein,et al.  Mining the structural genomics pipeline: identification of protein properties that affect high-throughput experimental analysis. , 2004, Journal of molecular biology.

[33]  Bernard F. Buxton,et al.  The DISOPRED server for the prediction of protein disorder , 2004, Bioinform..

[34]  Bindu Gajria,et al.  PlasmoDB: The Plasmodium Genome Resource , 2005 .

[35]  C. Brady,et al.  High-level production and purification of P30P2MSP1(19), an important vaccine antigen for malaria, expressed in the methylotropic yeast Pichia pastoris. , 2001, Protein expression and purification.

[36]  Over-production of lactate dehydrogenase from Plasmodium falciparum opens a route to new antimalarials , 2001, Biotechnology Letters.

[37]  Yanhui Hu,et al.  Proteome-scale purification of human proteins from bacteria , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[38]  L. Aravind,et al.  Plasmodium Biology Genomic Gleanings , 2003, Cell.

[39]  Qijun Chen,et al.  Optimized expression of Plasmodium falciparum erythrocyte membrane protein 1 domains in Escherichia coli , 2004, Malaria Journal.

[40]  J. Kane,et al.  Effects of rare codon clusters on high-level expression of heterologous proteins in Escherichia coli. , 1995, Current opinion in biotechnology.

[41]  W. Sirawaraporn,et al.  Plasmodium falciparum: asparagine mutant at residue 108 of dihydrofolate reductase is an optimal antifolate-resistant single mutant. , 1997, Experimental parasitology.

[42]  N. Maltsev,et al.  Genome-scale expression of proteins from Bacillus subtilis , 2004, Journal of Structural and Functional Genomics.

[43]  G. Singh,et al.  Hyper-expansion of asparagines correlates with an abundance of proteins with prion-like domains in Plasmodium falciparum. , 2004, Molecular and biochemical parasitology.

[44]  F. Hackett,et al.  Functional Characterization of the Propeptide of Plasmodium falciparum Subtilisin-like Protease-1* , 2003, Journal of Biological Chemistry.

[45]  J. McCafferty,et al.  Production of soluble mammalian proteins in Escherichia coli: identification of protein features that correlate with successful expression , 2004, BMC biotechnology.

[46]  M. Vignali,et al.  A Facile Method for High-throughput Co-expression of Protein Pairs*S , 2004, Molecular & Cellular Proteomics.

[47]  D. Carucci,et al.  High-throughput generation of P. falciparum functional molecules by recombinational cloning. , 2004, Genome research.

[48]  P. Rathod,et al.  Divergent Regulation of Dihydrofolate Reductase Between Malaria Parasite and Human Host , 2002, Science.

[49]  P. Prapunwattana,et al.  Chemical synthesis of the Plasmodium falciparum dihydrofolate reductase-thymidylate synthase gene. , 1996, Molecular and biochemical parasitology.

[50]  D. Forsdyke,et al.  Low-complexity segments in Plasmodium falciparum proteins are primarily nucleic acid level adaptations. , 2003, Molecular and biochemical parasitology.

[51]  P. Fallon,et al.  AGA/AGG codon usage in parasites: implications for gene expression in Escherichia coli. , 1995, Parasitology today.

[52]  H. Bujard,et al.  Vaccine candidate MSP-1 from Plasmodium falciparum: a redesigned 4917 bp polynucleotide enables synthesis and isolation of full-length protein from Escherichia coli and mammalian cells. , 1999, Nucleic acids research.

[53]  D. Battistutta,et al.  Codon usage in Plasmodium falciparum. , 1988, Molecular and biochemical parasitology.

[54]  U. Certa,et al.  Expression and characterisation of plasmepsin I from Plasmodium falciparum. , 1997, European journal of biochemistry.

[55]  A. Chaffotte,et al.  Assistance of maltose binding protein to the in vivo folding of the disulfide-rich C-terminal fragment from Plasmodium falciparum merozoite surface protein 1 expressed in Escherichia coli. , 2003, Biochemistry.

[56]  S. Hay,et al.  The global distribution of clinical episodes of Plasmodium falciparum malaria , 2005, Nature.

[57]  Junpeng Deng,et al.  An improved protocol for rapid freezing of protein samples for long-term storage. , 2004, Acta crystallographica. Section D, Biological crystallography.

[58]  E. Boni,et al.  Cloning grills: High throughput cloning for structural genomics , 2004, Journal of Structural and Functional Genomics.

[59]  Robert D. Finn,et al.  The Pfam protein families database , 2004, Nucleic Acids Res..

[60]  D. Gowda,et al.  Protein glycosylation in the malaria parasite. , 1999, Parasitology today.