A high-throughput screening of genes that encode proteins transported into the endoplasmic reticulum in mammalian cells

The compartments of eukaryotic cells maintain a distinct protein composition to perform a variety of specialized functions. We developed a new method for identifying the proteins that are transported to the endoplasmic reticulum (ER) in living mammalian cells. The principle is based on the reconstitution of two split fragments of enhanced green fluorescent protein (EGFP) by protein splicing with DnaE from Synechocystis PCC6803. Complementary DNA (cDNA) libraries fused to the N-terminal halves of DnaE and EGFP are introduced in mammalian cells with retroviruses. If an expressed protein is transported into the ER, the N-terminal half of EGFP meets its C-terminal half in the ER, and full-length EGFP is reconstituted by protein splicing. The fluorescent cells are isolated using fluorescence-activated cell sorting and the cDNAs are sequenced. The developed method was able to accurately identify cDNAs that encode proteins transported to the ER. We identified 27 novel proteins as the ER-targeting proteins. The present method overcomes the limitation of the previous GFP- or epitope-tagged methods, using which it was difficult to identify the ER-targeting proteins in a high-throughput manner.

[1]  T. C. Evans,et al.  Characterization of a naturally occurring trans-splicing intein from Synechocystis sp. PCC6803. , 2001, Biochemistry.

[2]  T. C. Evans,et al.  Mechanistic and kinetic considerations of protein splicing. , 2002, Chemical reviews.

[3]  S. Chapman,et al.  High-Throughput Viral Expression of cDNA–Green Fluorescent Protein Fusions Reveals Novel Subcellular Addresses and Identifies Unique Proteins That Interact with Plasmodesmata Article, publication date, and citation information can be found at www.plantcell.org/cgi/doi/10.1105/tpc.013284. , 2003, The Plant Cell Online.

[4]  T. Kitamura,et al.  Plat-E: an efficient and stable system for transient packaging of retroviruses , 2000, Gene Therapy.

[5]  M. Paetzel,et al.  Signal peptidases. , 2002, Chemical reviews.

[6]  X. Morin,et al.  A protein trap strategy to detect GFP-tagged proteins expressed from their endogenous loci in Drosophila , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[7]  S. Karlin,et al.  Methods for assessing the statistical significance of molecular sequence features by using general scoring schemes. , 1990, Proceedings of the National Academy of Sciences of the United States of America.

[8]  S. Fields,et al.  Protein analysis on a proteomic scale , 2003, Nature.

[9]  K. Akiyama,et al.  Functional Annotation of a Full-Length Arabidopsis cDNA Collection , 2002, Science.

[10]  T Nakahata,et al.  A method to identify cDNAs based on localization of green fluorescent protein fusion products. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[11]  H. Paulus,et al.  Protein splicing and related forms of protein autoprocessing. , 2000, Annual review of biochemistry.

[12]  K. Nakayama,et al.  Cloning and sequence analysis of cDNA for mouse prolactin. , 1986, Biochimica et biophysica acta.

[13]  Patricia J. Johnson,et al.  Ancient Invasions: From Endosymbionts to Organelles , 2004, Science.

[14]  S. Brunak,et al.  SHORT COMMUNICATION Identification of prokaryotic and eukaryotic signal peptides and prediction of their cleavage sites , 1997 .

[15]  Yoshio Umezawa,et al.  A genetic approach to identifying mitochondrial proteins , 2003, Nature Biotechnology.

[16]  S. Munro,et al.  A C-terminal signal prevents secretion of luminal ER proteins , 1987, Cell.

[17]  G von Heijne,et al.  Signal sequences. The limits of variation. , 1985, Journal of molecular biology.

[18]  E. Birney,et al.  Analysis of the mouse transcriptome based on functional annotation of 60,770 full-length cDNAs , 2002, Nature.

[19]  E. O’Shea,et al.  Global analysis of protein localization in budding yeast , 2003, Nature.

[20]  Lila M Gierasch,et al.  Signal Sequences: The Same Yet Different , 1996, Cell.

[21]  M. Gerstein,et al.  Subcellular localization of the yeast proteome. , 2002, Genes & development.

[22]  Trisha N Davis,et al.  Protein localization in proteomics. , 2004, Current opinion in chemical biology.

[23]  B. Martoglio,et al.  Signal sequences: more than just greasy peptides. , 1998, Trends in cell biology.

[24]  I. Braakman,et al.  Quality control in the endoplasmic reticulum protein factory , 2003, Nature.

[25]  C. Bult,et al.  Functional annotation of a full-length mouse cDNA collection , 2001, Nature.

[26]  Y. Hiraoka,et al.  Large‐scale screening of intracellular protein localization in living fission yeast cells by the use of a GFP‐fusion genomic DNA library , 2000, Genes to cells : devoted to molecular & cellular mechanisms.

[27]  A. Morris,et al.  Lipids and the exocytotic machinery of eukaryotic cells. , 2003, Current opinion in cell biology.

[28]  Pamela A. Silver,et al.  Nuclear transport and cancer: from mechanism to intervention , 2004, Nature Reviews Cancer.

[29]  Paul Horton,et al.  Better Prediction of Protein Cellular Localization Sites with the it k Nearest Neighbors Classifier , 1997, ISMB.

[30]  K. Paulsson,et al.  Chaperones and folding of MHC class I molecules in the endoplasmic reticulum. , 2003, Biochimica et biophysica acta.

[31]  Z. Hu,et al.  Protein trans-splicing by a split intein encoded in a split DnaE gene of Synechocystis sp. PCC6803. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[32]  S. Brunak,et al.  Improved prediction of signal peptides: SignalP 3.0. , 2004, Journal of molecular biology.