Large-scale identification of mammalian proteins localized to nuclear sub-compartments.

Many nuclear components participating in related pathways appear concentrated in specific areas of the mammalian nucleus. The importance of this organization is attested to by the dysfunction that correlates with mis-localization of nuclear proteins in human disease and cancer. Determining the sub-nuclear localization of proteins is therefore important for understanding genome regulation and function, and it also provides clues to function for novel proteins. However, the complexity of proteins in the mammalian nucleus is too large to tackle this on a protein by protein basis. Large-scale approaches to determining protein function and sub-cellular localization are required. We have used a visual gene trap screen to identify more than 100 proteins, many of which are normal, located within compartments of the mouse nucleus. The most common discrete localizations detected are at the nucleolus and the splicing speckles and on chromosomes. Proteins at the nuclear periphery, or in other nuclear foci, have also been identified. Several of the proteins have been implicated in human disease or cancer, e.g. ATRX, HMGI-C, NBS1 and EWS, and the gene-trapped proteins provide a route into further understanding their function. We find that sequence motifs are often shared amongst proteins co-localized within the same sub-nuclear compartment. Conversely, some generally abundant motifs are lacking from the proteins concentrated in specific areas of the nucleus. This suggests that we may be able to predict sub-nuclear localization for proteins in databases based on their sequence.

[1]  T. Misteli,et al.  High mobility of proteins in the mammalian cell nucleus , 2000, Nature.

[2]  R. Poot,et al.  HuCHRAC, a human ISWI chromatin remodelling complex contains hACF1 and two novel histone‐fold proteins , 2000, The EMBO journal.

[3]  H. Hameister,et al.  The expression pattern of the Hmgic gene during development , 1998, Genes, chromosomes & cancer.

[4]  P. D. Dal Cin,et al.  HMGI(Y) and HMGI-C dysregulation: a common occurrence in human tumors. , 1999, Advances in anatomic pathology.

[5]  S. Pickering,et al.  Changes in the distribution of membranous organelles during mouse early development. , 1985, Journal of embryology and experimental morphology.

[6]  A. F. Neuwald,et al.  Purification and biochemical characterization of interchromatin granule clusters , 1999, The EMBO journal.

[7]  A. Krainer,et al.  A specific subset of SR proteins shuttles continuously between the nucleus and the cytoplasm. , 1998, Genes & development.

[8]  W. Bickmore,et al.  Re-modelling of nuclear architecture in quiescent and senescent human fibroblasts , 2000, Current Biology.

[9]  D. Hernandez-Verdun,et al.  Initiation of nucleolar assembly is independent of RNA polymerase I transcription. , 2000, Molecular biology of the cell.

[10]  G. Dreyfuss,et al.  Transport of Proteins and RNAs Review in and out of the Nucleus , 1999 .

[11]  G. Fronza,et al.  A gene trap approach to isolate mammalian genes involved in the cellular response to genotoxic stress. , 1997, Nucleic acids research.

[12]  S. Elledge,et al.  BASC, a super complex of BRCA1-associated proteins involved in the recognition and repair of aberrant DNA structures. , 2000, Genes & development.

[13]  J M Davies,et al.  Leukemia-associated retinoic acid receptor alpha fusion partners, PML and PLZF, heterodimerize and colocalize to nuclear bodies. , 1997, Proceedings of the National Academy of Sciences of the United States of America.

[14]  C. Will,et al.  The human U5 snRNP-specific 100-kD protein is an RS domain-containing, putative RNA helicase with significant homology to the yeast splicing factor Prp28p. , 1997, RNA.

[15]  G. Dreyfuss,et al.  Transport of Proteins and RNAs in and out of the Nucleus , 1999, Cell.

[16]  M. Slovak,et al.  t(9;11)(p22;p15) in acute myeloid leukemia results in a fusion between NUP98 and the gene encoding transcriptional coactivators p52 and p75-lens epithelium-derived growth factor (LEDGF). , 2000, Cancer research.

[17]  W. Bickmore,et al.  Putting the genome on the map. , 1998, Trends in genetics : TIG.

[18]  P. Lieberman,et al.  Lytic but Not Latent Replication of Epstein-Barr Virus Is Associated with PML and Induces Sequential Release of Nuclear Domain 10 Proteins , 2000, Journal of Virology.

[19]  Xianjin Zhou,et al.  Mutation responsible for the mouse pygmy phenotype in the developmentally regulated factor HMGI-C , 1995, Nature.

[20]  K. Sugimoto,et al.  Mouse MCM proteins: complex formation and transportation to the nucleus , 1996, Genes to cells : devoted to molecular & cellular mechanisms.

[21]  M. Dixon,et al.  Mutations in the Treacher Collins syndrome gene lead to mislocalization of the nucleolar protein treacle. , 1998, Human molecular genetics.

[22]  C. González,et al.  Motif trap: a rapid method to clone motifs that can target proteins to defined subcellular localisations. , 1999, Journal of cell science.

[23]  P Bork,et al.  Wanted: subcellular localization of proteins based on sequence. , 1998, Trends in cell biology.

[24]  Stephen S. Taylor,et al.  A Visual Screen of a Gfp-Fusion Library Identifies a New Type of Nuclear Envelope Membrane Protein , 1999, The Journal of cell biology.

[25]  I. Raška,et al.  Association between the nucleolus and the coiled body. , 1990, Journal of structural biology.

[26]  A. Lamond,et al.  Structure and function in the nucleus. , 1998, Science.

[27]  E. Ezoe,et al.  Analysis of nuclear localization signals using a green fluorescent protein-fusion protein library. , 1999, Experimental cell research.

[28]  K. Wilson,et al.  Lamins and Disease Insights into Nuclear Infrastructure , 2001, Cell.

[29]  M. Monk,et al.  HPRT-deficient (Lesch–Nyhan) mouse embryos derived from germline colonization by cultured cells , 1987, Nature.

[30]  B. Schaar,et al.  Characterization of the Kinetochore Binding Domain of CENP-E Reveals Interactions with the Kinetochore Proteins CENP-F and hBUBR1 , 1998, The Journal of cell biology.

[31]  R. Schwartz,et al.  Whole proteome pI values correlate with subcellular localizations of proteins for organisms within the three domains of life. , 2001, Genome research.

[32]  E. Birney,et al.  Analysis of the RNA-recognition motif and RS and RGG domains: conservation in metazoan pre-mRNA splicing factors. , 1993, Nucleic acids research.

[33]  L. Gudas,et al.  Characterization of genes which exhibit reduced expression during the retinoic acid-induced differentiation of F9 teratocarcinoma cells: involvement of cyclin D3 in RA-mediated growth arrest , 1998, Molecular and Cellular Endocrinology.

[34]  R. Pepperkok,et al.  In vivo detection of snRNP‐rich organelles in the nuclei of mammalian cells. , 1991, The EMBO journal.

[35]  K. Chada,et al.  Genomic structure and expression of the murine Hmgi-c gene. , 1996, Nucleic acids research.

[36]  Y. Agata,et al.  Targeting of Krüppel-associated Box-containing Zinc Finger Proteins to Centromeric Heterochromatin , 2001, The Journal of Biological Chemistry.

[37]  J. V. Moran,et al.  Initial sequencing and analysis of the human genome. , 2001, Nature.

[38]  B. Birren,et al.  Analysis of the cat eye syndrome critical region in humans and the region of conserved synteny in mice: a search for candidate genes at or near the human chromosome 22 pericentromere. , 2001, Genome research.

[39]  Xiang-Dong Fu,et al.  Novel nuclear autoantigen with splicing factor motifs identified with antibody from hepatocellular carcinoma. , 1993, The Journal of clinical investigation.

[40]  S Tweedie,et al.  Capturing novel mouse genes encoding chromosomal and other nuclear proteins. , 1998, Journal of cell science.

[41]  Maria Carmo-Fonseca,et al.  To be or not to be in the nucleolus , 2000, Nature Cell Biology.

[42]  G. Thomas,et al.  Genomic structure of the EWS gene and its relationship to EWSR1, a site of tumor-associated chromosome translocation. , 1993, Genomics.

[43]  G. Thireos,et al.  The Gcn5 bromodomain co-ordinates nucleosome remodelling , 2000, Nature.

[44]  M. Ashburner,et al.  Gene Ontology: tool for the unification of biology , 2000, Nature Genetics.

[45]  H. Orr,et al.  Nuclear localization of the spinocerebellar ataxia type 7 protein, ataxin-7. , 1999, Human molecular genetics.

[46]  T Nakahata,et al.  A method to identify cDNAs based on localization of green fluorescent protein fusion products. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[47]  J. Qin,et al.  Identity between TRAP and SMCC complexes indicates novel pathways for the function of nuclear receptors and diverse mammalian activators. , 1999, Molecular cell.

[48]  Juri Rappsilber,et al.  Mass spectrometry and EST-database searching allows characterization of the multi-protein spliceosome complex , 1998, Nature Genetics.

[49]  G. Payne,et al.  Clathrin Coats— Threads Laid Bare , 1998, Cell.

[50]  D. Higgs,et al.  Mutations in ATRX, encoding a SWI/SNF-like protein, cause diverse changes in the pattern of DNA methylation , 2000, Nature Genetics.

[51]  R. Brent,et al.  Two classes of proteins dependent on either the presence or absence of thyroid hormone for interaction with the thyroid hormone receptor. , 1995, Molecular endocrinology.

[52]  T Misteli,et al.  Cell biology of transcription and pre-mRNA splicing: nuclear architecture meets nuclear function. , 2000, Journal of cell science.

[53]  I. Hickson,et al.  DNA helicase deficiencies associated with cancer predisposition and premature ageing disorders. , 2001, Human molecular genetics.

[54]  A. Poustka,et al.  Systematic subcellular localization of novel proteins identified by large‐scale cDNA sequencing , 2000, EMBO reports.

[55]  K. Borden,et al.  SOLUTION STRUCTURE OF THE PHD DOMAIN FROM THE KAP-1 COREPRESSOR , 2001 .

[56]  B. Schwer A new twist on RNA helicases: DExH/D box proteins as RNPases , 2001, Nature Structural Biology.