CORUM: the comprehensive resource of mammalian protein complexes

CORUM is a database that provides a manually curated repository of experimentally characterized protein complexes from mammalian organisms, mainly human (64%), mouse (16%) and rat (12%). Protein complexes are key molecular entities that integrate multiple gene products to perform cellular functions. The new CORUM 2.0 release encompasses 2837 protein complexes offering the largest and most comprehensive publicly available dataset of mammalian protein complexes. The CORUM dataset is built from 3198 different genes, representing ∼16% of the protein coding genes in humans. Each protein complex is described by a protein complex name, subunit composition, function as well as the literature reference that characterizes the respective protein complex. Recent developments include mapping of functional annotation to Gene Ontology terms as well as cross-references to Entrez Gene identifiers. In addition, a ‘Phylogenetic Conservation’ analysis tool was implemented that analyses the potential occurrence of orthologous protein complex subunits in mammals and other selected groups of organisms. This allows one to predict the occurrence of protein complexes in different phylogenetic groups. CORUM is freely accessible at (http://mips.helmholtz-muenchen.de/genre/proj/corum/index.html).

[1]  H. Mewes,et al.  The FunCat, a functional annotation scheme for systematic classification of proteins from whole genomes. , 2004, Nucleic acids research.

[2]  Sean R. Collins,et al.  Global landscape of protein complexes in the yeast Saccharomyces cerevisiae , 2006, Nature.

[3]  C. Sander,et al.  The HUPO PSI's Molecular Interaction format—a community standard for the representation of protein interaction data , 2004, Nature Biotechnology.

[4]  B. Cisneros,et al.  Nuclear and nuclear envelope localization of dystrophin Dp71 and dystrophin‐associated proteins (DAPs) in the C2C12 muscle cells: DAPs nuclear localization is modulated during myogenesis , 2008, Journal of cellular biochemistry.

[5]  M. Ashburner,et al.  Gene Ontology: tool for the unification of biology , 2000, Nature Genetics.

[6]  S. Osawa,et al.  Evolutionary relationship of archaebacteria, eubacteria, and eukaryotes inferred from phylogenetic trees of duplicated genes. , 1989, Proceedings of the National Academy of Sciences of the United States of America.

[7]  P. Bork,et al.  Proteome survey reveals modularity of the yeast cell machinery , 2006, Nature.

[8]  Elizabeth Pennisi,et al.  Working the (Gene Count) Numbers: Finally, a Firm Answer? , 2007, Science.

[9]  Igor V. Tetko,et al.  The Mouse Functional Genome Database (MfunGD): functional annotation of proteins in the light of their cellular context , 2005, Nucleic Acids Res..

[10]  B. Snel,et al.  Toward Automatic Reconstruction of a Highly Resolved Tree of Life , 2006, Science.

[11]  Hans-Werner Mewes,et al.  MPact: the MIPS protein interaction resource on yeast , 2005, Nucleic Acids Res..

[12]  Ian M. Donaldson,et al.  BIND: the Biomolecular Interaction Network Database , 2001, Nucleic Acids Res..

[13]  Claudio Altafini,et al.  Discerning static and causal interactions in genome-wide reverse engineering problems , 2008, Bioinform..

[14]  Sara Linse,et al.  Methods for the detection and analysis of protein–protein interactions , 2007, Proteomics.

[15]  Julie M. Sahalie,et al.  An experimentally derived confidence score for binary protein-protein interactions , 2008, Nature Methods.

[16]  Peer Bork,et al.  Not Comparable, But Complementary , 2008, Science.

[17]  Lan V. Zhang,et al.  Evidence for dynamically organized modularity in the yeast protein–protein interaction network , 2004, Nature.

[18]  Hunter B. Fraser,et al.  Modularity and evolutionary constraint on proteins , 2005, Nature Genetics.

[19]  M. Gerstein,et al.  Annotation transfer between genomes: protein-protein interologs and protein-DNA regulogs. , 2004, Genome research.

[20]  Sven-Eric Schelhorn,et al.  An integrative approach for predicting interactions of protein regions , 2008, ECCB.

[21]  Min-Sung Kim,et al.  COFECO: composite function annotation enriched by protein complex data , 2009, Nucleic Acids Res..

[22]  R A Garrett,et al.  Archaebacterial DNA-dependent RNA polymerases testify to the evolution of the eukaryotic nuclear genome. , 1989, Proceedings of the National Academy of Sciences of the United States of America.

[23]  István Nagy,et al.  Eubacterial proteasomes , 2004, Molecular Biology Reports.

[24]  Sarah A Teichmann,et al.  The origins and evolution of functional modules: lessons from protein complexes , 2006, Philosophical Transactions of the Royal Society B: Biological Sciences.

[25]  K. Willison,et al.  Elucidation of the subunit orientation in CCT (chaperonin containing TCP1) from the subunit composition of CCT micro‐complexes , 1997, The EMBO journal.

[26]  Shoshana J. Wodak,et al.  CYGD: the Comprehensive Yeast Genome Database , 2004, Nucleic Acids Res..

[27]  Jade Buchanan-Carter,et al.  Sequencing and de novo analysis of a coral larval transcriptome using 454 GSFlx , 2009, BMC Genomics.

[28]  William Stafford Noble,et al.  Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project , 2007, Nature.

[29]  Eric P. Hoffman,et al.  Dystrophin: The protein product of the duchenne muscular dystrophy locus , 1987, Cell.

[30]  P. Bork,et al.  Co-evolution of transcriptional and post-translational cell-cycle regulation , 2006, Nature.

[31]  Koji Tsuda,et al.  The DICS repository: module-assisted analysis of disease-related gene lists , 2009, Bioinform..

[32]  Insuk Lee,et al.  A high-accuracy consensus map of yeast protein complexes reveals modular nature of gene essentiality , 2007, BMC Bioinformatics.

[33]  K. Campbell,et al.  Muscular dystrophies and the dystrophin-glycoprotein complex. , 1997, Current opinion in neurology.

[34]  Philip M. Kim,et al.  Relating Three-Dimensional Structures to Protein Networks Provides Evolutionary Insights , 2006, Science.

[35]  Hans-Werner Mewes,et al.  CRONOS: the cross-reference navigation server , 2009, Bioinform..

[36]  Thomas Rattei,et al.  SIMAP - The similarity matrix of proteins , 2005, ECCB/JBI.

[37]  B. Alberts The Cell as a Collection of Protein Machines: Preparing the Next Generation of Molecular Biologists , 1998, Cell.

[38]  Ben Lehner,et al.  Tissue specificity and the human protein interaction network , 2009, Molecular systems biology.

[39]  Caroline C. Friedel,et al.  Conserved principles of mammalian transcriptional regulation revealed by RNA half-life , 2009, Nucleic acids research.

[40]  Pall I. Olason,et al.  A human phenome-interactome network of protein complexes implicated in genetic disorders , 2007, Nature Biotechnology.

[41]  Cathy H. Wu,et al.  UniProt: the Universal Protein knowledgebase , 2004, Nucleic Acids Res..

[42]  Paul Tempst,et al.  PINdb: a database of nuclear protein complexes from human and yeast , 2004, Bioinform..

[43]  Dmitrij Frishman,et al.  An evolutionary and structural characterization of mammalian protein complex organization , 2008, BMC Genomics.

[44]  Paul Tempst,et al.  Different EZH2-containing complexes target methylation of histone H1 or nucleosomal histone H3. , 2004, Molecular cell.

[45]  Hilla Peretz,et al.  The , 1966 .

[46]  Dmitrij Frishman,et al.  MIPS: analysis and annotation of proteins from whole genomes in 2005 , 2006, Nucleic Acids Res..

[47]  Dmitrij Frishman,et al.  MIPS: analysis and annotation of proteins from whole genomes in 2005 , 2005, Nucleic Acids Res..

[48]  Anthony A High,et al.  The Fanconi Anemia Core Complex Forms Four Complexes of Different Sizes in Different Subcellular Compartments* , 2004, Journal of Biological Chemistry.