CORUM: the comprehensive resource of mammalian protein complexes—2019

Abstract: CORUM is a manually curated database of experimentally characterized protein complexes from mammalian organisms, mainly human (67%), mouse (15%) and rat (10%). Given the vital functions of these macromolecular machines, their identification and functional characterization are foundational to our understanding of normal and disease biology. The new CORUM 3.0 release encompasses 4274 protein complexes, offering the largest and most comprehensive publicly available dataset of mammalian protein complexes. The CORUM dataset is built from 4473 different genes, representing 22% of the protein-coding genes in humans. Each protein complex is described by its name, subunit composition and cellular functions, together with the supporting literature references. Subunit stoichiometry is included where experimental data are available. Recent developments include a graphical tool that displays known interactions between subunits, which allows the prediction of structural interconnections within protein complexes of unknown structure. In addition, we present a set of 58 protein complexes with alternatively spliced subunits. These splice variants were found to affect cellular functions such as regulation of apoptotic activity and protein complex assembly, or to define cellular localization. CORUM is freely accessible at http://mips.helmholtz-muenchen.de/corum/.
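The record layout named in the abstract (complex name, subunit composition, cellular functions, literature references, and stoichiometry where available) and the idea of displaying known interactions between subunits can be sketched as below. This is only an illustration under stated assumptions: the class `ProteinComplex`, its field names, the helper `interaction_graph`, and the placeholder entry are hypothetical and do not reflect CORUM's actual schema, download format, or graphical tool.

```python
from dataclasses import dataclass, field
from typing import Dict, Iterable, List, Optional, Set, Tuple


@dataclass
class ProteinComplex:
    """Hypothetical record mirroring the fields named in the abstract."""
    name: str
    subunits: List[str]
    functions: List[str] = field(default_factory=list)
    references: List[str] = field(default_factory=list)
    # Optional: recorded only when experimental data are available.
    stoichiometry: Optional[Dict[str, int]] = None


def interaction_graph(
    cx: ProteinComplex, known_pairs: Iterable[Tuple[str, str]]
) -> Dict[str, Set[str]]:
    """Build an undirected adjacency map of known pairwise interactions,
    restricted to the subunits of the given complex."""
    members = set(cx.subunits)
    graph: Dict[str, Set[str]] = {s: set() for s in cx.subunits}
    for a, b in known_pairs:
        # Ignore self-pairs and pairs involving proteins outside the complex.
        if a != b and a in members and b in members:
            graph[a].add(b)
            graph[b].add(a)
    return graph


# Placeholder entry (not real CORUM data): subunits "A", "B", "C".
example = ProteinComplex(
    name="Example complex",
    subunits=["A", "B", "C"],
    functions=["placeholder function"],
    references=["placeholder reference"],
)
# Placeholder pairwise interactions; "X" is not a subunit and is dropped.
pairs = [("A", "B"), ("B", "C"), ("C", "X")]
graph = interaction_graph(example, pairs)
```

In this sketch, subunits with no direct edge between them (here "A" and "C") are connected only through a shared partner, which is the kind of structural interconnection such a display would let a reader infer.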
