Architecture of the human interactome defines protein communities and disease networks

The physiology of a cell can be viewed as the product of thousands of proteins acting in concert to shape the cellular response. Coordination is achieved in part through networks of protein–protein interactions that assemble functionally related proteins into complexes, organelles, and signal transduction pathways. Understanding the architecture of the human proteome has the potential to inform cellular, structural, and evolutionary mechanisms and is critical to elucidating how genome variation contributes to disease. Here we present BioPlex 2.0 (Biophysical Interactions of ORFeome-derived complexes), which uses robust affinity purification–mass spectrometry methodology to elucidate protein interaction networks and co-complexes nucleated by more than 25% of protein-coding genes from the human genome, and constitutes, to our knowledge, the largest such network so far. With more than 56,000 candidate interactions, BioPlex 2.0 contains more than 29,000 previously unknown co-associations and provides functional insights into hundreds of poorly characterized proteins while enhancing network-based analyses of domain associations, subcellular localization, and co-complex formation. Unsupervised Markov clustering of interacting proteins identified more than 1,300 protein communities representing diverse cellular activities. Genes essential for cell fitness are enriched within 53 communities representing central cellular functions. Moreover, we identified 442 communities associated with more than 2,000 disease annotations, placing numerous candidate disease genes into a cellular framework. BioPlex 2.0 exceeds previous experimentally derived interaction networks in depth and breadth, and will be a valuable resource for exploring the biology of incompletely characterized proteins and for elucidating larger-scale patterns of proteome organization.
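As a rough illustration of the Markov clustering step mentioned above, the following sketch applies the standard expansion and inflation iteration to a toy undirected graph. It is not the authors' pipeline: the function name `markov_cluster`, the inflation value of 2.0, the convergence tolerance, and the placeholder protein labels are all assumptions made for this example.

```python
def normalize_columns(m):
    """Scale each column of a square matrix in place so that it sums to 1."""
    n = len(m)
    for j in range(n):
        total = sum(m[i][j] for i in range(n))
        if total > 0:
            for i in range(n):
                m[i][j] /= total
    return m


def mat_mul(a, b):
    """Plain dense matrix product for small square matrices."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def markov_cluster(edges, inflation=2.0, max_iter=100, tol=1e-9):
    """Cluster an undirected, unweighted graph given as (node, node) pairs.

    The inflation, max_iter, and tol defaults are illustrative assumptions.
    """
    nodes = sorted({p for edge in edges for p in edge})
    index = {p: i for i, p in enumerate(nodes)}
    n = len(nodes)

    # Adjacency matrix with self-loops, then column-stochastic normalization.
    m = [[0.0] * n for _ in range(n)]
    for a, b in edges:
        m[index[a]][index[b]] = 1.0
        m[index[b]][index[a]] = 1.0
    for i in range(n):
        m[i][i] = 1.0
    normalize_columns(m)

    for _ in range(max_iter):
        expanded = mat_mul(m, m)                                  # expansion
        inflated = [[v ** inflation for v in row] for row in expanded]
        normalize_columns(inflated)                               # inflation
        delta = max(abs(inflated[i][j] - m[i][j])
                    for i in range(n) for j in range(n))
        m = inflated
        if delta < tol:
            break

    # Each row that retains mass lists the members of one cluster.
    clusters = set()
    for i in range(n):
        members = frozenset(nodes[j] for j in range(n) if m[i][j] > 1e-6)
        if members:
            clusters.add(members)
    return sorted(sorted(c) for c in clusters)


# Toy input: two disconnected triangles of placeholder proteins.
toy_edges = [("P1", "P2"), ("P2", "P3"), ("P1", "P3"),
             ("P4", "P5"), ("P5", "P6"), ("P4", "P6")]
print(markov_cluster(toy_edges))
```

On a real interaction network the edges would typically carry weights and the matrices would be sparse; the dense pure-Python version here is only meant to make the two alternating steps visible.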
