Systematic analyses of the transcriptome, translatome, and proteome provide a global view and potential strategy for the C-HPP.

To estimate the potential of the state-of-the-art proteomics technologies on full coverage of the encoding gene products, the Chinese Human Chromosome Proteome Consortium (CCPC) applied a multiomics strategy to systematically analyze the transciptome, translatome, and proteome of the same cultured hepatoma cells with varied metastatic potential qualitatively and quantitatively. The results provide a global view of gene expression profiles. The 9064 identified high confident proteins covered 50.2% of all gene products in the translatome. Those proteins with function of adhesion, development, reproduction, and so on are low abundant in transcriptome and translatome but absent in proteome. Taking the translatome as the background of protein expression, we found that the protein abundance plays a decisive role and hydrophobicity has a greater influence than molecular weight and isoelectric point on protein detectability. Thus, the enrichment strategy used for low-abundant transcription factors helped to identify missing proteins. In addition, those peptides with single amino acid polymorphisms played a significant role for the disease research, although they might negligibly contribute to new protein identification. The proteome raw and metadata of proteome were collected using the iProX submission system and submitted to ProteomeXchange (PXD000529, PXD000533, and PXD000535). All detailed information in this study can be accessed from the Chinese Chromosome-Centric Human Proteome Database.

[1]  Quanhui Wang,et al.  Chromosome-8-coded proteome of Chinese Chromosome Proteome Data set (CCPD) 2.0 with partial immunohistochemical verifications. , 2014, Journal of proteome research.

[2]  E. Lundberg,et al.  Towards a knowledge-based Human Protein Atlas , 2010, Nature Biotechnology.

[3]  Robertson Craig,et al.  Open source system for analyzing, validating, and storing protein identification data. , 2004, Journal of proteome research.

[4]  María Martín,et al.  Activities at the Universal Protein Resource (UniProt) , 2013, Nucleic Acids Res..

[5]  Tao Zhang,et al.  Systematic analysis of missing proteins provides clues to help define all of the protein-coding genes on human chromosome 1. , 2014, Journal of proteome research.

[6]  Eric W Deutsch,et al.  The state of the human proteome in 2012 as viewed through PeptideAtlas. , 2013, Journal of proteome research.

[7]  M. Selbach,et al.  Global quantification of mammalian gene expression control , 2011, Nature.

[8]  G. Omenn,et al.  A first step toward completion of a genome-wide characterization of the human proteome. , 2013, Journal of proteome research.

[9]  Brad T. Sherman,et al.  Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources , 2008, Nature Protocols.

[10]  The UniProt Consortium,et al.  Update on activities at the Universal Protein Resource (UniProt) in 2013 , 2012, Nucleic Acids Res..

[11]  Martin Kircher,et al.  Deep proteome and transcriptome mapping of a human cancer cell line , 2011, Molecular systems biology.

[12]  Cathy H. Wu,et al.  The Human Proteome Project: Current State and Future Direction , 2011, Molecular & Cellular Proteomics.

[13]  B. Williams,et al.  Mapping and quantifying mammalian transcriptomes by RNA-Seq , 2008, Nature Methods.

[14]  Qing-Yu He,et al.  Translating mRNAs strongly correlate to proteins in a multivariate manner and their translation ratios are phenotype specific , 2013, Nucleic acids research.

[15]  Cheng Chang,et al.  Omics evidence: single nucleotide variants transmissions on chromosome 20 in liver cancer cell lines. , 2014, Journal of proteome research.

[16]  Amos Bairoch,et al.  neXtProt: a knowledge platform for human proteins , 2011, Nucleic Acids Res..

[17]  Mark D. Robinson,et al.  edgeR: a Bioconductor package for differential expression analysis of digital gene expression data , 2009, Bioinform..

[18]  Mingming Jia,et al.  COSMIC (the Catalogue of Somatic Mutations in Cancer): a resource to investigate acquired mutations in human cancer , 2009, Nucleic Acids Res..

[19]  J. Ellenberg,et al.  The quantitative proteome of a human cell line , 2011, Molecular systems biology.

[20]  Yun-ping Zhu,et al.  Regular Patterns for Proteome-Wide Distribution of Protein Abundance across Species , 2012, PloS one.

[21]  Cheng Chang,et al.  First proteomic exploration of protein-encoding genes on chromosome 1 in human liver, stomach, and colon. , 2013, Journal of proteome research.

[22]  S. Hanash,et al.  The Chromosome-Centric Human Proteome Project for cataloging proteins encoded in the genome , 2012, Nature Biotechnology.

[23]  Mona Singh,et al.  Measuring differential gene expression by short read sequencing: quantitative comparison to 2-channel gene expression microarrays , 2009, BMC Genomics.

[24]  A. Valleriani,et al.  FANSe: an accurate algorithm for quantitative mapping of large scale sequencing reads , 2012, Nucleic acids research.

[25]  Liwei Li,et al.  PepDistiller: A quality control tool to improve the sensitivity and accuracy of peptide identifications in shotgun proteomics , 2012, Proteomics.

[26]  Yunping Zhu,et al.  ProPAS: standalone software to analyze protein properties , 2012, Bioinformation.

[27]  Matthias Mann,et al.  Analysis of High Accuracy, Quantitative Proteomics Data in the MaxQB Database , 2012, Molecular & Cellular Proteomics.

[28]  Wei Zhang,et al.  SILVER: an efficient tool for stable isotope labeling LC-MS data quantitative analysis with quality control methods , 2013, Bioinform..

[29]  Cheng Chang,et al.  Qualitative and quantitative expression status of the human chromosome 20 genes in cancer tissues and the representative cell lines. , 2013, Journal of proteome research.

[30]  Jun Qin,et al.  Proteome-wide profiling of activated transcription factors with a concatenated tandem array of transcription factor response elements , 2013, Proceedings of the National Academy of Sciences.

[31]  Juan M. Vaquerizas,et al.  A census of human transcription factors: function, expression and evolution , 2009, Nature Reviews Genetics.

[32]  M. Mann,et al.  Defining the transcriptome and proteome in three functionally different human cell lines , 2010, Molecular systems biology.

[33]  Cheng Chang,et al.  Proteome atlas of human chromosome 8 and its multiple 8p deficiencies in tumorigenesis of the stomach, colon, and liver. , 2013, Journal of proteome research.

[34]  Yan Li,et al.  Stepwise metastatic human hepatocellular carcinoma cell model system with multiple metastatic potentials established through consecutive in vivo selection and studies on metastatic characteristics , 2004, Journal of Cancer Research and Clinical Oncology.

[35]  Cheng Chang,et al.  Systematic research on the pretreatment of peptides for quantitative proteomics using a C18 microcolumn , 2013, Proteomics.