iProX: an integrated proteome resource

Abstract Sharing of research data in public repositories has become best practice in academia. With the accumulation of massive data, network bandwidth and storage requirements are rapidly increasing. The ProteomeXchange (PX) consortium implements a mode of centralized metadata and distributed raw data management, which promotes effective data sharing. To facilitate open access of proteome data worldwide, we have developed the integrated proteome resource iProX (http://www.iprox.org) as a public platform for collecting and sharing raw data, analysis results and metadata obtained from proteomics experiments. The iProX repository employs a web-based proteome data submission process and open sharing of mass spectrometry-based proteomics datasets. Also, it deploys extensive controlled vocabularies and ontologies to annotate proteomics datasets. Users can use a GUI to provide and access data through a fast Aspera-based transfer tool. iProX is a full member of the PX consortium; all released datasets are freely accessible to the public. iProX is based on a high availability architecture and has been deployed as part of the proteomics infrastructure of China, ensuring long-term and stable resource support. iProX will facilitate worldwide data analysis and sharing of proteomics experiments.

[1]  Jun Fan,et al.  The mzTab Data Exchange Format: Communicating Mass-spectrometry-based Proteomics and Metabolomics Experimental Results to a Wider Audience* , 2014, Molecular & Cellular Proteomics.

[2]  Andrew R. Jones,et al.  ProteomeXchange provides globally co-ordinated proteomics data submission and dissemination , 2014, Nature Biotechnology.

[3]  Mary Shimoyama,et al.  Disease Ontology: improving and unifying disease annotations across species , 2018, Disease Models & Mechanisms.

[4]  Mingwei Liu,et al.  A proteomic landscape of diffuse-type gastric cancer , 2018, Nature Communications.

[5]  Juan Antonio Vizcaíno,et al.  ms-data-core-api: an open-source, metadata-oriented library for computational proteomics , 2015, Bioinform..

[6]  Johannes Griss,et al.  jmzTab: A Java interface to the mzTab data standard , 2014, Proteomics.

[7]  Michael J MacCoss,et al.  Panorama Public: A Public Repository for Quantitative Data Sets Processed in Skyline* , 2018, Molecular & Cellular Proteomics.

[8]  Martin Eisenacher,et al.  The mzQuantML Data Standard for Mass Spectrometry–based Quantitative Studies in Proteomics , 2013, Molecular & Cellular Proteomics.

[9]  Martin Eisenacher,et al.  The HUPO proteomics standards initiative- mass spectrometry controlled vocabulary , 2013, Database J. Biol. Databases Curation.

[10]  Lennart Martens,et al.  The PRoteomics IDEntification (PRIDE) Converter 2 Framework: An Improved Suite of Tools to Facilitate Data Submission to the PRIDE Database and the ProteomeXchange Consortium , 2012, Molecular & Cellular Proteomics.

[11]  Qing-Yu He,et al.  Detergent-Insoluble Proteome Analysis Revealed Aberrantly Aggregated Proteins in Human Preeclampsia Placentas. , 2017, Journal of proteome research.

[12]  Qing-Yu He,et al.  Identification of Missing Proteins Defined by Chromosome-Centric Proteome Project in the Cytoplasmic Detergent-Insoluble Proteins. , 2015, Journal of proteome research.

[13]  Harald Barsnes,et al.  OLS Client and OLS Dialog: Open Source Tools to Annotate Public Omics Datasets , 2017, Proteomics.

[14]  Richard D. Smith,et al.  Recommendations for mass spectrometry data quality metrics for open access data (corollary to the Amsterdam principles) , 2012, Proteomics.

[15]  Juan P Albar,et al.  The Minimal Information about a Proteomics Experiment (MIAPE) from the Proteomics Standards Initiative. , 2014, Methods in molecular biology.

[16]  Alan Ruttenberg,et al.  The Cell Ontology 2016: enhanced content, modularization, and ontology interoperability , 2016, J. Biomed. Semant..

[17]  Juan Antonio Vizcaíno,et al.  The ProteomeXchange consortium in 2017: supporting the cultural change in proteomics public data deposition , 2016, Nucleic Acids Res..

[18]  Luisa Montecchi-Palazzi,et al.  The PSI-MOD community standard for representation of protein modification data , 2008, Nature Biotechnology.

[19]  Qing-Yu He,et al.  Phosphoproteome Characterization of Human Colorectal Cancer SW620 Cell-Derived Exosomes and New Phosphosite Discovery for C-HPP. , 2016, Journal of proteome research.

[20]  Martin Eisenacher,et al.  Proteomics Standards Initiative: Fifteen Years of Progress and Future Work , 2017, Journal of proteome research.

[21]  Xianyu Li,et al.  Mining the human plasma proteome with three-dimensional strategies by high-resolution Quadrupole Orbitrap Mass Spectrometry. , 2016, Analytica chimica acta.

[22]  Martin Eisenacher,et al.  Controlled vocabularies and ontologies in proteomics: Overview, principles and practice , 2014, Biochimica et biophysica acta.

[23]  Phil Andrews,et al.  Recommendations from the 2008 International Summit on Proteomics Data Release and Sharing Policy: the Amsterdam principles. , 2009, Journal of proteome research.

[24]  Masaki Matsumoto,et al.  jPOSTrepo: an international standard data repository for proteomes , 2016, Nucleic Acids Res..

[25]  Lennart Martens,et al.  mzML—a Community Standard for Mass Spectrometry Data* , 2010, Molecular & Cellular Proteomics.

[26]  Antje Chang,et al.  BRENDA in 2017: new perspectives and new tools in BRENDA , 2016, Nucleic Acids Res..

[27]  José A. Dianes,et al.  2016 update of the PRIDE database and its related tools , 2016, Nucleic Acids Res..

[28]  Luis Mendoza,et al.  PASSEL: The PeptideAtlas SRMexperiment library , 2012, Proteomics.

[29]  Harald Barsnes,et al.  The mzIdentML Data Standard Version 1.2, Supporting Advances in Proteome Informatics* , 2017, Molecular & Cellular Proteomics.

[30]  Richard D Smith,et al.  Recommendations for mass spectrometry data quality metrics for open access data (corollary to the Amsterdam Principles). , 2012, Journal of proteome research.

[31]  Robert Petryszak,et al.  Discovering and linking public omics data sets using the Omics Discovery Index , 2017, Nature Biotechnology.

[32]  Huaizhong Zhang,et al.  The mzqLibrary – An open source Java library supporting the HUPO‐PSI quantitative proteomics standard , 2015, Proteomics.