A Data Analysis Protocol for Quantitative Data-Independent Acquisition Proteomics.

Data-independent acquisition (DIA) mass spectrometry, such as the SWATH-MS technology, enables accurate and consistent measurement of proteins, which is crucial for comparative proteomics studies. However, there is a lack of free and easy-to-implement data analysis protocols that can handle the different data processing steps, from raw spectrum files to a peptide intensity matrix and its downstream analysis. Here, we provide a data analysis protocol, named diatools, that covers all of these steps, from spectral library building to differential expression analysis of DIA proteomics data. The data analysis tools used in this protocol are open source, and the protocol is distributed on Docker Hub as a complete software environment that supports Linux, Windows, and macOS operating systems.
