Global spectral deconvolution based on non-negative matrix factorization in GC × GC-HRTOFMS.

A global spectral deconvolution, based on non-negative matrix factorization (NMF) in comprehensive two-dimensional gas chromatography high-resolution time-of-flight mass spectrometry, was developed. We evaluated the ability of various instrumental parameters and NMF settings to derive high-performance detection in nontarget screening using a sediment sample. To evaluate the performance of the process, a NIST library search was used to identify the deconvoluted spectra. Differences of the instrumental scan rates (25 and 50 Hz) in deconvolution were evaluated and results show that a high scan rate enhanced the number of compounds detected in the sediment sample. A higher mass resolution in the range of 1,000 to 10,000 and a higher m/z precision in the deconvolution were needed to obtain an accurate mass database. After removal of multiple duplicate hits, which occurred in batch processes of NIST library search on the deconvolution result, 62 unique assignable spectra with a match factor ≥900 were obtained in the deconvoluted chromatogram from the sediment sample, including 54 spectra that were refined by the deconvolution. This method will help to detect and build up well-resolved reference spectra from various complex mixtures and will accelerate nontarget screening.

[1]  Emma L. Schymanski,et al.  Identifying small molecules via high resolution mass spectrometry: communicating confidence. , 2014, Environmental science & technology.

[2]  Yizeng Liang,et al.  Determination of Essential Oil Composition from Osmanthus fragrans Tea by GC-MS Combined with a Chemometric Resolution Method , 2010, Molecules.

[3]  V. P. Pauca,et al.  Nonnegative matrix factorization for spectral data analysis , 2006 .

[4]  H. Sebastian Seung,et al.  Learning the parts of objects by non-negative matrix factorization , 1999, Nature.

[5]  Renaud Gaujoux,et al.  A flexible R package for nonnegative matrix factorization , 2010, BMC Bioinformatics.

[6]  Rasmus Bro,et al.  Handling within run retention time shifts in two-dimensional chromatography data using shift correction and modeling. , 2009, Journal of chromatography. A.

[7]  Tonghua Li,et al.  Overlapping spectra resolution using non-negative matrix factorization. , 2005, Talanta.

[8]  T. Ohura,et al.  Environmental analysis of chlorinated and brominated polycyclic aromatic hydrocarbons by comprehensive two-dimensional gas chromatography coupled to high-resolution time-of-flight mass spectrometry. , 2011, Journal of chromatography. A.

[9]  S. Stein An integrated method for spectrum extraction and compound identification from gas chromatography/mass spectrometry data , 1999 .

[10]  Yi-Zeng Liang,et al.  Subwindow factor analysis , 1999 .

[11]  Robert E. Synovec,et al.  Toward automated peak resolution in complete GC × GC–TOFMS chromatograms by PARAFAC , 2009 .

[12]  Shibata Yasuyuki,et al.  Quantification of polychlorinated dibenzo-p-dioxins and dibenzofurans by direct injection of sample extract into the comprehensive multidimensional gas chromatograph/high-resolution time-of-flight mass spectrometer. , 2008, Journal of chromatography. A.

[13]  A. Fushimi,et al.  Selective extraction of halogenated compounds from data measured by comprehensive multidimensional gas chromatography/high resolution time-of-flight mass spectrometry for non-target analysis of environmental and biological samples. , 2013, Journal of chromatography. A.

[14]  Sylvain Chartier,et al.  An Introduction to Independent Component Analysis: InfoMax and FastICA algorithms , 2010 .

[15]  Oliver Fiehn,et al.  Informatics for cross-sample analysis with comprehensive two-dimensional gas chromatography and high-resolution mass spectrometry (GCxGC-HRMS). , 2011, Talanta.

[16]  Hong Tao Gao,et al.  A novel trilinear decomposition algorithm: Three-dimension non-negative matrix factorization , 2007 .

[17]  Patrik O. Hoyer,et al.  Non-negative Matrix Factorization with Sparseness Constraints , 2004, J. Mach. Learn. Res..

[18]  Jamin C. Hoggard,et al.  Parallel factor analysis (PARAFAC) of target analytes in GC x GC-TOFMS data: automated selection of a model with an appropriate number of factors. , 2007, Analytical chemistry.

[19]  Shao-hui Yu,et al.  Component recognition with three‐dimensional fluorescence spectra based on non‐negative matrix factorization , 2011 .

[20]  Pierre Comon,et al.  Independent component analysis, A new concept? , 1994, Signal Process..

[21]  S. Reichenbach,et al.  Global and selective detection of organohalogens in environmental samples by comprehensive two-dimensional gas chromatography-tandem mass spectrometry and high-resolution time-of-flight mass spectrometry. , 2011, Journal of chromatography. A.

[22]  A. Fushimi,et al.  Comprehensive two-dimensional gas chromatography coupled to high-resolution time-of-flight mass spectrometry and simultaneous nitrogen phosphorous and mass spectrometric detection for characterization of nanoparticles in roadside atmosphere. , 2007, Journal of chromatography. A.

[23]  Tonghua Li,et al.  Direct decomposition of three-way arrays using a non-negative approximation. , 2010, Talanta.

[24]  Stephen Stein,et al.  Mass spectral reference libraries: an ever-expanding resource for chemical identification. , 2012, Analytical chemistry.

[25]  H. R. Keller,et al.  Heuristic evolving latent projections: resolving two-way multicomponent data. 2. Detection and resolution of minor constituents , 1992 .

[26]  J. S. Arey,et al.  Mapping environmental partitioning properties of nonpolar complex mixtures by use of GC × GC. , 2014, Environmental science & technology.

[27]  Stephen E. Reichenbach,et al.  Information technologies for comprehensive two-dimensional gas chromatography , 2004 .

[28]  Y. Kanai,et al.  Retrospective analysis by data processing tools for comprehensive two-dimensional gas chromatography coupled to high resolution time-of-flight mass spectrometry: a challenge for matrix-rich sediment core sample from Tokyo Bay. , 2014, Journal of chromatography. A.

[29]  Aapo Hyvärinen,et al.  Independent component analysis: recent advances , 2013, Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences.

[30]  Maryam Vosough,et al.  Using mean field approach independent component analysis to fatty acid characterization with overlapped GC-MS signals. , 2007, Analytica chimica acta.

[31]  Xiang Zhang,et al.  Data Dependent Peak Model Based Spectrum Deconvolution for Analysis of High Resolution LC-MS Data , 2014, Analytical chemistry.

[32]  A. Fushimi,et al.  Rapid automatic identification and quantification of compounds in complex matrices using comprehensive two-dimensional gas chromatography coupled to high resolution time-of-flight mass spectrometry with a peak sentinel tool. , 2013, Analytica chimica acta.