Chemometrics in analytical chemistry—part I: history, experimental design and data analysis tools

Chemometrics has gained wide recognition and made substantial progress in analytical chemistry. This first part of the tutorial summarises the main achievements and contributions of chemometrics to some of the most important stages of the analytical process, such as experimental design, sampling, and data analysis (including data pretreatment and fusion). The tutorial is intended to give a general, up-to-date overview of the chemometrics field and to support its dissemination and promotion in analytical chemistry.
