Chemometrics for Data Interpretation: Application of Principal Components Analysis (PCA) to Multivariate Spectroscopic Measurements

Extracting relevant and useful information from measurements is an issue of paramount importance and it can be considered as complementary to the process of data acquisition. This is a crucial point especially in the field of chemical measurements, where data sets can consist of hundreds or even thousands of variables so their interpretation can require long time. Chemometrics try to tackle this issue by applying mathematical and statistical tools to data coming from chemical, biological or medical analyses. Among possible methods, Principal Components Analysis (PCA) has found wide application in the I&M field thanks to its ability to identify patterns in acquired measurements and classify data in different groups. Possible applications span from chemicals detection [1] to concentration estimation of compounds in a given system [2]. Actually, many studies demonstrated the possibility to use PCA to process different kinds of data [3], in some cases coupling PCA to other tools such as artificial neural networks to improve the processing performance [4].

[1]  Charles K. Bayne Practical Guide to Chemometrics , 1995 .

[2]  Robert T. Downs,et al.  The power of databases: The RRUFF project , 2016 .

[3]  Joel Nothman,et al.  SciPy 1.0-Fundamental Algorithms for Scientific Computing in Python , 2019, ArXiv.

[4]  Abdallah El Hamidi,et al.  A Direct PCA-Based Approach for Real-Time Description of Physiological Organ Deformations , 2015, IEEE Transactions on Medical Imaging.

[5]  A. Savitzky,et al.  Smoothing and Differentiation of Data by Simplified Least Squares Procedures. , 1964 .

[6]  P. Eilers A perfect smoother. , 2003, Analytical chemistry.

[7]  Eric R. Ziegel,et al.  Statistics and Chemometrics for Analytical Chemistry , 2004, Technometrics.

[8]  Hu Lizhi,et al.  Discrimination of olive oil adulterated with vegetable oils using dielectric spectroscopy , 2010 .

[9]  R. Barnes,et al.  Standard Normal Variate Transformation and De-Trending of Near-Infrared Diffuse Reflectance Spectra , 1989 .

[10]  Keshav Kumar Principal component analysis: Most favourite tool in chemometrics , 2017 .

[11]  Xiaomin Li,et al.  Fuel-Type Identification Using Joint Probability Density Arbiter and Soft-Computing Techniques , 2012, IEEE Transactions on Instrumentation and Measurement.

[12]  Luiz Eduardo Borges da Silva,et al.  An ultrasound-based water-cut meter for heavy fuel oil , 2019 .

[13]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[14]  Amanda F. Mejia,et al.  PCA leverage: outlier detection for high‐dimensional functional magnetic resonance imaging data , 2015, Biostatistics.

[15]  Eduardo García-Breijo,et al.  Low-Cost Electronic Tongue System and Its Application to Explosive Detection , 2013, IEEE Transactions on Instrumentation and Measurement.