A fuzzy distance metric for measuring the dissimilarity of planar chromatographic profiles with application to denaturing gradient gel electrophoresis data from human skin microbes: demonstration of an individual and gender-based fingerprint.

A newly devised fuzzy metric for measuring the dissimilarity between two planar chromatographic profiles is proposed in this paper. It does not require an accurately assigned sample-feature matrix and can cope with slight imprecision of the positional information. This makes it very suitable for 1-D techniques which do not have a second spectroscopic dimension to aid variable assignment. The usefulness of this metric has been demonstrated on a large data set consisting of nearly 400 samples from Denaturing Gradient Gel Electrophoresis (DGGE) analysis of microbes on human skin. The pattern revealed by this dissimilarity metric was compared with the one represented by a sample-feature matrix and highly consistent results were obtained. Several pattern recognition techniques have been applied on the dissimilarity matrix based on this dissimilarity metric. According to rank analysis, within-individual variation is significantly less than between-individual variation, suggesting a unique individual microbial fingerprint. Principal Coordinates Analysis (PCO) suggests that there is a considerable separation between genders. These results suggest that there are specific microbial colonies characteristic of individuals.

[1]  Paul Geladi,et al.  Principal Component Analysis , 1987, Comprehensive Chemometrics.

[2]  Richard G. Brereton,et al.  Pattern Recognition of Gas Chromatography Mass Spectrometry of Human Volatiles in Sweat to distinguish the sex of subjects and determine potential Discriminatory Marker Peaks , 2007 .

[3]  D. Penn,et al.  Individual and gender fingerprints in human body odour , 2007, Journal of The Royal Society Interface.

[4]  Dustin J Penn,et al.  In situ surface sampling of biological objects and preconcentration of their volatiles for chromatographic analysis. , 2006, Analytical chemistry.

[5]  D. Penn,et al.  An automated method for peak detection and matching in large gas chromatography‐mass spectrometry data sets , 2006 .

[6]  A. G. Asuero,et al.  The Correlation Coefficient: An Overview , 2006 .

[7]  D L Massart,et al.  Automatic program for peak detection and deconvolution of multi-overlapped chromatographic signals part I: peak detection. , 2005, Journal of chromatography. A.

[8]  J. R. Torres-Lapasió,et al.  Automatic program for peak detection and deconvolution of multi-overlapped chromatographic signals part II: peak model and deconvolution algorithms. , 2005, Journal of chromatography. A.

[9]  J. D. Elsas,et al.  Molecular Microbial Ecology Manual , 2013, Springer Netherlands.

[10]  D. Rodríguez-Lázaro,et al.  Rapid Quantitative Detection of Listeria monocytogenes in Meat Products by Real-Time PCR , 2004, Applied and Environmental Microbiology.

[11]  J. Trevors,et al.  The development of a DNA microarray-based assay for the characterization of commercially formulated microbial products. , 2004, Journal of microbiological methods.

[12]  Frans van den Berg,et al.  Correlation optimized warping and dynamic time warping as preprocessing methods for chromatographic data , 2004 .

[13]  Danilo Ercolini,et al.  PCR-DGGE fingerprinting: novel strategies for detection of microbes in food. , 2004, Journal of microbiological methods.

[14]  David E. Booth,et al.  Chemometrics: Data Analysis for the Laboratory and Chemical Plant , 2004, Technometrics.

[15]  A. Cañada,et al.  Handbook of differential equations , 2004 .

[16]  E. Meijering A chronology of interpolation: from ancient astronomy to modern signal and image processing , 2002, Proc. IEEE.

[17]  B. Díez,et al.  Application of Denaturing Gradient Gel Electrophoresis (DGGE) To Study the Diversity of Marine Picoeukaryotic Assemblages and Comparison of DGGE with Other Molecular Techniques , 2001, Applied and Environmental Microbiology.

[18]  G. Muyzer DGGE/TGGE a method for identifying genes from natural ecosystems. , 1999, Current opinion in microbiology.

[19]  J. Carstensen,et al.  Aligning of single and multiple wavelength chromatographic profiles for chemometric data analysis using correlation optimised warping , 1998 .

[20]  P. Guldberg,et al.  Fingerprinting sequence variation in ribosomal DNA of parasites by DGGE. , 1996, Molecular and cellular probes.

[21]  A. Uitterlinden,et al.  Profiling of complex microbial populations by denaturing gradient gel electrophoresis analysis of polymerase chain reaction-amplified genes coding for 16S rRNA , 1993, Applied and environmental microbiology.

[22]  Thomas L. Isenhour,et al.  Time-warping algorithm applied to chromatographic peak matching gas chromatography/Fourier transform infrared/mass spectrometry , 1987 .

[23]  R. D'Agostino,et al.  Goodness-of-Fit-Techniques , 1987 .

[24]  P. Robert,et al.  A Unifying Tool for Linear Multivariate Statistical Methods: The RV‐Coefficient , 1976 .

[25]  J. Gower Some distance properties of latent root and vector methods used in multivariate analysis , 1966 .

[26]  A. Kligman,et al.  A new method for the quantitative investigation of cutaneous bacteria. , 1965, The Journal of investigative dermatology.

[27]  A. Savitzky,et al.  Smoothing and Differentiation of Data by Simplified Least Squares Procedures. , 1964 .