Clustering based on periodicity in high‐throughput time course data

Nuclear magnetic resonance (NMR) spectroscopy, traditionally used in analytical chemistry, has recently been introduced to studies of metabolite composition of biological fluids and tissues. Metabolite levels change over time, and providing a tool for better extraction of NMR peaks exhibiting periodic behavior is of interest. We propose a method in which NMR peaks are clustered based on periodic behavior. Periodic regression is used to obtain estimates of the parameter corresponding to period for individual NMR peaks. A mixture model is then used to develop clusters of peaks, taking into account the variability of the regression parameter estimates. Methods are applied to NMR data collected from human blood plasma over a 24‐h period. Simulation studies show that the extra variance component due to the estimation of the parameter estimate should be accounted for in the clustering procedure. © 2011 Wiley Periodicals, Inc. Statistical Analysis and Data Mining 4: 579–589, 2011

[1]  Tianwei Yu,et al.  An exploratory data analysis method to reveal modular latent structures in high-throughput data , 2010, BMC Bioinformatics.

[2]  Elaine Holmes,et al.  Prediction and classification of drug toxicity using probabilistic modeling of temporal metabolic data: the consortium on metabonomic toxicology screening approach. , 2007, Journal of proteome research.

[3]  Adrian E. Raftery,et al.  How Many Clusters? Which Clustering Method? Answers Via Model-Based Cluster Analysis , 1998, Comput. J..

[4]  Mark R Viant,et al.  An NMR metabolomic investigation of early metabolic disturbances following traumatic brain injury in a mammalian model , 2005, NMR in biomedicine.

[5]  Dean P. Jones,et al.  Individual variation in macronutrient regulation measured by proton magnetic resonance spectroscopy of human plasma. , 2009, American journal of physiology. Regulatory, integrative and comparative physiology.

[6]  John M. Walker,et al.  Metabolic Profiling , 2011, Methods in Molecular Biology.

[7]  Oliver Fiehn,et al.  Metabolite profiling of human colon carcinoma – deregulation of TCA cycle and amino acid turnover , 2008, Molecular Cancer.

[8]  G. Schwarz Estimating the Dimension of a Model , 1978 .

[9]  O. Fiehn,et al.  Mass spectrometry-based metabolic profiling reveals different metabolite patterns in invasive ovarian carcinomas and ovarian borderline tumors. , 2006, Cancer research.

[10]  T. Ebbels,et al.  NMR-based metabonomic toxicity classification: hierarchical cluster analysis and k-nearest-neighbour approaches , 2003 .

[11]  Robert Tibshirani,et al.  The Elements of Statistical Learning: Data Mining, Inference, and Prediction , 2001, Springer Series in Statistics.

[12]  Henrik Antti,et al.  Comparative metabonomics of differential hydrazine toxicity in the rat and mouse. , 2005, Toxicology and applied pharmacology.

[13]  M. Spraul,et al.  750 MHz 1H and 1H-13C NMR spectroscopy of human blood plasma. , 1995, Analytical chemistry.

[14]  Alan Hutson,et al.  Detection of epithelial ovarian cancer using 1H‐NMR‐based metabonomics , 2005, International journal of cancer.

[15]  Dean P. Jones,et al.  Diurnal variation in glutathione and cysteine redox states in human plasma. , 2007, The American journal of clinical nutrition.

[16]  L. Qin,et al.  The Clustering of Regression Models Method with Applications in Gene Expression Data , 2006, Biometrics.

[17]  Michael Ruogu Zhang,et al.  Comprehensive identification of cell cycle-regulated genes of the yeast Saccharomyces cerevisiae by microarray hybridization. , 1998, Molecular biology of the cell.

[18]  Jianhong Wu,et al.  Data clustering - theory, algorithms, and applications , 2007 .

[19]  B. G. Quinn,et al.  Estimating the frequency of a periodic function , 1991 .

[20]  M. Lipsky,et al.  American Medical Association Complete Medical Encyclopedia , 2003 .

[21]  D. Botstein,et al.  Cluster analysis and display of genome-wide expression patterns. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[22]  A. B. Simon Algebra and trigonometry with analytic geometry , 1979 .

[23]  T. Ebbels,et al.  Metabolic profiling, metabolomic and metabonomic procedures for NMR spectroscopy of urine, plasma, serum and tissue extracts , 2007, Nature Protocols.

[24]  Rebecca Nugent,et al.  An overview of clustering applied to molecular biology. , 2010, Methods in molecular biology.

[25]  Heekuck Oh,et al.  Neural Networks for Pattern Recognition , 1993, Adv. Comput..

[26]  A. Roche,et al.  Organic Chemistry: , 1982, Nature.

[27]  H. J. Andersen,et al.  NMR-based metabonomic studies reveal changes in the biochemical profile of plasma and urine from pigs fed high-fibre rye bread , 2006, British Journal of Nutrition.

[28]  David L. Woodruff,et al.  Beam search for peak alignment of NMR signals , 2004 .

[29]  M. Barker,et al.  Partial least squares for discrimination , 2003 .