Knowledge Discovery in Spectral Data by Means of Complex Networks

In the last decade, complex networks have widely been applied to the study of many natural and man-made systems, and to the extraction of meaningful information from the interaction structures created by genes and proteins. Nevertheless, less attention has been devoted to metabonomics, due to the lack of a natural network representation of spectral data. Here we define a technique for reconstructing networks from spectral data sets, where nodes represent spectral bins, and pairs of them are connected when their intensities follow a pattern associated with a disease. The structural analysis of the resulting network can then be used to feed standard data-mining algorithms, for instance for the classification of new (unlabeled) subjects. Furthermore, we show how the structure of the network is resilient to the presence of external additive noise, and how it can be used to extract relevant knowledge about the development of the disease.

[1]  Jerry Workman,et al.  Applied Spectroscopy: A Compact Reference for Practitioners , 1998 .

[2]  Albert-László Barabási,et al.  Statistical mechanics of complex networks , 2001, ArXiv.

[3]  W. Couser,et al.  Glomerulonephritis , 1999, The Lancet.

[4]  Radford M. Neal Pattern Recognition and Machine Learning , 2007, Technometrics.

[5]  K. Siamopoulos,et al.  Evaluation of tubulointerstitial lesions' severity in patients with glomerulonephritides: an NMR-based metabonomic study. , 2007, Journal of proteome research.

[6]  M E Newman,et al.  Scientific collaboration networks. I. Network construction and fundamental results. , 2001, Physical review. E, Statistical, nonlinear, and soft matter physics.

[7]  Trevor Hastie,et al.  The Elements of Statistical Learning , 2001 .

[8]  S. Shen-Orr,et al.  Network motifs in the transcriptional regulation network of Escherichia coli , 2002, Nature Genetics.

[9]  Vladimir Shulaev,et al.  Metabolomics technology and bioinformatics , 2006, Briefings Bioinform..

[10]  J. C. Martínez-Espinosa,et al.  Monitoring of chemotherapy leukemia treatment using Raman spectroscopy and principal component analysis , 2014, Lasers in Medical Science.

[11]  V. Latora,et al.  Complex networks: Structure and dynamics , 2006 .

[12]  J. Lindon,et al.  'Metabonomics': understanding the metabolic responses of living systems to pathophysiological stimuli via multivariate statistical analysis of biological NMR spectroscopic data. , 1999, Xenobiotica; the fate of foreign compounds in biological systems.

[13]  J. Thiery,et al.  Complex networks orchestrate epithelial–mesenchymal transitions , 2006, Nature Reviews Molecular Cell Biology.

[14]  L. da F. Costa,et al.  Characterization of complex networks: A survey of measurements , 2005, cond-mat/0505185.

[15]  David S. Wishart,et al.  MetaboAnalyst 2.0—a comprehensive server for metabolomic data analysis , 2012, Nucleic Acids Res..

[16]  P. Anderson More is different. , 1972, Science.

[17]  V Latora,et al.  Efficient behavior of small-world networks. , 2001, Physical review letters.

[18]  Kazuhiro Takemoto,et al.  Current Understanding of the Formation and Adaptation of Metabolic Systems Based on Network Theory , 2012, Metabolites.

[19]  Tom Starzl,et al.  THE LANCET , 1992, The Lancet.

[20]  Mark E. J. Newman,et al.  The Structure and Function of Complex Networks , 2003, SIAM Rev..

[21]  Massimiliano Zanin,et al.  Optimizing Functional Network Representation of Multivariate Time Series , 2012, Scientific Reports.

[22]  Pedro Mendes,et al.  Bioinformatics Approaches to Integrate Metabolomics and Other Systems Biology Data , 2006 .

[23]  J. Lindon,et al.  Metabonomics: a platform for studying drug toxicity and gene function , 2002, Nature Reviews Drug Discovery.

[24]  Phillip Bonacich,et al.  Eigenvector-like measures of centrality for asymmetric relations , 2001, Soc. Networks.

[25]  Marc-Thorsten Hütt,et al.  A Topological Characterization of Medium-Dependent Essential Metabolic Reactions , 2012, Metabolites.

[26]  S. Boccaletti,et al.  Complex networks analysis of obstructive nephropathy data. , 2011, Chaos.

[27]  I. R. Lewis,et al.  Handbook of Raman Spectroscopy: From the Research Laboratory to the Process Line , 2001 .

[28]  David S. Wishart,et al.  MetaboAnalyst: a web server for metabolomic data analysis and interpretation , 2009, Nucleic Acids Res..

[29]  Ernestina Menasalvas Ruiz,et al.  Preprocessing and analyzing genetic data with complex networks: An application to Obstructive Nephropathy , 2012, Networks Heterog. Media.

[30]  Massimiliano Pontil,et al.  Support Vector Machines: Theory and Applications , 2001, Machine Learning and Its Applications.

[31]  O. Sporns,et al.  Complex brain networks: graph theoretical analysis of structural and functional systems , 2009, Nature Reviews Neuroscience.