A novel approach based on KATZ measure to predict associations of human microbiota with non‐infectious diseases

Motivation: Accumulating clinical observations have indicated that microbes living in the human body are closely associated with a wide range of human noninfectious diseases, which provides promising insights into the complex disease mechanism understanding. Predicting microbe‐disease associations could not only boost human disease diagnostic and prognostic, but also improve the new drug development. However, little efforts have been attempted to understand and predict human microbe‐disease associations on a large scale until now. Results: In this work, we constructed a microbe‐human disease association network and further developed a novel computational model of KATZ measure for Human Microbe‐Disease Association prediction (KATZHMDA) based on the assumption that functionally similar microbes tend to have similar interaction and non‐interaction patterns with noninfectious diseases, and vice versa. To our knowledge, KATZHMDA is the first tool for microbe‐disease association prediction. The reliable prediction performance could be attributed to the use of KATZ measurement, and the introduction of Gaussian interaction profile kernel similarity for microbes and diseases. LOOCV and k‐fold cross validation were implemented to evaluate the effectiveness of this novel computational model based on known microbe‐disease associations obtained from HMDAD database. As a result, KATZHMDA achieved reliable performance with average AUCs of 0.8130 ± 0.0054, 0.8301 ± 0.0033 and 0.8382 in 2‐fold and 5‐fold cross validation and LOOCV framework, respectively. It is anticipated that KATZHMDA could be used to obtain more novel microbes associated with important noninfectious human diseases and therefore benefit drug discovery and human medical improvement. Availability and Implementation: Matlab codes and dataset explored in this work are available at http://dwz.cn/4oX5mS. Contacts: xingchen@amss.ac.cn or zhuhongyou@gmail.com or wangxuesongcumt@163.com Supplementary information: Supplementary data are available at Bioinformatics online.

[1]  E. Stewart Growing Unculturable Bacteria , 2012, Journal of bacteriology.

[2]  Xuefei Shi,et al.  A critical role for the long non‐coding RNA GAS5 in proliferation and apoptosis in non‐small‐cell lung cancer , 2015, Molecular carcinogenesis.

[3]  Qionghai Dai,et al.  Constructing lncRNA functional similarity network based on lncRNA-disease associations and disease semantic similarity , 2015, Scientific Reports.

[4]  R. Feiner,et al.  A new perspective on lysogeny: prophages as active regulatory switches of bacteria , 2015, Nature Reviews Microbiology.

[5]  M. Crowell,et al.  Human gut microbiota in obesity and after gastric bypass , 2009, Proceedings of the National Academy of Sciences.

[6]  F. Bäckhed,et al.  Obesity alters gut microbial ecology. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[7]  Ruslan Medzhitov,et al.  Disease Tolerance as a Defense Strategy , 2012, Science.

[8]  Zhu-Hong You,et al.  Using manifold embedding for assessing and predicting protein interactions from high-throughput experimental data , 2010, Bioinform..

[9]  Xing Chen miREFRWR: a novel disease-related microRNA-environmental factor interactions prediction method. , 2016, Molecular bioSystems.

[10]  F. Bäckhed,et al.  The gut microbiota — masters of host development and physiology , 2013, Nature Reviews Microbiology.

[11]  Lars Råberg,et al.  Decomposing health: tolerance and resistance to parasites in animals , 2009, Philosophical Transactions of the Royal Society B: Biological Sciences.

[12]  Shareef M Dabdoub,et al.  The subgingival microbiome of clinically healthy current and never smokers , 2014, The ISME Journal.

[13]  Pierre Cochat,et al.  Efficacy and safety of Oxalobacter formigenes to reduce urinary oxalate in primary hyperoxaluria. , 2011, Nephrology, dialysis, transplantation : official publication of the European Dialysis and Transplant Association - European Renal Association.

[14]  Ruslan Medzhitov,et al.  Role of Tissue Protection in Lethal Respiratory Viral-Bacterial Coinfection , 2013, Science.

[15]  Xing Chen,et al.  HGIMDA: Heterogeneous graph inference for miRNA-disease association prediction , 2016, Oncotarget.

[16]  Zhen Ji,et al.  Prediction of protein-protein interactions from amino acid sequences using a novel multi-scale continuous and discontinuous feature set , 2014, BMC Bioinformatics.

[17]  J. Neu,et al.  Intestinal Microbial Ecology in Premature Infants Assessed with Non–Culture-Based Techniques , 2010 .

[18]  Roded Sharan,et al.  Associating Genes and Protein Complexes with Disease via Network Propagation , 2010, PLoS Comput. Biol..

[19]  Qionghai Dai,et al.  WBSMDA: Within and Between Score for MiRNA-Disease Association prediction , 2016, Scientific Reports.

[20]  Emily R. Davenport,et al.  Seasonal Variation in Human Gut Microbiome Composition , 2014, PloS one.

[21]  Zhu-Hong You,et al.  ILNCSIM: improved lncRNA functional similarity calculation model , 2016, Oncotarget.

[22]  Se Jin Song,et al.  The treatment-naive microbiome in new-onset Crohn's disease. , 2014, Cell host & microbe.

[23]  Peter Cimermancic,et al.  A Systematic Analysis of Biosynthetic Gene Clusters in the Human Microbiome Reveals a Common Family of Antibiotics , 2014, Cell.

[24]  Skov,et al.  Bacterial superantigens and inflammatory skin diseases , 2000, Clinical and experimental dermatology.

[25]  Xing Chen,et al.  A Computational Framework to Infer Human Disease-Associated Long Noncoding RNAs , 2014, PloS one.

[26]  Xing Chen,et al.  NLLSS: Predicting Synergistic Drug Combinations Based on Semi-supervised Learning , 2016, PLoS Comput. Biol..

[27]  Xinghua Shi,et al.  A Network Based Method for Analysis of lncRNA-Disease Associations and Prediction of lncRNAs Implicated in Diseases , 2014, PloS one.

[28]  A. Barabasi,et al.  Human symptoms–disease network , 2014, Nature Communications.

[29]  Katherine H. Huang,et al.  Structure, Function and Diversity of the Healthy Human Microbiome , 2012, Nature.

[30]  Xing Chen KATZLDA: KATZ measure for the lncRNA-disease association prediction , 2015, Scientific Reports.

[31]  Carl Nathan,et al.  Fresh Approaches to Anti-Infective Therapies , 2012, Science Translational Medicine.

[32]  Lars Råberg,et al.  Disentangling Genetic Variation for Resistance and Tolerance to Infectious Diseases in Animals , 2007, Science.

[33]  Olli Simell,et al.  Gut Microbiome Metagenomics Analysis Suggests a Functional Model for the Development of Autoimmunity for Type 1 Diabetes , 2011, PloS one.

[34]  Jennifer C. Drew,et al.  Toward defining the autoimmune microbiome for type 1 diabetes , 2011, The ISME Journal.

[35]  Xing Chen,et al.  Drug-target interaction prediction by random walk on the heterogeneous network. , 2012, Molecular bioSystems.

[36]  J. Bakken,et al.  Treating Clostridium difficile infection with fecal microbiota transplantation. , 2011, Clinical gastroenterology and hepatology : the official clinical practice journal of the American Gastroenterological Association.

[37]  David S Schneider,et al.  Two ways to survive infection: what resistance and tolerance can teach us about treating infectious diseases , 2008, Nature Reviews Immunology.

[38]  Mihai Pop,et al.  Deep Sequencing of the Oral Microbiome Reveals Signatures of Periodontal Disease , 2012, PloS one.

[39]  Lawrence A. David,et al.  Diet rapidly and reproducibly alters the human gut microbiome , 2013, Nature.

[40]  Xing Chen,et al.  RWRMDA: predicting novel human microRNA-disease associations. , 2012, Molecular bioSystems.

[41]  Zhu-Hong You,et al.  Detection of Interactions between Proteins through Rotation Forest and Local Phase Quantization Descriptors , 2015, International journal of molecular sciences.

[42]  W E Moore,et al.  Intestinal floras of populations that have a high risk of colon cancer , 1995, Applied and environmental microbiology.

[43]  Xing Chen,et al.  Predicting lncRNA-disease associations and constructing lncRNA functional similarity network based on the information of miRNA , 2015, Scientific Reports.

[44]  Xing Chen,et al.  Semi-supervised learning for potential human microRNA-disease associations inference , 2014, Scientific Reports.

[45]  Elena Marchiori,et al.  Gaussian interaction profile kernels for predicting drug-target interaction , 2011, Bioinform..

[46]  J. Clemente,et al.  Diet Drives Convergence in Gut Microbiome Functions Across Mammalian Phylogeny and Within Humans , 2011, Science.

[47]  Tobias Hamp,et al.  Sequence-based prediction of protein-protein interactions , 2014 .

[48]  Xing Chen,et al.  Novel human lncRNA-disease association inference based on lncRNA expression profiles , 2013, Bioinform..

[49]  Xing Chen,et al.  Long non-coding RNAs and complex diseases: from experimental results to computational models , 2016, Briefings Bioinform..

[50]  J. Gordon,et al.  Human nutrition, the gut microbiome and the immune system , 2011, Nature.

[51]  Leo Katz,et al.  A new status index derived from sociometric analysis , 1953 .

[52]  Marcus J. Claesson,et al.  Genome-scale analyses of health-promoting bacteria: probiogenomics , 2009, Nature Reviews Microbiology.

[53]  Xing Chen,et al.  FMLNCSIM: fuzzy measure-based lncRNA functional similarity calculation model , 2016, Oncotarget.

[54]  Zhu-Hong You,et al.  Prediction of protein-protein interactions from amino acid sequences with ensemble extreme learning machines and principal component analysis , 2013, BMC Bioinformatics.

[55]  Piet A van den Brandt,et al.  Gut microbiota composition and development of atopic manifestations in infancy: the KOALA Birth Cohort Study , 2006, Gut.

[56]  J. Petrosino,et al.  Microbiota Modulate Behavioral and Physiological Abnormalities Associated with Neurodevelopmental Disorders , 2013, Cell.

[57]  Xing Chen,et al.  Sequence-based prediction of protein-protein interactions using weighted sparse representation model combined with global encoding , 2016, BMC Bioinformatics.