Analysis of Microbiome Data across Inflammatory Bowel Disease Patients

The interaction and inter-play of microbes with human host cells is responsible for several disease conditions and of criticality to human health. In this study we analyze the microbial communities within the human gut and their roles in Inflammatory Bowel Disease (IBD). These microbial communities can be profiled using either Length Heterogeneity PCR (LH-PCR) or small subunit (SSU) rRNA sequences. Classification methods based on support vector machines (SVM) and k-nearest neighbor (KNN) were developed to differentiate between healthy controls and IBD patients at various intestinal locations using those profiles. The results show that there exist significant operational taxonomic units (OTUs) or microbial species that are differentially abundant between IBD and healthy control state at specific intestinal locations. Moreover, the classification performances of the sequence data outperform those of LH-PCR profiles and the lowest taxonomic level (Genus-Species) is more likely to have superior classification performances than the higher taxonomic levels.

[1]  O. Eremin,et al.  Inflammatory bowel disease: dysfunction of GALT and gut bacterial flora (II). , 2003, The surgeon : journal of the Royal Colleges of Surgeons of Edinburgh and Ireland.

[2]  John A. Swets,et al.  Signal Detection Theory and ROC Analysis in Psychology and Diagnostics: Collected Papers , 1996 .

[3]  C. Litchfield,et al.  A comparison of DNA profiling techniques for monitoring nutrient impact on microbial community composition during bioremediation of petroleum-contaminated soils. , 2003, Journal of microbiological methods.

[4]  R. Ley,et al.  Ecological and Evolutionary Forces Shaping Microbial Diversity in the Human Intestine , 2006, Cell.

[5]  George Karypis,et al.  Profile-based direct kernels for remote homology detection and fold recognition , 2005, Bioinform..

[6]  B L Maidak,et al.  The RDP-II (Ribosomal Database Project) , 2001, Nucleic Acids Res..

[7]  Larry A. Rendell,et al.  A Practical Approach to Feature Selection , 1992, ML.

[8]  S. Carding,et al.  Inflammatory bowel disease: cause and immunobiology , 2007, The Lancet.

[9]  M. Pop,et al.  Metagenomic Analysis of the Human Distal Gut Microbiome , 2006, Science.

[10]  Bernhard E. Boser,et al.  A training algorithm for optimal margin classifiers , 1992, COLT '92.

[11]  K. Guillemin,et al.  We know you are in there: conversing with the indigenous gut microbiota. , 2007, Research in microbiology.

[12]  Pat Morin,et al.  Output-Sensitive Algorithms for Computing Nearest-Neighbour Decision Boundaries , 2003, WADS.

[13]  D. Savage Microbial ecology of the gastrointestinal tract. , 1977, Annual review of microbiology.

[14]  Giri Narasimhan,et al.  An ecoinformatics tool for microbial community studies : Supervised classification of Amplicon Length Heterogeneity ( ALH ) profiles of 16 S rRNA , 2006 .

[15]  Cynthia L Sears,et al.  A dynamic partnership: celebrating our gut flora. , 2005, Anaerobe.

[16]  Tom Fawcett,et al.  An introduction to ROC analysis , 2006, Pattern Recognit. Lett..