Machine Learning Approaches for Metagenomics

Microbes exist everywhere. The current generation of genomic technologies has allowed researchers to determine the collective DNA sequence of all microorganisms co-existing in a community. In this paper, we present some of the challenges in analyzing data obtained from community genomics experiments (commonly referred to as metagenomics), advocate the need for machine learning techniques, and highlight our contributions to the development of supervised and unsupervised techniques for solving this complex, real-world problem.
