JContextExplorer: a tree-based approach to facilitate cross-species genomic context comparison

BackgroundCross-species comparisons of gene neighborhoods (also called genomic contexts) in microbes may provide insight into determining functionally related or co-regulated sets of genes, suggest annotations of previously un-annotated genes, and help to identify horizontal gene transfer events across microbial species. Existing tools to investigate genomic contexts, however, lack features for dynamically comparing and exploring genomic regions from multiple species. As DNA sequencing technologies improve and the number of whole sequenced microbial genomes increases, a user-friendly genome context comparison platform designed for use by a broad range of users promises to satisfy a growing need in the biological community.ResultsHere we present JContextExplorer: a tool that organizes genomic contexts into branching diagrams. We implement several alternative context-comparison and tree rendering algorithms, and allow for easy transitioning between different clustering algorithms. To facilitate genomic context analysis, our tool implements GUI features, such as text search filtering, point-and-click interrogation of individual contexts, and genomic visualization via a multi-genome browser. We demonstrate a use case of our tool by attempting to resolve annotation ambiguities between two highly homologous yet functionally distinct genes in a set of 22 alpha and gamma proteobacteria.ConclusionsJContextExplorer should enable a broad range of users to analyze and explore genomic contexts. The program has been tested on Windows, Mac, and Linux operating systems, and is implemented both as an executable JAR file and java WebStart. Program executables, source code, and documentation is available at http://www.bme.ucdavis.edu/facciotti/resources_data/software/.

[1]  L. Stein,et al.  JBrowse: a next-generation genome browser. , 2009, Genome research.

[2]  Andreas Prlic,et al.  Sequence analysis , 2003 .

[3]  Lynne A. Goodwin,et al.  Evidence for Horizontal Gene Transfer of Anaerobic Carbon Monoxide Dehydrogenases , 2012, Front. Microbio..

[4]  S. Pandit,et al.  A Comparative Study on Distance Measuring Approaches for Clustering , 2011 .

[5]  V. Stewart,et al.  Purine Utilization by Klebsiella oxytoca M5al: Genes for Ring-Oxidizing and -Opening Enzymes , 2008, Journal of bacteriology.

[6]  A. Arkin,et al.  The Life-Cycle of Operons , 2006, PLoS genetics.

[7]  Eric Bapteste,et al.  Evolution of rhodopsin ion pumps in haloarchaea , 2007, BMC Evolutionary Biology.

[8]  M. Kanehisa,et al.  Systematic survey for novel types of prokaryotic retroelements based on gene neighborhood and protein architecture. , 2008, Molecular biology and evolution.

[9]  Kevin P. Byrne,et al.  The Yeast Gene Order Browser: combining curated homology and syntenic context reveals gene fate in polyploid species. , 2005, Genome research.

[10]  Terrence S. Furey,et al.  The UCSC Genome Browser Database , 2003, Nucleic Acids Res..

[11]  Eric J Alm,et al.  Correction: The Life-Cycle of Operons , 2006, PLoS Genetics.

[12]  David J. Reiss,et al.  Integration and visualization of systems biology data in context of the genome , 2010, BMC Bioinformatics.

[13]  Eugene V Koonin,et al.  Connected gene neighborhoods in prokaryotic genomes. , 2002, Nucleic acids research.

[14]  Katherine H. Huang,et al.  Operon formation is driven by co-regulation and not by horizontal gene transfer. , 2005, Genome research.

[15]  R. Overbeek,et al.  The use of gene clusters to infer functional coupling. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[16]  Y. van de Peer,et al.  i-ADHoRe 3.0—fast and sensitive detection of genomic homology in extremely large data sets , 2011, Nucleic acids research.

[17]  H. Mori,et al.  Evolutionary instability of operon structures disclosed by sequence comparisons of complete microbial genomes. , 1999, Molecular biology and evolution.

[18]  Feng Chen,et al.  OrthoMCL-DB: querying a comprehensive multi-species collection of ortholog groups , 2005, Nucleic Acids Res..

[19]  Peter D. Karp,et al.  EcoCyc: A comprehensive view of Escherichia coli biology , 2008, Nucleic Acids Res..

[20]  Gautier Koscielny,et al.  Ensembl 2012 , 2011, Nucleic Acids Res..

[21]  Matthieu Muffato,et al.  Genomicus: a database and a browser to study gene synteny in modern and ancestral genomes , 2010, Bioinform..

[22]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[23]  Inna Dubchak,et al.  The integrated microbial genomes (IMG) system , 2005, Nucleic Acids Res..

[24]  BMC Bioinformatics , 2005 .

[25]  Christian von Mering,et al.  STRING 8—a global view on proteins and their functional interactions in 630 organisms , 2008, Nucleic Acids Res..

[26]  S. Pongor,et al.  The quest for orthologs: finding the corresponding gene across genomes. , 2008, Trends in genetics : TIG.

[27]  Inna Dubchak,et al.  MicrobesOnline: an integrated portal for comparative and functional genomics , 2009, Nucleic Acids Res..

[28]  G. Georgiou,et al.  The many faces of glutathione in bacteria. , 2006, Antioxidants & redox signaling.

[29]  C. Stoeckert,et al.  OrthoMCL: identification of ortholog groups for eukaryotic genomes. , 2003, Genome research.

[30]  E. Koonin,et al.  Genome alignment, evolution of prokaryotic genome organization, and prediction of gene function using genomic context. , 2001, Genome research.

[31]  N. Grishin,et al.  Genome trees and the tree of life. , 2002, Trends in genetics : TIG.

[32]  Anton J. Enright,et al.  An efficient algorithm for large-scale detection of protein families. , 2002, Nucleic acids research.

[33]  Inna Dubchak,et al.  RegPredict: an integrated system for regulon inference in prokaryotes by comparative genomics approach , 2010, Nucleic Acids Res..

[34]  Y. van de Peer,et al.  PLAZA: A Comparative Genomics Resource to Study Gene and Genome Evolution in Plants[W] , 2009, The Plant Cell Online.

[35]  Jeremy D. DeBarry,et al.  MCScanX: a toolkit for detection and evolutionary analysis of gene synteny and collinearity , 2012, Nucleic acids research.

[36]  Andrew H. Paterson,et al.  Synteny and Collinearity in Plant Genomes , 2008, Science.

[37]  F. González-Candelas,et al.  Horizontal gene transfer in the molecular evolution of mannose PTS transporters. , 2005, Molecular biology and evolution.

[38]  P Bork,et al.  Gene context conservation of a higher order than operons. , 2000, Trends in biochemical sciences.

[39]  Sergio Gómez,et al.  Solving Non-Uniqueness in Agglomerative Hierarchical Clustering Using Multidendrograms , 2006, J. Classif..

[40]  Sean D. Hooper,et al.  Gene Context Analysis in the Integrated Microbial Genomes (IMG) Data Management System , 2009, PloS one.