EvolClust: automated inference of evolutionary conserved gene clusters in eukaryotes

Abstract

Motivation: The evolution and role of gene clusters in eukaryotes are poorly understood. Currently, most studies and computational prediction programs limit their focus to specific types of clusters, such as those involved in secondary metabolism.

Results: We present EvolClust, a Python-based tool for the inference of evolutionarily conserved gene clusters from genome comparisons, independently of the function or gene composition of the cluster. EvolClust predicts conserved gene clusters from pairwise genome comparisons and infers families of related clusters from multiple (all versus all) genome comparisons.

Availability and implementation: https://github.com/Gabaldonlab/EvolClust/.

Supplementary information: Supplementary data are available at Bioinformatics online.
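To make the idea of a pairwise genome comparison concrete, the following is a minimal toy sketch and not EvolClust's actual algorithm, which the abstract does not detail. It assumes each genome is given as an ordered list of gene-family identifiers (homology assignments are taken as precomputed), and it reports pairs of fixed-size windows that share a minimum number of families regardless of gene order. The function name `conserved_windows` and the parameters `size` and `min_shared` are hypothetical and chosen only for illustration.

```python
from itertools import product


def conserved_windows(genome_a, genome_b, size=4, min_shared=3):
    """Toy pairwise comparison (illustrative only, not EvolClust's method).

    genome_a, genome_b: lists of gene-family IDs in chromosomal order.
    Returns (start_a, start_b, shared_families) for every pair of windows
    of length `size` that share at least `min_shared` families.
    """
    # Pre-compute the set of families present in each sliding window.
    windows_a = [(i, set(genome_a[i:i + size]))
                 for i in range(len(genome_a) - size + 1)]
    windows_b = [(j, set(genome_b[j:j + size]))
                 for j in range(len(genome_b) - size + 1)]

    hits = []
    for (i, fam_a), (j, fam_b) in product(windows_a, windows_b):
        shared = fam_a & fam_b  # families present in both windows
        if len(shared) >= min_shared:
            hits.append((i, j, sorted(shared)))
    return hits


# Hypothetical example: genome_b carries an inverted copy of f1..f4.
genome_a = ["f1", "f2", "f3", "f4", "f9", "f10"]
genome_b = ["f20", "f4", "f3", "f2", "f1", "f21"]
hits = conserved_windows(genome_a, genome_b)
```

Because the comparison is set-based, the inverted block in `genome_b` is still detected. Extending this toy to an all-versus-all setting would amount to running it on every genome pair and grouping overlapping hits into families, which is again an assumption about one possible design rather than a description of the tool.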
