ACES: Analysis of Conservation with an Extensive list of Species

Abstract

Motivation: An abundance of new reference genomes is becoming available through large-scale sequencing efforts. While the reference FASTA for each genome is available, there is currently no automated mechanism to query a specific sequence across all new reference genomes.

Results: We developed ACES (Analysis of Conservation with an Extensive list of Species) as a computational workflow to query specific sequences of interest (e.g., enhancers, promoters, exons) against reference genomes with an available reference FASTA. This automated workflow generates BLAST hits against each of the reference genomes, a multiple sequence alignment file, a graphical fragment assembly file and a phylogenetic tree file. These data files can then be used by the researcher in several ways to provide key insights into conservation of the query sequence.

Availability and implementation: ACES is available at https://github.com/TNTurnerLab/ACES

Supplementary information: Supplementary data are available at Bioinformatics online.
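As an illustration of how the per-genome BLAST hits described above might be summarized, the following is a minimal sketch, not the ACES implementation. It assumes the hits for each genome were saved in BLAST's standard 12-column tabular format (`-outfmt 6`); the abstract does not state which output layout ACES uses. The names `Hit`, `parse_tabular_hits`, `best_hit_per_genome` and the sample rows are hypothetical and exist only for this example.

```python
from typing import Dict, List, NamedTuple, Optional


class Hit(NamedTuple):
    """A subset of the columns in one BLAST tabular (-outfmt 6) row."""
    subject: str
    percent_identity: float
    alignment_length: int
    evalue: float
    bitscore: float


def parse_tabular_hits(text: str) -> List[Hit]:
    """Parse tab-separated BLAST rows, skipping blank, comment and short lines.

    Column order assumed (standard -outfmt 6): qseqid, sseqid, pident, length,
    mismatch, gapopen, qstart, qend, sstart, send, evalue, bitscore.
    """
    hits = []
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        fields = line.split("\t")
        if len(fields) < 12:
            continue
        hits.append(
            Hit(
                subject=fields[1],
                percent_identity=float(fields[2]),
                alignment_length=int(fields[3]),
                evalue=float(fields[10]),
                bitscore=float(fields[11]),
            )
        )
    return hits


def best_hit_per_genome(tables: Dict[str, str]) -> Dict[str, Optional[Hit]]:
    """Return the highest-bitscore hit for each genome, or None if it has no hits."""
    best = {}
    for genome, text in tables.items():
        hits = parse_tabular_hits(text)
        best[genome] = max(hits, key=lambda h: h.bitscore) if hits else None
    return best


# Made-up rows for illustration only; these are not real ACES results.
EXAMPLE_TABLES = {
    "genome_A": (
        "query1\tchr1\t98.5\t200\t3\t0\t1\t200\t1000\t1199\t1e-95\t350\n"
        "query1\tchr2\t80.0\t120\t24\t0\t1\t120\t500\t619\t1e-20\t95.3\n"
    ),
    "genome_B": "",  # a genome in which the query produced no hits
}

if __name__ == "__main__":
    for genome, hit in best_hit_per_genome(EXAMPLE_TABLES).items():
        print(genome, hit)
```

A genome that maps to `None` here had no hit for the query, which may itself be informative when assessing conservation; the regions around the best hits are the kind of sequences that a downstream multiple sequence alignment would take as input.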
