EuGene: maximizing synthetic gene design for heterologous expression

UNLABELLED Numerous software applications exist to deal with synthetic gene design, granting the field of heterologous expression a significant support. However, their dispersion requires the access to different tools and online services in order to complete one single project. Analyzing codon usage, calculating codon adaptation index (CAI), aligning orthologs and optimizing genes are just a few examples. A software application, EuGene, was developed for the optimization of multiple gene synthetic design algorithms. In a seamless automatic form, EuGene calculates or retrieves genome data on codon usage (relative synonymous codon usage and CAI), codon context (CPS and codon pair bias), GC content, hidden stop codons, repetitions, deleterious sites, protein primary, secondary and tertiary structures, gene orthologs, species housekeeping genes, performs alignments and identifies genes and genomes. The main function of EuGene is analyzing and redesigning gene sequences using multi-objective optimization techniques that maximize the coding features of the resulting sequence. AVAILABILITY EuGene is freely available for non-commercial use, at http://bioinformatics.ua.pt/eugene.

[1]  David W. Corne,et al.  Approximating the Nondominated Front Using the Pareto Archived Evolution Strategy , 2000, Evolutionary Computation.

[2]  Randall L. Kincaid,et al.  Heterologous Protein Expression Is Enhanced by Harmonizing the Codon Usage Frequencies of the Target Gene with those of the Expression Host , 2008, PloS one.

[3]  D. Ardell,et al.  Influences on gene expression in vivo by a Shine–Dalgarno sequence , 2006, Molecular microbiology.

[4]  Joel Arrais,et al.  Large Scale Comparative Codon-Pair Context Analysis Unveils General Rules that Fine-Tune Evolution of mRNA Primary Structure , 2007, PloS one.

[5]  Robert C. Edgar,et al.  MUSCLE: a multiple sequence alignment method with reduced time and space complexity , 2004, BMC Bioinformatics.

[6]  Alan Villalobos,et al.  Design Parameters to Control Synthetic Gene Expression in Escherichia coli , 2009, PloS one.

[7]  Liam J. McGuffin,et al.  The PSIPRED protein structure prediction server , 2000, Bioinform..

[8]  Gang Wu,et al.  The Synthetic Gene Designer: a flexible web platform to explore sequence manipulation for heterologous expression. , 2006, Protein expression and purification.

[9]  F. Taddei,et al.  Translational misreading: a tRNA modification counteracts a +2 ribosomal frameshift. , 2001, Genes & development.

[10]  Santiago Garcia-Vallvé,et al.  Working toward a new NIOSH. , 1996, Nucleic Acids Res..

[11]  Hervé Seligmann,et al.  The ambush hypothesis: hidden stop codons prevent off-frame gene reading. , 2004, DNA and cell biology.

[12]  Mark Johnson,et al.  NCBI BLAST: a better web interface , 2008, Nucleic Acids Res..

[13]  M. Kozak Influences of mRNA secondary structure on initiation by eukaryotic ribosomes. , 1986, Proceedings of the National Academy of Sciences of the United States of America.

[14]  John Walchli,et al.  Gene Composer: database software for protein construct design, codon engineering, and gene synthesis , 2009, BMC biotechnology.