Genotype Specification Language.

We describe here the Genotype Specification Language (GSL), a language that facilitates the rapid design of large and complex DNA constructs used to engineer genomes. The GSL compiler implements a high-level language based on traditional genetic notation, as well as a set of low-level DNA manipulation primitives. The language allows facile incorporation of parts from a library of cloned DNA constructs and from the "natural" library of parts in fully sequenced and annotated genomes. GSL was designed to engage genetic engineers in their native language while providing a framework for higher level abstract tooling. To this end we define four language levels, Level 0 (literal DNA sequence) through Level 3, with increasing abstraction of part selection and construction paths. GSL targets an intermediate language based on DNA slices that translates efficiently into a wide range of final output formats, such as FASTA and GenBank, and includes formats that specify instructions and materials such as oligonucleotide primers to allow the physical construction of the GSL designs by individual strain engineers or an automated DNA assembly core facility.

[1]  Etsuko N. Moriyama,et al.  Vector NTI, a balanced all-in-one sequence analysis suite , 2004, Briefings Bioinform..

[2]  Bartek Wilczynski,et al.  Biopython: freely available Python tools for computational molecular biology and bioinformatics , 2009, Bioinform..

[3]  Jed Dean,et al.  High-throughput, cost-effective verification of structural DNA assembly , 2013, Nucleic acids research.

[4]  Jingdong Tian,et al.  Circular Polymerase Extension Cloning of Complex Gene Libraries and Pathways , 2009, PloS one.

[5]  Swapnil Bhatia,et al.  Developer's and user's guide to Clotho v2.0 A software platform for the creation of synthetic biological systems. , 2011, Methods in enzymology.

[6]  Allan Kuchinsky,et al.  The Synthetic Biology Open Language (SBOL) provides a community standard for communicating designs in synthetic biology , 2014, Nature Biotechnology.

[7]  Emma M. B. Weeding,et al.  Eugene – A Domain Specific Language for Specifying and Constraining Synthetic Biological Parts, Devices, and Systems , 2011, PloS one.

[8]  Jay D. Keasling,et al.  Production of amorphadiene in yeast, and its conversion to dihydroartemisinic acid, precursor to the antimalarial agent artemisinin , 2012, Proceedings of the National Academy of Sciences.

[9]  D. G. Gibson,et al.  Enzymatic assembly of DNA molecules up to several hundred kilobases , 2009, Nature Methods.

[10]  Alan Villalobos,et al.  Gene Designer: a synthetic biology tool for constructing artificial DNA segments , 2006, BMC Bioinformatics.

[11]  J. Keasling,et al.  High-level semi-synthetic production of the potent antimalarial artemisinin , 2013, Nature.

[12]  Elaine B Shapland,et al.  Low-Cost, High-Throughput Sequencing of DNA Assemblies Using a Highly Multiplexed Nextera Process. , 2015, ACS synthetic biology.

[13]  Matthias Köppe,et al.  SBROME: a scalable optimization and module matching framework for automated biosystems design. , 2013, ACS synthetic biology.

[14]  Max G Schubert,et al.  Efficient Multiplexed Integration of Synergistic Alleles and Metabolic Pathways in Yeasts via CRISPR-Cas. , 2015, Cell systems.

[15]  Chris J. Myers,et al.  Tablet—next generation sequence assembly visualization , 2009, Bioinform..

[16]  Deepak Chandran,et al.  TinkerCell: modular CAD tool for synthetic biology , 2009, Journal of biological engineering.

[17]  Maxime Durot,et al.  Rapid and reliable DNA assembly via ligase cycling reaction. , 2014, ACS synthetic biology.

[18]  Jean Peccoud,et al.  Writing DNA with GenoCAD™ , 2009, Nucleic Acids Res..

[19]  Peter Jackson,et al.  Use of pantothenate as a metabolic switch increases the genetic stability of farnesene producing Saccharomyces cerevisiae. , 2014, Metabolic engineering.