Rapid Microsatellite Identification from Illumina Paired-End Genomic Sequencing in Two Birds and a Snake

Identification of microsatellites, or simple sequence repeats (SSRs), can be a time-consuming and costly investment requiring enrichment, cloning, and sequencing of candidate loci. Recently, however, high throughput sequencing (with or without prior enrichment for specific SSR loci) has been utilized to identify SSR loci. The direct “Seq-to-SSR” approach has an advantage over enrichment-based strategies in that it does not require a priori selection of particular motifs, or prior knowledge of genomic SSR content. It has been more expensive per SSR locus recovered, however, particularly for genomes with few SSR loci, such as bird genomes. The longer but relatively more expensive 454 reads have been preferred over less expensive Illumina reads. Here, we use Illumina paired-end sequence data to identify potentially amplifiable SSR loci (PALs) from a snake (the Burmese python, Python molurus bivittatus), and directly compare these results to those from 454 data. We also compare the python results to results from Illumina sequencing of two bird genomes (Gunnison Sage-grouse, Centrocercus minimus, and Clark's Nutcracker, Nucifraga columbiana), which have considerably fewer SSRs than the python. We show that direct Illumina Seq-to-SSR can identify and characterize thousands of potentially amplifiable SSR loci for as little as $10 per sample – a fraction of the cost of 454 sequencing. Given that Illumina Seq-to-SSR is effective, inexpensive, and reliable even for species such as birds that have few SSR loci, it seems that there are now few situations for which prior hybridization is justifiable.

[1]  Albert J. Vilella,et al.  The genome of a songbird , 2010, Nature.

[2]  Charlotte L. Oskam,et al.  Identification of microsatellites from an extinct moa species using high-throughput (454) sequence data. , 2009, BioTechniques.

[3]  N. Gemmell,et al.  The rise, fall and renaissance of microsatellites in eukaryotic genomes. , 2006, BioEssays : news and reviews in molecular, cellular and developmental biology.

[4]  J. Jurka,et al.  Repbase Update, a database of eukaryotic repetitive elements , 2005, Cytogenetic and Genome Research.

[5]  A. Hughes,et al.  DNA repeat arrays in chicken and human genomes and the adaptive evolution of avian genome size , 2005, BMC Evolutionary Biology.

[6]  N. Gemmell,et al.  Fast, cost-effective development of species-specific microsatellite markers by genomic sequencing. , 2009, BioTechniques.

[7]  S. Haig,et al.  Multiplexed microsatellite recovery using massively parallel sequencing , 2011, Molecular ecology resources.

[8]  Samuel E. Fox,et al.  Discovery of Highly Divergent Repeat Landscapes in Snake Genomes Using High-Throughput Sequencing , 2011, Genome biology and evolution.

[9]  Wanjun Gu,et al.  Rapid identification of thousands of copperhead snake (Agkistrodon contortrix) microsatellite loci from modest amounts of 454 shotgun genome sequence , 2010, Molecular ecology resources.

[10]  P. Uetz,et al.  Sequencing the genome of the Burmese python (Python molurus bivittatus) as a model for studying extreme adaptations in snakes , 2011, Genome Biology.

[11]  M. Shapiro,et al.  A proposal to sequence the genome of a garter snake (Thamnophis sirtalis) , 2011, Standards in genomic sciences.

[12]  B. Faircloth,et al.  msatcommander: detection of microsatellite repeat arrays and automated, locus‐specific primer design , 2008, Molecular ecology resources.

[13]  Colin N. Dewey,et al.  Sequence and comparative analysis of the chicken genome provide unique perspectives on vertebrate evolution , 2004, Nature.

[14]  T. Ryan Gregory,et al.  Eukaryotic genome size databases , 2006, Nucleic Acids Res..

[15]  Jun S. Liu,et al.  Phylogenomics of nonavian reptiles and the structure of the ancestral amniote genome , 2007, Proceedings of the National Academy of Sciences.

[16]  Emese Meglécz,et al.  QDD: a user-friendly program to select microsatellite markers and design primers from large sequencing projects , 2010, Bioinform..

[17]  S. Tyekucheva,et al.  The genome-wide determinants of human and chimpanzee microsatellite evolution. , 2007, Genome research.

[18]  Mark Johnston,et al.  Benchmarking next-generation transcriptome sequencing for functional and evolutionary genomics. , 2009, Molecular biology and evolution.