Next-generation pyrosequencing of gonad transcriptomes in the polyploid lake sturgeon (Acipenser fulvescens): the relative merits of normalization and rarefaction in gene discovery

Background: Next-generation sequencing technologies have been applied most often to model organisms or species closely related to a model. However, these methods have the potential to be valuable in many wild organisms, including those of conservation concern. We used Roche 454 pyrosequencing to characterize gene expression in polyploid lake sturgeon (Acipenser fulvescens) gonads.

Results: Titration runs on a Roche 454 GS-FLX produced more than 47,000 sequencing reads. These reads represented 20,741 unique sequences that passed quality control (mean length = 186 bp). These were assembled into 1,831 contigs (mean contig depth = 4.1 sequences). Over 4,000 sequencing reads (~19%) were assigned gene ontologies, mostly to protein, RNA, and ion binding. A total of 877 candidate SNPs were identified from > 50 different genes. We employed an analytical approach from theoretical ecology (rarefaction) to evaluate depth of sequencing coverage relative to gene discovery. We also considered the relative merits of normalized versus native cDNA libraries when using next-generation sequencing platforms. Not surprisingly, fewer genes from the normalized libraries were rRNA subunits. Rarefaction suggests that normalization has little influence on the efficiency of gene discovery, at least when working with thousands of reads from a single tissue type.

Conclusion: Our data indicate that titration runs on 454 sequencers can characterize thousands of expressed sequence tags which can be used to identify SNPs, gene ontologies, and levels of gene expression in species of conservation concern. We anticipate that rarefaction will be useful in evaluations of gene discovery and that next-generation sequencing technologies hold great potential for the study of other non-model organisms.
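
To illustrate how rarefaction relates sequencing depth to gene discovery, the following is a minimal sketch of the standard hypergeometric rarefaction formula from ecology, with genes standing in for species and reads standing in for individuals. The abstract does not say which software or exact procedure the authors used, so the function name `rarefied_richness` and the read counts below are hypothetical and chosen only for illustration.

```python
from math import comb


def rarefied_richness(counts, n):
    """Expected number of distinct genes seen in a random subsample of n reads.

    counts: reads assigned to each gene (hypothetical input, one entry per gene).
    n: subsample size, between 0 and the total number of reads.
    """
    total = sum(counts)
    if not 0 <= n <= total:
        raise ValueError("n must be between 0 and the total number of reads")
    denom = comb(total, n)
    # For each gene, 1 - P(no read from that gene is drawn without replacement).
    return sum(1 - comb(total - c, n) / denom for c in counts if c > 0)


# Hypothetical library: four genes supported by 5, 3, 1 and 1 reads.
example_counts = [5, 3, 1, 1]
curve = [rarefied_richness(example_counts, n) for n in range(sum(example_counts) + 1)]
```

Plotting `curve` against subsample size gives a rarefaction curve. A curve that is still rising steeply at the full read count suggests that additional sequencing would keep uncovering new genes, whereas a plateau suggests the library has been sampled close to saturation. Comparing such curves at equal read counts is one way to contrast normalized and native libraries.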
