Transcriptome Analysis of the Portunus trituberculatus: De Novo Assembly, Growth-Related Gene Identification and Marker Discovery

Background The swimming crab, Portunus trituberculatus, is an important farmed species in China, has been attracting extensive studies, which require more and more genome background knowledge. To date, the sequencing of its whole genome is unavailable and transcriptomic information is also scarce for this species. In the present study, we performed de novo transcriptome sequencing to produce a comprehensive transcript dataset for major tissues of Portunus trituberculatus by the Illumina paired-end sequencing technology. Results Total RNA was isolated from eyestalk, gill, heart, hepatopancreas and muscle. Equal quantities of RNA from each tissue were pooled to construct a cDNA library. Using the Illumina paired-end sequencing technology, we generated a total of 120,137 transcripts with an average length of 1037 bp. Further assembly analysis showed that all contigs contributed to 87,100 unigenes, of these, 16,029 unigenes (18.40% of the total) can be matched in the GenBank non-redundant database. Potential genes and their functions were predicted by GO, KEGG pathway mapping and COG analysis. Based on our sequence analysis and published literature, many putative genes with fundamental roles in growth and muscle development, including actin, myosin, tropomyosin, troponin and other potentially important candidate genes were identified for the first time in this specie. Furthermore, 22,673 SSRs and 66,191 high-confidence SNPs were identified in this EST dataset. Conclusion The transcriptome provides an invaluable new data for a functional genomics resource and future biological research in Portunus trituberculatus. The data will also instruct future functional studies to manipulate or select for genes influencing growth that should find practical applications in aquaculture breeding programs. The molecular markers identified in this study will provide a material basis for future genetic linkage and quantitative trait loci analyses, and will be essential for accelerating aquaculture breeding programs with this species.

[1]  R. Lyons,et al.  Transcriptomics of a Giant Freshwater Prawn (Macrobrachium rosenbergii): De Novo Assembly, Annotation and Marker Discovery , 2011, PloS one.

[2]  N. Friedman,et al.  Trinity: reconstructing a full-length transcriptome without a genome from RNA-Seq data , 2011, Nature Biotechnology.

[3]  Qianqian Li,et al.  The first homolog of pacifastin-related precursor in the swimming crab (Portunus trituberculatus): characterization and potential role in immune response to bacteria and fungi. , 2012, Fish & shellfish immunology.

[4]  P. Trotta,et al.  Myosin , 1967, The Journal of General Physiology.

[5]  Genetic variation and population structure of swimming crab (Portunus trituberculatus) inferred from mitochondrial control region , 2012, Molecular Biology Reports.

[6]  Gene Ontology Consortium The Gene Ontology (GO) database and informatics resource , 2003 .

[7]  Songnian Hu,et al.  Comparative Transcriptome Analysis of the Accessory Sex Gland and Testis from the Chinese Mitten Crab (Eriocheir sinensis) , 2013, PloS one.

[8]  Wei-Jun Yang,et al.  Actin gene in prawn, Macrobrachium rosenbergii: characteristics and differential tissue expression during embryonic development. , 2005, Comparative biochemistry and physiology. Part B, Biochemistry & molecular biology.

[9]  Qianghua Xu,et al.  Expressed sequence tags from cDNA library prepared from gills of the swimming crab, Portunus trituberculatus , 2010 .

[10]  J. Cesar,et al.  Expression patterns of ubiquitin, heat shock protein 70, α‐actin and β‐actin over the molt cycle in the abdominal muscle of marine shrimp Litopenaeus vannamei , 2007 .

[11]  Jianguo He,et al.  Analysis of Litopenaeus vannamei Transcriptome Using the Next-Generation DNA Sequencing Technique , 2012, PloS one.

[12]  J. Burke,et al.  EST-SSRs as a resource for population genetic analyses , 2007, Heredity.

[13]  P. Leekitcharoenphon,et al.  Predicted sub-populations in a marine shrimp proteome as revealed by combined EST and cDNA data from multiple Penaeus species , 2010, BMC Research Notes.

[14]  L. Marins,et al.  Characterization of growth‐related genes in the south‐western Atlantic pink shrimp Farfantepenaeus paulensis (Pérez‐Farfante 1967) through a modified DDRT‐PCR protocol , 2008 .

[15]  Ji-Ping Z. Wang,et al.  EST clustering error evaluation and correction , 2004, Bioinform..

[16]  Shunping He,et al.  Transcriptome Analysis of Silver Carp (Hypophthalmichthys molitrix) by Paired-End RNA Sequencing , 2012, DNA research : an international journal for rapid publication of reports on genes and genomes.

[17]  Jan-Ming Ho,et al.  A Review of the Major Penaeid Shrimp EST Studies and the Construction of a Shrimp Transcriptome Database Based on the ESTs from Four Penaeid Shrimp , 2011, Marine Biotechnology.

[18]  G. Qiu,et al.  Transcriptome Analysis of the Oriental River Prawn, Macrobrachium nipponense Using 454 Pyrosequencing for Discovery of Genes and Markers , 2012, PloS one.

[19]  XIANG YING Meng,et al.  SNPs in the myostatin gene of the mollusk Chlamys farreri: association with growth traits. , 2010, Comparative biochemistry and physiology. Part B, Biochemistry & molecular biology.

[20]  Xuelin Zhao,et al.  Transcriptomic Responses to Salinity Stress in the Pacific Oyster Crassostrea gigas , 2012, PloS one.

[21]  H. Devaraj,et al.  Molecular mechanisms regulating molting in a crustacean , 2006, The FEBS journal.

[22]  S. Roth,et al.  The maternal and early embryonic transcriptome of the milkweed bug Oncopeltus fasciatus , 2011, BMC Genomics.

[23]  C. Niu,et al.  Transcriptome Profiling of Chironomus kiinensis under Phenol Stress Using Solexa Sequencing Technology , 2013, PloS one.

[24]  Jie Hao,et al.  Genetic diversity of swimming crab (Portunus trituberculatus) populations from Shandong peninsula as assessed by microsatellite markers , 2012 .

[25]  Hyun-Woo Kim,et al.  Twelve actin-encoding cDNAs from the American lobster, Homarus americanus: cloning and tissue expression of eight skeletal muscle, one heart, and three cytoplasmic isoforms. , 2009, Comparative biochemistry and physiology. Part B, Biochemistry & molecular biology.

[26]  Brandon S Gaut,et al.  Variation in Mutation Dynamics Across the Maize Genome as a Function of Regional and Flanking Base Composition , 2006, Genetics.

[27]  Hiroyuki Ogata,et al.  KEGG: Kyoto Encyclopedia of Genes and Genomes , 1999, Nucleic Acids Res..

[28]  Sue A. Olson,et al.  Emboss opens up sequence analysis , 2002, Briefings Bioinform..

[29]  J. Xiang,et al.  Transcriptome Analysis on Chinese Shrimp Fenneropenaeus chinensis during WSSV Acute Infection , 2013, PloS one.

[30]  Hongxia Wang,et al.  Application of microsatellite DNA parentage markers in the swimming crab Portunus trituberculatus , 2012, Aquaculture International.

[31]  Roberto Dominguez,et al.  Actin structure and function. , 2011, Annual review of biophysics.

[32]  C. Mu,et al.  Development and characterization of EST-derived microsatellite makers for swimming crab, Portunus trituberculatus , 2013, Conservation Genetics Resources.

[33]  Chi-Ying Lee,et al.  Crustacean molt-inhibiting hormone: structure, function, and cellular mode of action. , 2009, Comparative biochemistry and physiology. Part A, Molecular & integrative physiology.

[34]  M. Sorrells,et al.  Data mining for simple sequence repeats in expressed sequence tags from barley, maize, rice, sorghum and wheat , 2002, Plant Molecular Biology.

[35]  Qianghua Xu,et al.  Molecular cloning of heat shock protein 60 (PtHSP60) from Portunus trituberculatus and its expression response to salinity stress , 2012, Cell Stress and Chaperones.

[36]  P. Bagnaresi,et al.  A computational-based update on microRNAs and their targets in barley (Hordeum vulgare L.) , 2010, BMC Genomics.

[37]  M. Tiensuwan,et al.  Characterization of candidate genes involved in growth of black tiger shrimp Penaeus monodon , 2010 .

[38]  Stefan Götz,et al.  Blast2GO: A Comprehensive Suite for Functional Analysis in Plant Genomics , 2007, International journal of plant genomics.

[39]  Sun Wha Park,et al.  Phylogenetic analysis of mitochondrial DNA control region in the swimming crab, portunus trituberculatus , 2009 .

[40]  E. F. ARMSTRONG,et al.  Annual Review of Biochemistry , 1944, Nature.

[41]  M. Rothschild,et al.  SNP analysis of AMY2 and CTSL genes in Litopenaeus vannamei and Penaeus monodon shrimp. , 2005, Animal genetics.

[42]  Z. Liua,et al.  DNA marker technologies and their applications in aquaculture genetics , 2004 .

[43]  Yang Wang,et al.  Transcriptome Profiling of Testis during Sexual Maturation Stages in Eriocheir sinensis Using Illumina Sequencing , 2012, PloS one.

[44]  Gapped BLAST and PSI-BLAST: A new , 1997 .

[45]  E. S. Chang,et al.  Regulation of crustacean molting: a review and our perspectives. , 2011, General and comparative endocrinology.

[46]  R. Lyons,et al.  Genes and growth performance in crustacean species : a review of relevant genomic studies in crustaceans and other taxa , 2013 .

[47]  M. Miya,et al.  Complete mitochondrial DNA sequence of the swimming crab, Portunus trituberculatus (Crustacea: Decapoda: Brachyura). , 2003, Gene.

[48]  Lin Fang,et al.  WEGO: a web tool for plotting GO annotations , 2006, Nucleic Acids Res..

[49]  Qianqian Li,et al.  Crustins from eyestalk cDNA library of swimming crab Portunus trituberculatus: molecular characterization, genomic organization and expression analysis. , 2012, Fish & shellfish immunology.

[50]  Susumu Goto,et al.  KEGG: Kyoto Encyclopedia of Genes and Genomes , 2000, Nucleic Acids Res..

[51]  Nguyen Minh Thanh,et al.  Single nucleotide polymorphisms in the actin and crustacean hyperglycemic hormone genes and their correlation with individual growth performance in giant freshwater prawn Macrobrachium rosenbergii , 2010 .

[52]  Y. S. Hyun,et al.  Characterization of 20 microsatellite loci by multiplex PCR in swimming crab, Portunus trituberculatus , 2013, Genes & Genomics.

[53]  A. Elizur,et al.  Novel molecular approach to study moulting in crustaceans , 2007 .

[54]  Darren A. Natale,et al.  The COG database: an updated version includes eukaryotes , 2003, BMC Bioinformatics.

[55]  E. Liu,et al.  Next-generation DNA sequencing of paired-end tags (PET) for transcriptome and genome analyses. , 2009, Genome research.

[56]  S. Kalinowski,et al.  Revising how the computer program cervus accommodates genotyping error increases success in paternity assignment , 2007, Molecular ecology.

[57]  Xiaodong Bai,et al.  Tissue-Specific Transcriptomics of the Exotic Invasive Insect Pest Emerald Ash Borer (Agrilus planipennis) , 2010, PloS one.

[58]  Yu Cun Geographical Distribution and Faunal Analysis of Crab Resources in the East China Sea , 2003 .

[59]  Young A. Choi,et al.  Mining and characterizing microsatellites from citrus ESTs , 2006, Theoretical and Applied Genetics.

[60]  Yongmei Li,et al.  Transcriptome Analysis of Pacific White Shrimp (Litopenaeus vannamei) Hepatopancreas in Response to Taura Syndrome Virus (TSV) Experimental Infection , 2013, PloS one.

[61]  J. Yao,et al.  Generation of a reference transcriptome for evaluating rainbow trout responses to various stressors , 2011, BMC Genomics.

[62]  J. Chung,et al.  Crustacean hyperglycemic hormone (CHH) neuropeptidesfamily: Functions, titer, and binding to target tissues. , 2010, General and comparative endocrinology.

[63]  Ping Chen,et al.  cDNA cloning and characterization of peroxiredoxin gene from the swimming crab Portunus trituberculatus , 2011 .

[64]  Yu Cun COMPOSITION AND DISTRIBUTION OF ECONOMIC CRAB SPECIES IN THE EAST CHINA SEA , 2006 .

[65]  M. Sellars,et al.  Growing backwards: an inverted role for the shrimp ortholog of vertebrate myostatin and GDF11 , 2011, Journal of Experimental Biology.

[66]  J. Estellé,et al.  Liver transcriptome profile in pigs with extreme phenotypes of intramuscular fatty acid composition , 2012, BMC Genomics.

[67]  M. Rothschild,et al.  Short communication. SNP analyses of the 5HT1R and STAT genes in Pacific white shrimp, Litopenaeus vannamei , 2010 .

[68]  Qianqian Li,et al.  Three isoforms of anti-lipopolysaccharide factor identified from eyestalk cDNA library of swimming crab Portunus trituberculatus. , 2011, Fish & shellfish immunology.

[69]  Jeffrey B. Thuma,et al.  Invertebrate muscles: muscle specific genes and proteins. , 2005, Physiological reviews.

[70]  Jun-Bo Luan,et al.  Transcriptome analysis and comparison reveal divergence between two invasive whitefly cryptic species , 2011, BMC Genomics.

[71]  P. Gaffney,et al.  Development of Type I Genetic Markers from Expressed Sequence Tags in Highly Polymorphic Species , 2011, Marine Biotechnology.

[72]  Wei Zhang,et al.  A transcriptome analysis of mitten crab testes (Eriocheir sinensis) , 2010, Genetics and molecular biology.

[73]  S. Tanksley,et al.  An EST database for Liriodendron tulipifera L. floral buds: the first EST resource for functional and comparative genomics in Liriodendron , 2008, Tree Genetics & Genomes.

[74]  Se-Jin Lee,et al.  Regulation of skeletal muscle mass in mice by a new TGF-p superfamily member , 1997, nature.

[75]  Michael Y. Galperin,et al.  The COG database: new developments in phylogenetic classification of proteins from complete genomes , 2001, Nucleic Acids Res..

[76]  Yuan Liu,et al.  Molecular cloning, genomic structure and antimicrobial activity of PtALF7, a unique isoform of anti-lipopolysaccharide factor from the swimming crab Portunus trituberculatus. , 2013, Fish & shellfish immunology.

[77]  Q. Nie,et al.  Deep Sequencing-Based Transcriptome Analysis of Chicken Spleen in Response to Avian Pathogenic Escherichia coli (APEC) Infection , 2012, PloS one.

[78]  John Quackenbush,et al.  TIGR Gene Indices clustering tools (TGICL): a software system for fast clustering of large EST datasets , 2003, Bioinform..