The Whole-Genome and Transcriptome of the Manila Clam (Ruditapes philippinarum)

Abstract The manila clam, Ruditapes philippinarum, is an important bivalve species in worldwide aquaculture including Korea. The aquaculture production of R. philippinarum is under threat from diverse environmental factors including viruses, microorganisms, parasites, and water conditions with subsequently declining production. In spite of its importance as a marine resource, the reference genome of R. philippinarum for comprehensive genetic studies is largely unexplored. Here, we report the de novo whole-genome and transcriptome assembly of R. philippinarum across three different tissues (foot, gill, and adductor muscle), and provide the basic data for advanced studies in selective breeding and disease control in order to obtain successful aquaculture systems. An approximately 2.56 Gb high quality whole-genome was assembled with various library construction methods. A total of 108,034 protein coding gene models were predicted and repetitive elements including simple sequence repeats and noncoding RNAs were identified to further understanding of the genetic background of R. philippinarum for genomics-assisted breeding. Comparative analysis with the bivalve marine invertebrates uncover that the gene family related to complement C1q was enriched. Furthermore, we performed transcriptome analysis with three different tissues in order to support genome annotation and then identified 41,275 transcripts which were annotated. The R. philippinarum genome resource will markedly advance a wide range of potential genetic studies, a reference genome for comparative analysis of bivalve species and unraveling mechanisms of biological processes in molluscs. We believe that the R. philippinarum genome will serve as an initial platform for breeding better-quality clams using a genomic approach.

[1]  A. Papanicolaou,et al.  Transcriptome Analysis of the Sydney Rock Oyster, Saccostrea glomerata: Insights into Molluscan Immunity , 2016, PloS one.

[2]  Xiaoli Ma,et al.  Transcriptome Sequencing and Comparative Analysis of Ovary and Testis Identifies Potential Key Sex-Related Genes and Pathways in Scallop Patinopecten yessoensis , 2016, Marine Biotechnology.

[3]  S. Subramaniyam,et al.  Transcriptome Analysis Revealed Changes of Multiple Genes Involved in Haliotis discus hannai Innate Immunity during Vibrio parahemolyticus Infection , 2016, PloS one.

[4]  S. Watabe,et al.  Bivalve-specific gene expansion in the pearl oyster genome: implications of adaptation to a sessile lifestyle , 2016, Zoological Letters.

[5]  Hong-Seog Park,et al.  Sequencing, De Novo Assembly, and Annotation of the Transcriptome of the Endangered Freshwater Pearl Bivalve, Cristaria plicata, Provides Novel Insights into Functional Genes and Marker Discovery , 2016, PloS one.

[6]  Robert D. Finn,et al.  The Pfam protein families database: towards a more sustainable future , 2015, Nucleic Acids Res..

[7]  Katharina J. Hoff,et al.  BRAKER1: Unsupervised RNA-Seq-Based Genome Annotation with GeneMark-ET and AUGUSTUS , 2016, Bioinform..

[8]  Guanglei Liu,et al.  Cloning and Characterization of a Pyruvate Carboxylase Gene from Penicillium rubens and Overexpression of the Genein the Yeast Yarrowia lipolytica for Enhanced Citric Acid Production , 2015, Marine Biotechnology.

[9]  W. Warren,et al.  Developing tools for the study of molluscan immunity: The sequencing of the genome of the eastern oyster, Crassostrea virginica. , 2015, Fish & shellfish immunology.

[10]  S. Bhassu,et al.  RNA-seq analysis of Macrobrachium rosenbergii hepatopancreas in response to Vibrio parahaemolyticus infection , 2015, Gut Pathogens.

[11]  M. Gerdol,et al.  The genome of the Pacific oyster Crassostrea gigas brings new insights on the massive expansion of the C1q gene family in Bivalvia. , 2015, Developmental and comparative immunology.

[12]  Matthew W. Hahn,et al.  Convergent evolution of the genomes of marine mammals , 2015, Nature Genetics.

[13]  Baozhong Liu,et al.  Transcriptome Analysis of Shell Color-Related Genes in the Clam Meretrix meretrix , 2015, Marine Biotechnology.

[14]  María Martín,et al.  UniProt: A hub for protein information , 2015 .

[15]  Huaiyu Mi,et al.  The InterPro protein families database: the classification resource after 15 years , 2014, Nucleic Acids Res..

[16]  Scott Federhen,et al.  Type material in the NCBI Taxonomy Database , 2014, Nucleic Acids Res..

[17]  Robert D. Finn,et al.  Rfam 12.0: updates to the RNA families database , 2014, Nucleic Acids Res..

[18]  The Uniprot Consortium,et al.  UniProt: a hub for protein information , 2014, Nucleic Acids Res..

[19]  W. Pirovano,et al.  SSPACE-LongRead: scaffolding bacterial draft genomes using long read sequence information , 2014, BMC Bioinformatics.

[20]  A. Huvet,et al.  Gonad transcriptome analysis of pearl oyster Pinctada margaritifera: identification of potential sex differentiation and sex determining genes , 2014, BMC Genomics.

[21]  L. Xiang,et al.  A novel C1q-domain-containing (C1qDC) protein from Mytilus coruscus with the transcriptional analysis against marine pathogens and heavy metals. , 2014, Developmental and comparative immunology.

[22]  Rajiv C. McCoy,et al.  Illumina TruSeq Synthetic Long-Reads Empower De Novo Assembly and Resolve Complex, Highly-Repetitive Transposable Elements , 2014, bioRxiv.

[23]  Yanyan Hu,et al.  Effects of benzo(a)pyrene on differentially expressed genes and haemocyte parameters of the clam Venerupis philippinarum , 2014, Ecotoxicology.

[24]  Sean R. Eddy,et al.  Infernal 1.1: 100-fold faster RNA homology searches , 2013, Bioinform..

[25]  Colin N. Dewey,et al.  De novo transcript sequence reconstruction from RNA-seq using the Trinity platform for reference generation and analysis , 2013, Nature Protocols.

[26]  Aaron M. Newman,et al.  The genome sequence of the colonial chordate, Botryllus schlosseri , 2013, eLife.

[27]  Nicholas H. Putnam,et al.  Insights into bilaterian evolution from three spiralian genomes , 2012, Nature.

[28]  Cole Trapnell,et al.  TopHat2: accurate alignment of transcriptomes in the presence of insertions, deletions and gene fusions , 2013, Genome Biology.

[29]  Jian Wang,et al.  SOAPdenovo2: an empirically improved memory-efficient short-read de novo assembler , 2012, GigaScience.

[30]  Qiang Wang,et al.  The oyster genome reveals stress adaptation and complexity of shell formation , 2012, Nature.

[31]  Huan Zhang,et al.  A C1q Domain Containing Protein from Scallop Chlamys farreri Serving as Pattern Recognition Receptor with Heat-Aggregated IgG Binding Activity , 2012, PloS one.

[32]  B. Faircloth,et al.  Primer3—new capabilities and interfaces , 2012, Nucleic acids research.

[33]  Guangrui Huang,et al.  HaploMerger: Reconstructing allelic relationships for polymorphic diploid genome assemblies , 2012, Genome research.

[34]  A. Figueras,et al.  Transcriptomics of In Vitro Immune-Stimulated Hemocytes from the Manila Clam Ruditapes philippinarum Using High-Throughput Sequencing , 2012, PloS one.

[35]  S. Schreiber,et al.  Massively Parallel RNA Sequencing Identifies a Complex Immune Gene Repertoire in the lophotrochozoan Mytilus edulis , 2012, PloS one.

[36]  Hideo Aoki,et al.  Draft Genome of the Pearl Oyster Pinctada fucata: A Platform for Understanding Bivalve Biology , 2012, DNA research : an international journal for rapid publication of reports on genes and genomes.

[37]  Zhong Wang,et al.  Next-generation transcriptome assembly , 2011, Nature Reviews Genetics.

[38]  Adam M. Phillippy,et al.  Interactive metagenomic visualization in a Web browser , 2011, BMC Bioinformatics.

[39]  Colin N. Dewey,et al.  RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome , 2011, BMC Bioinformatics.

[40]  N. Friedman,et al.  Trinity: reconstructing a full-length transcriptome without a genome from RNA-Seq data , 2011, Nature Biotechnology.

[41]  M. Gerdol,et al.  The C1q domain containing proteins of the Mediterranean mussel Mytilus galloprovincialis: a widespread and diverse family of immune-related molecules. , 2011, Developmental and comparative immunology.

[42]  G. Chelazzi,et al.  Transcriptome sequencing and microarray development for the Manila clam, Ruditapes philippinarum: genomic tools for environmental monitoring , 2011, BMC Genomics.

[43]  Kwang-Sik Choi,et al.  Isolation and identification of Perkinsus olseni from feces and marine sediment using immunological and molecular techniques. , 2010, Journal of invertebrate pathology.

[44]  M. Figueras,et al.  Diversity and pathogenecity of Vibrio species in cultured bivalve molluscs. , 2010, Environmental microbiology reports.

[45]  C. Dang,et al.  Virus-like particles associated with brown muscle disease in Manila clam, Ruditapes philippinarum, in Arcachon Bay (France). , 2009, Journal of Fish Diseases.

[46]  Geoffrey J. Barton,et al.  Jalview Version 2—a multiple sequence alignment editor and analysis workbench , 2009, Bioinform..

[47]  Kwang-Sik Choi,et al.  Noble tandem-repeat galectin of Manila clam Ruditapes philippinarum is induced upon infection with the protozoan parasite Perkinsus olseni. , 2008, Developmental and comparative immunology.

[48]  Rodrigo Lopez,et al.  Clustal W and Clustal X version 2.0 , 2007, Bioinform..

[49]  Tatiana Tatusova,et al.  NCBI Reference Sequence (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins , 2004, Nucleic Acids Res..

[50]  S. Bohlson,et al.  Complement proteins C1q and MBL are pattern recognition molecules that signal immediate and long-term protective immune functions. , 2007, Molecular immunology.

[51]  Matthew M. Hill,et al.  A haplome alignment and reference sequence of the highly polymorphic Ciona savignyi genome , 2007, Genome Biology.

[52]  Kwang-Sik Choi,et al.  Occurrence of Perkinsus olseni in the Venus clam Protothaca jedoensis in Korean waters. , 2006, Journal of invertebrate pathology.

[53]  R. Hartmann-Petersen,et al.  Adrm1, a putative cell adhesion regulating protein, is a novel proteasome-associated factor. , 2006, Journal of molecular biology.

[54]  Fredo Durand,et al.  The point about oxidative stress in molluscs , 2005 .

[55]  W. Funk,et al.  The complete complement of C1q-domain-containing proteins in Homo sapiens. , 2005, Genomics.

[56]  Rolf Apweiler,et al.  InterProScan: protein domains identifier , 2005, Nucleic Acids Res..

[57]  A. Bainy,et al.  Oxidative stress in digestive gland and gill of the brown mussel (Perna perna) exposed to air and re-submersed , 2005 .

[58]  Pavel A. Pevzner,et al.  De novo identification of repeat families in large genomes , 2005, ISMB.

[59]  Ewan Birney,et al.  Automated generation of heuristics for biological sequence comparison , 2005, BMC Bioinformatics.

[60]  Burkhard Morgenstern,et al.  AUGUSTUS: a web server for gene finding in eukaryotes , 2004, Nucleic Acids Res..

[61]  C. Paillard,et al.  Effect of temperature on defense parameters in manila clam Ruditapes philippinarum challenged with Vibrio tapetis. , 2004, Diseases of aquatic organisms.

[62]  John Quackenbush,et al.  TIGR Gene Indices clustering tools (TGICL): a software system for fast clustering of large EST datasets , 2003, Bioinform..

[63]  D. Livingstone Contaminant-stimulated reactive oxygen species production and oxidative damage in aquatic organisms. , 2001, Marine pollution bulletin.

[64]  K. Reid,et al.  C1q: structure, function, and receptors. , 2000, Immunopharmacology.

[65]  X. Huang,et al.  CAP3: A DNA sequence assembly program. , 1999, Genome research.

[66]  P. David Heterozygosity–fitness correlations: new perspectives on old problems , 1998, Heredity.

[67]  S. Eddy,et al.  tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. , 1997, Nucleic acids research.

[68]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[69]  J. Widdows,et al.  Activity and Metabolism in the Mussel Mytilus edulis L. during Intertidal Hypoxia and Aerobic Recovery , 1986, Physiological Zoology.

[70]  A. Zwaan,et al.  Anaerobic metabolism in Bivalvia (Mollusca). Characteristics of anaerobic metabolism. , 1976, Comparative biochemistry and physiology. B, Comparative biochemistry.