Cloning, analysis and functional annotation of expressed sequence tags from the Earthworm Eisenia fetida

BackgroundEisenia fetida, commonly known as red wiggler or compost worm, belongs to the Lumbricidae family of the Annelida phylum. Little is known about its genome sequence although it has been extensively used as a test organism in terrestrial ecotoxicology. In order to understand its gene expression response to environmental contaminants, we cloned 4032 cDNAs or expressed sequence tags (ESTs) from two E. fetida libraries enriched with genes responsive to ten ordnance related compounds using suppressive subtractive hybridization-PCR.ResultsA total of 3144 good quality ESTs (GenBank dbEST accession number EH669363–EH672369 and EL515444–EL515580) were obtained from the raw clone sequences after cleaning. Clustering analysis yielded 2231 unique sequences including 448 contigs (from 1361 ESTs) and 1783 singletons. Comparative genomic analysis showed that 743 or 33% of the unique sequences shared high similarity with existing genes in the GenBank nr database. Provisional function annotation assigned 830 Gene Ontology terms to 517 unique sequences based on their homology with the annotated genomes of four model organisms Drosophila melanogaster, Mus musculus, Saccharomyces cerevisiae, and Caenorhabditis elegans. Seven percent of the unique sequences were further mapped to 99 Kyoto Encyclopedia of Genes and Genomes pathways based on their matching Enzyme Commission numbers. All the information is stored and retrievable at a highly performed, web-based and user-friendly relational database called EST model database or ESTMD version 2.ConclusionThe ESTMD containing the sequence and annotation information of 4032 E. fetida ESTs is publicly accessible at http://mcbc.usm.edu/estmd/.

[1]  Mehdi Pirooznia,et al.  Toxicogenomic analysis provides new insights into molecular mechanisms of the sublethal toxicity of 2,4,6-trinitrotoluene in Eisenia fetida. , 2007, Environmental science & technology.

[2]  P Green,et al.  Base-calling of automated sequencer traces using phred. II. Error probabilities. , 1998, Genome research.

[3]  Gene Ontology Consortium The Gene Ontology (GO) database and informatics resource , 2003 .

[4]  John Parkinson,et al.  The earthworm Expressed Sequence Tag projectThe 7th international symposium on earthworm ecology · Cardiff · Wales · 2002 , 2003 .

[5]  Tipton Kf,et al.  Nomenclature Committee of the International Union of Biochemistry and Molecular Biology (NC-IUBMB). Enzyme nomenclature. Recommendations 1992. Supplement: corrections and additions. , 1994 .

[6]  P. Brown,et al.  Combining SSH and cDNA microarrays for rapid identification of differentially expressed genes. , 1999, Nucleic acids research.

[7]  Todd Wylie,et al.  Analysis and functional classification of transcripts from the nematode Meloidogyne incognita , 2003, Genome Biology.

[8]  Il Je Yu,et al.  Gene-expression profiling using suppression-subtractive hybridization and cDNA microarray in rat mononuclear cells in response to welding-fume exposure , 2004, Toxicology and industrial health.

[9]  William H. Press,et al.  Numerical Recipes in FORTRAN - The Art of Scientific Computing, 2nd Edition , 1987 .

[10]  M. Ashburner,et al.  Gene Ontology: tool for the unification of biology , 2000, Nature Genetics.

[11]  David Murphy,et al.  Microarray screening of suppression subtractive hybridization-PCR cDNA libraries identifies novel RNAs regulated by dehydration in the rat supraoptic nucleus. , 2006, Physiological genomics.

[12]  D. Vieau,et al.  Cloning and real-time PCR testing of 14 potential biomarkers in Eisenia fetida following cadmium exposure. , 2006, Environmental science & technology.

[13]  Kimberly Van Auken,et al.  WormBase: a multi-species resource for nematode biology and genomics , 2004, Nucleic Acids Res..

[14]  D. Nickerson,et al.  PolyPhred: automating the detection and genotyping of single nucleotide substitutions using fluorescence-based resequencing. , 1997, Nucleic acids research.

[15]  William H. Press,et al.  Numerical recipes in C. The art of scientific computing , 1987 .

[16]  J. Blake,et al.  Creating the Gene Ontology Resource : Design and Implementation The Gene Ontology Consortium 2 , 2001 .

[17]  Robert Giegerich,et al.  A discipline of dynamic programming over sequence data , 2004, Sci. Comput. Program..

[18]  Susumu Goto,et al.  KEGG: Kyoto Encyclopedia of Genes and Genomes , 2000, Nucleic Acids Res..

[19]  Youping Deng,et al.  Analysis and functional annotation of expressed sequence tags from the fall armyworm Spodoptera frugiperda , 2006, BMC Genomics.

[20]  Bart Naudts,et al.  Molecular impact of propiconazole on Daphnia magna using a reproduction-related cDNA array. , 2006, Comparative biochemistry and physiology. Toxicology & pharmacology : CBP.

[21]  F. A. Seiler,et al.  Numerical Recipes in C: The Art of Scientific Computing , 1989 .

[22]  John Reynolds,et al.  The Status of Earthworm Biogeography, Diversity, and Taxonomy in North America Revisited with Glimpses into the Future , 2004 .

[23]  Soon Cheol Park,et al.  Transcriptome analysis in the midgut of the earthworm (Eisenia andrei) using expressed sequence tags. , 2005, Biochemical and biophysical research communications.

[24]  Sébastien Lemière,et al.  Metallothionein response following cadmium exposure in the oligochaete Eisenia fetida. , 2006, Comparative biochemistry and physiology. Toxicology & pharmacology : CBP.

[25]  L. L. Lloyd,et al.  Enzyme nomenclature — Recommendations of the Nomenclature Committee of the International Union of Biochemistry and Molecular Biology: Academic Press Ltd, London, UK, 1992. xiii + 862 pp. Price £40.00. ISBN 0-12-227165-3 , 1994 .

[26]  Paolo Uva,et al.  An annotated cDNA library and microarray for large-scale gene-expression studies in the ant Solenopsis invicta , 2007, Genome Biology.

[27]  P. Green,et al.  Consed: a graphical tool for sequence finishing. , 1998, Genome research.

[28]  M Galay-Burgos,et al.  Developing a new method for soil pollution monitoring using molecular genetic biomarkers , 2003, Biomarkers : biochemical indicators of exposure, response, and susceptibility to chemicals.

[29]  Hiroyuki Ogata,et al.  KEGG: Kyoto Encyclopedia of Genes and Genomes , 1999, Nucleic Acids Res..

[30]  G. Mitta,et al.  The strong induction of metallothionein gene following cadmium exposure transiently affects the expression of many genes in Eisenia fetida: a trade-off mechanism? , 2007, Comparative biochemistry and physiology. Toxicology & pharmacology : CBP.

[31]  P. Green,et al.  Base-calling of automated sequencer traces using phred. I. Accuracy assessment. , 1998, Genome research.

[32]  Aaron P. Campbell,et al.  Suppression subtractive hybridization: a method for generating differentially regulated or tissue-specific cDNA probes and libraries. , 1996, Proceedings of the National Academy of Sciences of the United States of America.

[33]  Hui-Hsien Chou,et al.  DNA sequence quality trimming and vector removal , 2001, Bioinform..