PEDE (Pig EST Data Explorer): construction of a database for ESTs derived from porcine full-length cDNA libraries

We generated the PEDE (Pig EST Data Explorer; http://pede.dna.affrc.go.jp/) database using sequences assembled from porcine 5' ESTs from oligo-capped full-length cDNA libraries. Thus far we have performed EST analysis of various organs (thymus, spleen, uterus, lung, liver, ovary and peripheral blood mononuclear cells) and assembled 68,076 high-quality sequences into 5546 contigs and 28,461 singlets. PEDE provides a search interface for getting results of homology searches and enables users to obtain information on sequence data and cDNA clones of interest. Single-nucleotide polymorphisms detected through comparison of the EST sequences are classified by origin (western and oriental breeds) and are searchable in the database. This database system can accelerate analyses of livestock traits and yields information that can lead to new applications in pigs as model systems for medical research.

[1]  Daniel Lee,et al.  The TIGR Gene Indices: analysis of gene transcript sequences in highly sampled eukaryotic species , 2001, Nucleic Acids Res..

[2]  Patrick Chardon,et al.  Sequence of the pig major histocompatibility region containing the classical class I genes , 2001, Immunogenetics.

[3]  J. Platt,et al.  The immunological barrier to xenotransplantation. , 2001, Immunity.

[4]  T. Ideker,et al.  Mining SNPs from EST databases. , 1999, Genome research.

[5]  Toyoyuki Takada,et al.  Genomic organization of the mammalian MHC. , 2003, Annual review of immunology.

[6]  G. Plastow,et al.  Construction of a new porcine whole-genome framework map using a radiation hybrid panel. , 2003, Animal genetics.

[7]  X. Huang,et al.  CAP3: A DNA sequence assembly program. , 1999, Genome research.

[8]  Takashi Shiina,et al.  Genomic structure around joining segments and constant regions of swine T-cell receptor alpha/delta (TRA/TRD) locus. , 2003, Immunology.

[9]  C Rogel-Gaillard,et al.  Sequence of the swine major histocompatibility complex region containing all non-classical class I genes. , 2001, Tissue antigens.

[10]  John Quackenbush,et al.  TIGR Gene Indices clustering tools (TGICL): a software system for fast clustering of large EST datasets , 2003, Bioinform..

[11]  Gregory D. Schuler,et al.  Database resources of the National Center for Biotechnology , 2003, Nucleic Acids Res..

[12]  G. Gerard,et al.  Reverse Transcriptase , 1997, Molecular biotechnology.

[13]  N. Shimizu,et al.  Construction and evaluation of a porcine bacterial artificial chromosome library. , 2000, Animal genetics.

[14]  P. Green,et al.  Base-calling of automated sequencer traces using phred. I. Accuracy assessment. , 1998, Genome research.

[15]  Y. Wada,et al.  Development of an animal genome database and its search system , 1996, Comput. Appl. Biosci..

[16]  Takashi Shiina,et al.  Genomic structure around joining segments and constant regions of swine T‐cell receptor α/δ (TRA/TRD) locus , 2003 .

[17]  E. Kobayashi,et al.  A linkage map of 243 DNA markers in an intercross of Göttingen miniature and Meishan pigs. , 1999, Animal genetics.

[18]  D. Nickerson,et al.  PolyPhred: automating the detection and genotyping of single nucleotide substitutions using fluorescence-based resequencing. , 1997, Nucleic acids research.

[19]  Y. Suzuki,et al.  Construction and characterization of a full length-enriched and a 5'-end-enriched cDNA library. , 1997, Gene.

[20]  P Green,et al.  Base-calling of automated sequencer traces using phred. II. Error probabilities. , 1998, Genome research.

[21]  A Onishi,et al.  Pig cloning by microinjection of fetal fibroblast nuclei. , 2000, Science.