Long-Read Sequencing of Chicken Transcripts and Identification of New Transcript Isoforms

The chicken has long served as an important model organism in many fields, and continues to aid our understanding of animal development. Functional genomics studies aimed at probing the mechanisms that regulate development require high-quality genomes and transcript annotations. The quality of these resources has improved dramatically over the last several years, but many isoforms and genes have yet to be identified. We hope to contribute to the process of improving these resources with the data presented here: a set of long cDNA sequencing reads, and a curated set of new genes and transcript isoforms not currently represented in the most up-to-date genome annotation currently available to the community of researchers who rely on the chicken genome.

[1]  International Human Genome Sequencing Consortium Initial sequencing and analysis of the human genome , 2001, Nature.

[2]  T. Andrews,et al.  The Ensembl automatic gene annotation system. , 2004, Genome research.

[3]  Cole Trapnell,et al.  TopHat2: accurate alignment of transcriptomes in the presence of insertions, deletions and gene fusions , 2013, Genome Biology.

[4]  Mouse Genome Sequencing Consortium Initial sequencing and comparative analysis of the mouse genome , 2002, Nature.

[5]  Gos Micklem,et al.  Supporting Online Material Materials and Methods Figs. S1 to S50 Tables S1 to S18 References Identification of Functional Elements and Regulatory Circuits by Drosophila Modencode , 2022 .

[6]  Tatiana Tatusova,et al.  NCBI Reference Sequence (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins , 2004, Nucleic Acids Res..

[7]  Raymond K. Auerbach,et al.  Integrative Analysis of the Caenorhabditis elegans Genome by the modENCODE Project , 2010, Science.

[8]  Colin N. Dewey,et al.  Sequence and comparative analysis of the chicken genome provide unique perspectives on vertebrate evolution , 2004, Nature.

[9]  Tom H. Pringle,et al.  The human genome browser at UCSC. , 2002, Genome research.

[10]  D. Mccormick Sequence the Human Genome , 1986, Bio/Technology.

[11]  J. Berg Genome sequence of the nematode C. elegans: a platform for investigating biology. , 1998, Science.

[12]  J. Murray,et al.  Mutations in the human forkhead transcription factor FOXE3 associated with anterior segment ocular dysgenesis and cataracts. , 2001, Human molecular genetics.

[13]  S. Bergmann,et al.  The evolution of gene expression levels in mammalian organs , 2011, Nature.

[14]  Ewan Birney,et al.  Transcriptome analysis for the chicken based on 19,626 finished cDNA sequences and 485,337 expressed sequence tags. , 2005, Genome research.

[15]  S. Anand,et al.  MicroRNA-132–mediated loss of p120RasGAP activates the endothelium to facilitate pathological angiogenesis , 2010, Nature Medicine.

[16]  ENCODEConsortium,et al.  An Integrated Encyclopedia of DNA Elements in the Human Genome , 2012, Nature.

[17]  D. Burt,et al.  Emergence of the chicken as a model organism: implications for agriculture and biology. , 2007, Poultry science.

[18]  Data production leads,et al.  An integrated encyclopedia of DNA elements in the human genome , 2012 .

[19]  Thomas D. Wu,et al.  GMAP: a genomic mapping and alignment program for mRNA and EST sequence , 2005, Bioinform..

[20]  Philip Cayting,et al.  An encyclopedia of mouse DNA elements (Mouse ENCODE) , 2012, Genome Biology.

[21]  J. Seidman,et al.  Regulation of chamber-specific gene expression in the developing heart by Irx4. , 1999, Science.

[22]  International Human Genome Sequencing Consortium Sequence and comparative analysis of the chicken genome provide unique perspectives on vertebrate evolution , 2004 .

[23]  S. Searle,et al.  The Ensembl analysis pipeline. , 2004, Genome research.

[24]  Stephen M. Mount,et al.  The genome sequence of Drosophila melanogaster. , 2000, Science.

[25]  Raymond K. Auerbach,et al.  An Integrated Encyclopedia of DNA Elements in the Human Genome , 2012, Nature.

[26]  Colin N. Dewey,et al.  De novo transcript sequence reconstruction from RNA-seq using the Trinity platform for reference generation and analysis , 2013, Nature Protocols.

[27]  Andrew Smith Genome sequence of the nematode C-elegans: A platform for investigating biology , 1998 .

[28]  T. Tatusova,et al.  NCBI reference sequences (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins , 2006, Nucleic Acids Research.

[29]  Colin N. Dewey,et al.  Initial sequencing and comparative analysis of the mouse genome. , 2002 .