Digital Commons@Becker

The repetitive landscape of the chicken genome

Cot-based cloning and sequencing (CBCS) is a powerful tool for isolating and characterizing the various repetitive components of any genome, combining the established principles of DNA reassociation kinetics with high-throughput sequencing. CBCS was used to generate sequence libraries representing the high-, middle-, and low-copy fractions of the chicken genome. Sequencing the high-copy DNA of chicken to about 2.7× coverage of its estimated sequence complexity led to the initial identification of several new repeat families, which were then used to survey the newly released first draft of the complete chicken genome. The analysis provided insight into the diversity and biology of known repeat structures such as CR1 and CNM, for which only limited sequence data had previously been available. Cot sequence data also resulted in the identification of four novel repeats (Birddawg, Hitchcock, Kronos, and Soprano), two new subfamilies of CR1 repeats, and many elements absent from the chicken genome assembly. Multiple autonomous elements were found for a novel Mariner-like transposon, Galluhop, in addition to nonautonomous deletion derivatives. Phylogenetic analysis of the high-copy repeats CR1, Galluhop, and Birddawg provided insight into two distinct genome dispersion strategies. This study also exemplifies the power of the CBCS method to create representative databases for the repetitive fractions of genomes for which only limited sequence data are available.
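The two quantitative ideas behind CBCS mentioned above, reassociation kinetics and coverage of sequence complexity, can be illustrated with a minimal sketch. It assumes the ideal second-order reassociation model, C/C0 = 1 / (1 + k·C0t), and treats coverage as total sequenced bases divided by the estimated complexity of the fraction. The function names, rate constants, and base counts are hypothetical and chosen only for illustration; they are not values from the study.

```python
def fraction_single_stranded(cot: float, k: float) -> float:
    """Fraction of DNA still single-stranded at a given Cot value.

    Ideal second-order kinetics: C/C0 = 1 / (1 + k * C0t).
    `k` is the reassociation rate constant of one kinetic component
    (hypothetical values are used below).
    """
    if cot < 0 or k <= 0:
        raise ValueError("cot must be >= 0 and k must be > 0")
    return 1.0 / (1.0 + k * cot)


def cot_half(k: float) -> float:
    """Cot value at which half of a component has reassociated (1 / k)."""
    if k <= 0:
        raise ValueError("k must be > 0")
    return 1.0 / k


def coverage(total_bases_sequenced: float, sequence_complexity: float) -> float:
    """Fold coverage of a fraction: sequenced bases / estimated complexity."""
    if sequence_complexity <= 0:
        raise ValueError("sequence_complexity must be > 0")
    return total_bases_sequenced / sequence_complexity


if __name__ == "__main__":
    # Hypothetical rate constants: high-copy DNA reassociates much faster
    # than low-copy DNA, which is what lets Cot fractionation separate them.
    k_high_copy, k_low_copy = 10.0, 0.001
    for cot in (0.01, 1.0, 100.0):
        print(
            cot,
            round(fraction_single_stranded(cot, k_high_copy), 4),
            round(fraction_single_stranded(cot, k_low_copy), 4),
        )
    # Hypothetical read total and complexity chosen to give 2.7-fold coverage.
    print(coverage(2_700_000, 1_000_000))
```

At a low Cot value most of the high-copy component has already reannealed while the low-copy component remains largely single-stranded, which is the property that allows the fractions to be separated and cloned independently.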
