Gene Duplication and Gene Conversion in the Caenorhabditis elegans Genome

Abstract. A comprehensive analysis of gene duplication and gene conversion for 7394 Caenorhabditis elegans genes (about half the expected total for the genome) is presented. Of the genes examined, 40% are involved in duplicated gene pairs. Intrachromosomal, or cis, gene duplications occur approximately twice as often as expected. In general, the closer the two members of a duplicated gene pair are, the more likely it is that gene orientation is conserved. Gene conversion events are detectable between only 2% of the duplicated pairs. Even allowing for the excess of cis duplications, there is an excess of gene conversion events between cis duplicated pairs on every chromosome except the X chromosome. The relative rates of cis and trans gene conversion, and the negative correlation between conversion frequency and DNA sequence divergence in the unconverted regions of converted pairs, are consistent with previous experimental studies in yeast. Three recent regional duplications, each spanning three genes, are described. All three have already undergone substantial deletions spanning hundreds of base pairs. The relative rates of duplication and deletion may contribute to the compactness of the C. elegans genome.
