The evolutionary analysis of "orphans" from the Drosophila genome identifies rapidly diverging and incorrectly annotated genes.

In genome projects of eukaryotic model organisms, a large number of novel genes of unknown function and evolutionary history ("orphans") are being identified. Since many orphans have no known homologs in distant species, it is unclear whether they are restricted to certain taxa or evolve rapidly, either because of a lack of constraints or positive Darwinian selection. Here we use three criteria for the selection of putatively rapidly evolving genes from a single sequence of Drosophila melanogaster. Thirteen candidate genes were chosen from the Adh region on the second chromosome and 1 from the tip of the X chromosome. We succeeded in obtaining sequence from 6 of these in the closely related species D. simulans and D. yakuba. Only 1 of the 6 genes showed a large number of amino acid replacements and in-frame insertions/deletions. A population survey of this gene suggests that its rapid evolution is due to the fixation of many neutral or nearly neutral mutations. Two other genes showed "normal" levels of divergence between species. Four genes had insertions/deletions that destroy the putative reading frame within exons, suggesting that these exons have been incorrectly annotated. The evolutionary analysis of orphan genes in closely related species is useful for the identification of both rapidly evolving and incorrectly annotated genes.

[1]  Z. Yang,et al.  Substitution rates in Drosophila nuclear genes: implications for translational selection. , 2001, Genetics.

[2]  G. Bouffard,et al.  Gene discovery using computational and microarray analysis of transcription in the Drosophila melanogaster testis. , 2000, Genome research.

[3]  A. Clark,et al.  Molecular population genetics of male accessory gland proteins in Drosophila. , 2000, Genetics.

[4]  Z. Yang,et al.  Rates of nucleotide substitution and mammalian nuclear gene evolution. Approximate and maximum-likelihood methods lead to different conclusions. , 2000, Genetics.

[5]  R. Guigó,et al.  An assessment of gene prediction accuracy in large DNA sequences. , 2000, Genome research.

[6]  N. Goldman,et al.  Codon-substitution models for heterogeneous selection pressure at amino acid sites. , 2000, Genetics.

[7]  S. Lewis,et al.  Genome annotation assessment in Drosophila melanogaster. , 2000, Genome research.

[8]  B. Barrell,et al.  From sequence to chromosome: the tip of the X chromosome of D. melanogaster. , 2000, Science.

[9]  Stephen M. Mount,et al.  The genome sequence of Drosophila melanogaster. , 2000, Science.

[10]  Gerald J. Wyckoff,et al.  Rapid evolution of male reproductive genes in the descent of man , 2000, Nature.

[11]  D. Tautz,et al.  Large number of replacement polymorphisms in rapidly evolving genes of Drosophila. Implications for genome-wide surveys of DNA polymorphism. , 1999, Genetics.

[12]  R George,et al.  An exploration of the sequence of a 2.9-Mb region of the genome of Drosophila melanogaster: the Adh region. , 1999, Genetics.

[13]  Laurence D. Hurst,et al.  Do essential genes evolve slowly? , 1999, Current Biology.

[14]  S. Palumbi,et al.  Molecular genetics of ecological diversification: duplication and rapid evolution of toxin genes of the venomous gastropod Conus. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[15]  N. Blow,et al.  Adaptive evolution of color vision of the Comoran coelacanth (Latimeria chalumnae). , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[16]  L. Duret,et al.  Expression pattern and, surprisingly, gene length shape codon usage in Caenorhabditis, Drosophila, and Arabidopsis. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[17]  S. Brenner Errors in genome annotation. , 1999, Trends in genetics : TIG.

[18]  Julio Rozas,et al.  DnaSP version 3: an integrated program for molecular population genetics and molecular evolution analysis , 1999, Bioinform..

[19]  M. Aguadé,et al.  Natural selection on synonymous sites is correlated with gene length and recombination in Drosophila. , 1999, Genetics.

[20]  Yan P. Yuan,et al.  Predicting function: from genes to genomes and back. , 1998, Journal of molecular biology.

[21]  B C Meyers,et al.  Clusters of resistance genes in plants evolve by divergent selection and a birth-and-death process. , 1998, Genome research.

[22]  M. Kreitman,et al.  The correlation between synonymous and nonsynonymous substitutions in Drosophila: mutation, selection or relaxed constraints? , 1998, Genetics.

[23]  A. Civetta,et al.  Sex-related genes, directional sexual selection, and speciation. , 1998, Molecular biology and evolution.

[24]  P. Green,et al.  Base-calling of automated sequencer traces using phred. I. Accuracy assessment. , 1998, Genome research.

[25]  P. Green,et al.  Consed: a graphical tool for sequence finishing. , 1998, Genome research.

[26]  P. Bork,et al.  Predicting functions from protein sequences—where are the bottlenecks? , 1998, Nature Genetics.

[27]  C. Aquadro,et al.  Rates of DNA sequence evolution are not sex-biased in Drosophila melanogaster and D. simulans. , 1997, Molecular biology and evolution.

[28]  M. Boguski,et al.  Functional genomics: it's all how you read it. , 1997, Science.

[29]  D. Tautz,et al.  A screen for fast evolving genes from Drosophila. , 1997, Proceedings of the National Academy of Sciences of the United States of America.

[30]  Thomas L. Madden,et al.  Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. , 1997, Nucleic acids research.

[31]  J. Powell,et al.  Evolution of codon usage bias in Drosophila. , 1997, Proceedings of the National Academy of Sciences of the United States of America.

[32]  C. Chothia,et al.  Population statistics of protein structures: lessons from structural classifications. , 1997, Current opinion in structural biology.

[33]  S. Karlin,et al.  Prediction of complete gene structures in human genomic DNA. , 1997, Journal of molecular biology.

[34]  R. O’Neill,et al.  Evolution of the Sry genes. , 1997, Molecular biology and evolution.

[35]  H. Akashi,et al.  Molecular evolution between Drosophila melanogaster and D. simulans: reduced codon bias, faster rates of amino acid substitution, and larger proteins in D. melanogaster. , 1996, Genetics.

[36]  T Gojobori,et al.  Large-scale search for genes on which positive selection may operate. , 1996, Molecular biology and evolution.

[37]  S. Oliver From DNA sequence to biological function , 1996, Nature.

[38]  J. Powell,et al.  Intraspecific nuclear DNA variation in Drosophila. , 1996, Molecular biology and evolution.

[39]  W. Swanson,et al.  Extraordinary divergence and positive Darwinian selection in a fusagenic protein coating the acrosomal process of abalone spermatozoa. , 1995, Proceedings of the National Academy of Sciences of the United States of America.

[40]  H. Akashi,et al.  Inferring weak selection from patterns of polymorphism and divergence at "silent" sites in Drosophila DNA. , 1995, Genetics.

[41]  J. Thompson,et al.  CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. , 1994, Nucleic acids research.

[42]  J. Hey,et al.  The effects of mutation and natural selection on codon bias in the genes of Drosophila. , 1994, Genetics.

[43]  H. Akashi Synonymous codon usage in Drosophila melanogaster: natural selection and translational accuracy. , 1994, Genetics.

[44]  J. Hey,et al.  Reduced natural selection associated with low recombination in Drosophila melanogaster. , 1993, Molecular biology and evolution.

[45]  C. Aquadro,et al.  African and North American populations of Drosophila melanogaster are very different at the DNA level , 1993, Nature.

[46]  Philip M. Murphy,et al.  Molecular mimicry and the generation of host defense protein diversity , 1993, Cell.

[47]  W. Li,et al.  Statistical tests of neutrality of mutations. , 1993, Genetics.

[48]  C. Chothia One thousand families for the molecular biologist , 1992, Nature.

[49]  M. Kreitman,et al.  Adaptive protein evolution at the Adh locus in Drosophila , 1991, Nature.

[50]  R. Hudson,et al.  Inferring the evolutionary histories of the Adh and Adh-dup loci in Drosophila melanogaster from patterns of polymorphism and divergence. , 1991, Genetics.

[51]  F. Wright The 'effective number of codons' used in a gene. , 1990, Gene.

[52]  F. Tajima Statistical method for testing the neutral mutation hypothesis by DNA polymorphism. , 1989, Genetics.

[53]  D C Shields,et al.  "Silent" sites in Drosophila genes are not neutral: evidence of selection among synonymous codons. , 1988, Molecular biology and evolution.

[54]  R. Hudson,et al.  A test of neutral molecular evolution based on nucleotide data. , 1987, Genetics.

[55]  M. Nei Molecular Evolutionary Genetics , 1987 .

[56]  G. A. Watterson On the number of segregating sites in genetical models without recombination. , 1975, Theoretical population biology.

[57]  L. Orgel,et al.  Biochemical Evolution , 1971, Nature.