A Scan for Positively Selected Genes in the Genomes of Humans and Chimpanzees

Since the divergence of humans and chimpanzees about 5 million years ago, these species have undergone a remarkable evolution with drastic divergence in anatomy and cognitive abilities. At the molecular level, despite the small overall magnitude of DNA sequence divergence, we might expect such evolutionary changes to leave a noticeable signature throughout the genome. We here compare 13,731 annotated genes from humans to their chimpanzee orthologs to identify genes that show evidence of positive selection. Many of the genes that present a signature of positive selection tend to be involved in sensory perception or immune defenses. However, the group of genes that show the strongest evidence for positive selection also includes a surprising number of genes involved in tumor suppression and apoptosis, and of genes involved in spermatogenesis. We hypothesize that positive selection in some of these genes may be driven by genomic conflict due to apoptosis during spermatogenesis. Genes with maximal expression in the brain show little or no evidence for positive selection, while genes with maximal expression in the testis tend to be enriched with positively selected genes. Genes on the X chromosome also tend to show an elevated tendency for positive selection. We also present polymorphism data from 20 Caucasian Americans and 19 African Americans for the 50 annotated genes showing the strongest evidence for positive selection. The polymorphism analysis further supports the presence of positive selection in these genes by showing an excess of high-frequency derived nonsynonymous mutations.

[1]  R. Simes,et al.  An improved Bonferroni procedure for multiple tests of significance , 1986 .

[2]  Nicholas H. Barton,et al.  The Relative Rates of Evolution of Sex Chromosomes and Autosomes , 1987, The American Naturalist.

[3]  M. Nei,et al.  Pattern of nucleotide substitution at major histocompatibility complex class I loci reveals overdominant selection , 1988, Nature.

[4]  M. Nei,et al.  Allelic genealogy under overdominant and frequency-dependent selection and polymorphism of major histocompatibility complex loci. , 1990, Genetics.

[5]  R. Hudson Gene genealogies and the coalescent process. , 1990 .

[6]  D. Hartl,et al.  Population genetics of polymorphism and divergence. , 1992, Genetics.

[7]  J. Thompson,et al.  CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. , 1994, Nucleic acids research.

[8]  R. Swerdloff,et al.  Involvement of apoptosis in the induction of germ cell degeneration in adult rats after gonadotropin-releasing hormone antagonist treatment. , 1995, Endocrinology.

[9]  T Gojobori,et al.  Large-scale search for genes on which positive selection may operate. , 1996, Molecular biology and evolution.

[10]  Ziheng Yang,et al.  PAML: a program package for phylogenetic analysis by maximum likelihood , 1997, Comput. Appl. Biosci..

[11]  D. Nickerson,et al.  PolyPhred: automating the detection and genotyping of single nucleotide substitutions using fluorescence-based resequencing. , 1997, Nucleic acids research.

[12]  Irene Garcia,et al.  An early and massive wave of germinal cell apoptosis is required for the development of functional spermatogenesis , 1997, The EMBO journal.

[13]  A. Hughes Rapid evolution of immunoglobulin superfamily C2 domains expressed in immune system cells. , 1997, Molecular biology and evolution.

[14]  W. Fitch,et al.  Long term trends in the evolution of H(3) HA1 human influenza type A. , 1997, Proceedings of the National Academy of Sciences of the United States of America.

[15]  R. Nielsen,et al.  Likelihood models for detecting positively selected amino acid sites and applications to the HIV-1 envelope gene. , 1998, Genetics.

[16]  V. Yang,et al.  Eukaryotic transcription factors: identification, characterization and functions. , 1998, The Journal of nutrition.

[17]  D. Grafham,et al.  Characterization of SCML1, a new gene in Xp22, with homology to developmental polycomb genes. , 1998, Genomics.

[18]  W. Fitch,et al.  Positive selection on the H3 hemagglutinin gene of human influenza virus A. , 1999, Molecular biology and evolution.

[19]  Ziheng Yang,et al.  Statistical methods for detecting molecular adaptation , 2000, Trends in Ecology & Evolution.

[20]  Doron Lancet,et al.  Dichotomy of single-nucleotide polymorphism haplotypes in olfactory receptor genes and pseudogenes , 2000, Nature Genetics.

[21]  Z. Yang,et al.  Estimating synonymous and nonsynonymous substitution rates under realistic evolutionary models. , 2000, Molecular biology and evolution.

[22]  A. Clark,et al.  Evolutionary biology: Protamine wars , 2000, Nature.

[23]  Gerald J. Wyckoff,et al.  Rapid evolution of male reproductive genes in the descent of man , 2000, Nature.

[24]  A. Clark,et al.  Evolutionary EST analysis identifies rapidly evolving male reproductive proteins in Drosophila , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[25]  D. Hartl,et al.  Directional selection and the site-frequency spectrum. , 2001, Genetics.

[26]  Fang Yang,et al.  An abundance of X-linked genes expressed in spermatogonia , 2001, Nature Genetics.

[27]  R. Nielsen Statistical tests of selective neutrality in the age of genomics , 2001, Heredity.

[28]  S. Pääbo,et al.  Intra- and Interspecific Variation in Primate Gene Expression Patterns , 2002, Science.

[29]  A. Orth,et al.  Large-scale analysis of the human and mouse transcriptomes , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[30]  Kateryna D. Makova,et al.  Strong male-driven evolution of DNA sequences in humans and apes , 2002, Nature.

[31]  Carsten Schwarz,et al.  Genomewide comparison of DNA sequences between humans and chimpanzees. , 2002, American journal of human genetics.

[32]  S. Necozione,et al.  Fas expression correlates with human germ cell degeneration in meiotic and post-meiotic arrest of spermatogenesis. , 2002, Molecular human reproduction.

[33]  Molly Przeworski,et al.  The signature of positive selection at randomly chosen loci. , 2002, Genetics.

[34]  M. Adams,et al.  Inferring Nonneutral Evolution from Human-Chimp-Mouse Orthologous Gene Trios , 2003, Science.

[35]  R. Nielsen,et al.  Pervasive adaptive evolution in mammalian fertilization proteins. , 2003, Molecular biology and evolution.

[36]  M. Kimmel,et al.  New explicit expressions for relative frequencies of single-nucleotide polymorphisms with application to statistical inference on population growth. , 2003, Genetics.

[37]  Rama S. Singh,et al.  Sex-linked mammalian sperm proteins evolve faster than autosomal ones. , 2003, Molecular biology and evolution.

[38]  S. Pääbo,et al.  A neutral explanation for the correlation of diversity with recombination rates in humans. , 2003, American journal of human genetics.

[39]  Anushya Muruganujan,et al.  PANTHER: a browsable database of gene products organized by biological function, using curated protein family and subfamily classification , 2003, Nucleic Acids Res..

[40]  M. Campbell,et al.  PANTHER: a library of protein families and subfamilies indexed by function. , 2003, Genome research.

[41]  Nick Goldman,et al.  Accuracy and Power of Statistical Methods for Detecting Adaptive Evolution in Protein Coding Sequences and for Identifying Positively Selected Sites , 2004, Genetics.

[42]  J. Sikela,et al.  Lineage-Specific Gene Duplication and Loss in Human and Great Ape Evolution , 2004, PLoS biology.

[43]  M. Emerman,et al.  Ancient Adaptive Evolution of the Primate Antiviral DNA-Editing Enzyme APOBEC3G , 2004, PLoS biology.

[44]  M. Hattori,et al.  DNA sequence and comparative analysis of chimpanzee chromosome 22 , 2004, Nature.

[45]  Christine M. Malcom,et al.  Accelerated Evolution of Nervous System Genes in the Origin of Homo sapiens , 2004, Cell.

[46]  H. A. Orr,et al.  A Pseudohitchhiking Model of X vs. Autosomal Diversity , 2004, Genetics.

[47]  Gabor T. Marth,et al.  The Allele Frequency Spectrum in Genome-Wide Human Variation Data Reveals Signals of Differential Demographic History in Three Large World Populations , 2004, Genetics.

[48]  Jonathan Pevsner,et al.  Progress in the use of microarray technology to study the neurobiology of disease , 2004, Nature Neuroscience.

[49]  Jianzhi Zhang,et al.  Frequent false detection of positive selection by the likelihood method with branch-site models. , 2004, Molecular biology and evolution.