The strength of selection on ultraconserved elements in the human genome.

Ultraconserved elements are stretches of consecutive nucleotides that are perfectly conserved in multiple mammalian genomes. Although these sequences are identical in the reference human, mouse, and rat genomes, we identified numerous polymorphisms within these regions in the human population. To determine whether polymorphisms in ultraconserved elements affect fitness, we genotyped unrelated human DNA samples at loci within these sequences. For all single-nucleotide polymorphisms tested in ultraconserved regions, individuals homozygous for derived alleles (alleles that differ from the rodent reference genomes) were present, viable, and healthy. The distribution of allele frequencies in these samples argues against strong, ongoing selection as the force maintaining the conservation of these sequences. We then used two methods to determine the minimum level of selection required to generate these sequences. Despite the lack of fixed differences in these sequences between humans and rodents, the average level of selection on ultraconserved elements is less than that on essential genes. The strength of selection associated with ultraconserved elements suggests that mutations in these regions may have subtle phenotypic consequences that are not easily detected in the laboratory.

[1]  Tom H. Pringle,et al.  The human genome browser at UCSC. , 2002, Genome research.

[2]  Elizabeth M. Smigielski,et al.  dbSNP: the NCBI database of genetic variation , 2001, Nucleic Acids Res..

[3]  Jian Lu,et al.  Weak selection revealed by the whole-genome comparison of the X chromosome and autosomes of human and chimpanzee. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[4]  Ting Wang,et al.  Combining phylogenetic data with co-regulated genes to identify regulatory motifs , 2003, Bioinform..

[5]  Stephen T. Sherry,et al.  The Genetic Structure of Ancient Human Populations , 1993, Current Anthropology.

[6]  W. J. Kent,et al.  BLAT--the BLAST-like alignment tool. , 2002, Genome research.

[7]  Yuseob Kim Effect of strong directional selection on weakly selected mutations at linked sites: implication for synonymous codon usage. , 2003, Molecular biology and evolution.

[8]  Elena Rivas,et al.  Noncoding RNA gene detection using comparative sequence analysis , 2001, BMC Bioinformatics.

[9]  Colin N. Dewey,et al.  Initial sequencing and comparative analysis of the mouse genome. , 2002 .

[10]  Laurent Excoffier,et al.  Conserved noncoding sequences are selectively constrained and not mutation cold spots , 2006, Nature Genetics.

[11]  C. V. Jongeneel,et al.  Numerous potentially functional but non-genic conserved sequences on human chromosome 21 , 2002, Nature.

[12]  M. Nachman,et al.  Estimate of the mutation rate per nucleotide in humans. , 2000, Genetics.

[13]  H. Muller The American Journal of Human Genetics Vol . 2 No . 2 June 1950 Our Load of Mutations 1 , 2006 .

[14]  David Haussler,et al.  Into the heart of darkness: large-scale clustering of human non-coding DNA , 2004, ISMB/ECCB.

[15]  S. Eddy A Model of the Statistical Power of Comparative Genome Sequence Analysis , 2005, PLoS biology.

[16]  D. Hartl,et al.  Population genetics of polymorphism and divergence. , 1992, Genetics.

[17]  Shamil Sunyaev,et al.  Small fitness effect of mutations in highly conserved non-coding regions. , 2005, Human molecular genetics.

[18]  D. Roff The evolution of the G matrix: selection or drift? , 2000, Heredity.

[19]  Shamil Sunyaev,et al.  Evolutionary constraints in conserved nongenic sequences of mammals. , 2005, Genome research.

[20]  D. Haussler,et al.  Article Identification and Characterization of Multi-Species Conserved Sequences , 2022 .

[21]  Jeffrey Ross-Ibarra,et al.  Genetic Data Analysis II. Methods for Discrete Population Genentic Data , 2002 .

[22]  C. Bustamante,et al.  Population Genetics of Polymorphism and Divergence for Diploid Selection Models With Arbitrary Dominance , 2004, Genetics.

[23]  Ian Korf,et al.  Integrating genomic homology into gene structure prediction , 2001, ISMB.

[24]  G. Abecasis,et al.  A note on exact tests of Hardy-Weinberg equilibrium. , 2005, American journal of human genetics.

[25]  Charles R Cantor,et al.  The use of MassARRAY technology for high throughput genotyping. , 2002, Advances in biochemical engineering/biotechnology.

[26]  D Haussler,et al.  The share of human genomic DNA under selection estimated from human-mouse genomic alignments. , 2003, Cold Spring Harbor symposia on quantitative biology.

[27]  N. Takahata An attempt to estimate the effective size of the ancestral species common to two extant species from which homologous genes are sequenced. , 1986, Genetical research.

[28]  Chao Qian,et al.  Population , 1940, State Rankings 2020: A Statistical View of America.

[29]  D. Haussler,et al.  A distal enhancer and an ultraconserved exon are derived from a novel retroposon , 2006, Nature.

[30]  M. Kimura,et al.  On the probability of fixation of mutant genes in a population. , 1962, Genetics.

[31]  K. Holsinger The neutral theory of molecular evolution , 2004 .

[32]  M. Ruvolo,et al.  The geographic apportionment of mitochondrial genetic diversity in east African chimpanzees, Pan troglodytes schweinfurthii. , 1997, Molecular biology and evolution.

[33]  D. Haussler,et al.  Ultraconserved Elements in the Human Genome , 2004, Science.

[34]  E. Thompson,et al.  Performing the exact test of Hardy-Weinberg proportion for multiple alleles. , 1992, Biometrics.