The complete human olfactory subgenome.

Olfactory receptors likely constitute the largest gene superfamily in the vertebrate genome. Here we present the nearly complete human olfactory subgenome elucidated by mining the genome draft with gene discovery algorithms. Over 900 olfactory receptor genes and pseudogenes (ORs) were identified, two-thirds of which were not annotated previously. The number of extrapolated ORs is in good agreement with previous theoretical predictions. The sequence of at least 63% of the ORs is disrupted by what appears to be a random process of pseudogene formation. ORs constitute 17 gene families, 4 of which contain more than 100 members each. "Fish-like" Class I ORs, previously considered a relic in higher tetrapods, constitute as much as 10% of the human repertoire, all in one large cluster on chromosome 11. Their lower pseudogene fraction suggests a functional significance. ORs are disposed on all human chromosomes except 20 and Y, and nearly 80% are found in clusters of 6-138 genes. A novel comparative cluster analysis was used to trace the evolutionary path that may have led to OR proliferation and diversification throughout the genome. The results of this analysis suggest the following genome expansion history: first, the generation of a "tetrapod-specific" Class II OR cluster on chromosome 11 by local duplication, then a single-step duplication of this cluster to chromosome 1, and finally an avalanche of duplication events out of chromosome 1 to most other chromosomes. The results of the data mining and characterization of ORs can be accessed at the Human Olfactory Receptor Data Exploratorium Web site (http://bioinfo.weizmann.ac.il/HORDE).

[1]  J. V. Moran,et al.  Initial sequencing and analysis of the human genome. , 2001, Nature.

[2]  Doron Lancet,et al.  The human olfactory subgenome: from sequence to structure and evolution , 2001, Human Genetics.

[3]  D. Lancet,et al.  The genomic structure of human olfactory receptor genes. , 2000, Genomics.

[4]  Doron Lancet,et al.  The olfactory receptor gene superfamily: data mining, classification, and nomenclature , 2000, Mammalian Genome.

[5]  Doron Lancet,et al.  Dichotomy of single-nucleotide polymorphism haplotypes in olfactory receptor genes and pseudogenes , 2000, Nature Genetics.

[6]  S. Thein,et al.  Gene regulation and deregulation: a β globin perspective , 2000 .

[7]  Doron Lancet,et al.  GESTALT: a workbench for automatic integration and visualization of large-scale genomic sequence analyses , 2000, Bioinform..

[8]  G Glusman,et al.  Sequence, structure, and evolution of a complete human olfactory receptor gene cluster. , 2000, Genomics.

[9]  Rolf Apweiler,et al.  The SWISS-PROT protein sequence database and its supplement TrEMBL in 2000 , 2000, Nucleic Acids Res..

[10]  J. Postlethwait,et al.  Genome maps 10. Comparative genomics. Mammalian radiations. Wall chart. , 1999, Science.

[11]  D. Lancet,et al.  Primate evolution of an olfactory receptor cluster: diversification by gene conversion and recent emergence of pseudogenes. , 1999, Genomics.

[12]  E. Feingold,et al.  An olfactory receptor gene is located in the extended human beta-globin gene cluster and is expressed in erythroid cells. , 1999, Genomics.

[13]  G. Jékely,et al.  The Evolution of the Calpain Family as Reflected in Paralogous Chromosome Regions , 1999, Journal of Molecular Evolution.

[14]  M. Groudine,et al.  Conservation of sequence and structure flanking the mouse and human β-globin loci: The β-globin genes are embedded within an array of odorant receptor genes , 1999 .

[15]  L. Buck,et al.  Combinatorial Receptor Codes for Odors , 1999, Cell.

[16]  B. Trask,et al.  A genomic region encompassing a cluster of olfactory receptor genes and a myosin light chain kinase (MYLK) gene is duplicated on human chromosome regions 3q13-q21 and 3p13. , 1999, Genomics.

[17]  S. Forlani,et al.  Initiation, establishment and maintenance of Hox gene expression patterns in the mouse. , 1999, The International journal of developmental biology.

[18]  P. Mombaerts,et al.  Molecular biology of odorant receptors in vertebrates. , 1999, Annual review of neuroscience.

[19]  G van den Engh,et al.  Large multi-chromosomal duplications encompass many members of the olfactory receptor gene family in the human genome. , 1998, Human molecular genetics.

[20]  H. Breer,et al.  Olfactory receptors in aquatic and terrestrial vertebrates , 1998, Journal of Comparative Physiology A.

[21]  D. Lancet,et al.  Organization and evolution of olfactory receptor genes on human chromosome 11. , 1998, Genomics.

[22]  N. M. Brooke,et al.  A molecular timescale for vertebrate evolution , 1998, Nature.

[23]  B. Trask,et al.  Distribution of olfactory receptor genes in the human genome , 1998, Nature Genetics.

[24]  H. Breer,et al.  Identification of a novel G-protein coupled receptor expressed in distinct brain regions and a defined olfactory zone. , 1998, Receptors & channels.

[25]  B. Trask,et al.  Members of the olfactory receptor gene family are contained in large blocks of DNA duplicated polymorphically near the ends of human chromosomes. , 1998, Human molecular genetics.

[26]  W R Pearson,et al.  Comparison of DNA sequences with protein sequences. , 1997, Genomics.

[27]  Thomas L. Madden,et al.  Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. , 1997, Nucleic acids research.

[28]  P. Vanderhaeghen,et al.  Molecular cloning and chromosomal mapping of olfactory receptor genes expressed in the male germ line: evidence for their wide distribution in the human genome. , 1997, Biochemical and biophysical research communications.

[29]  Gregory D Schuler,et al.  Sequence mapping by electronic PCR , 1997, Genome research.

[30]  S. Karlin,et al.  Prediction of complete gene structures in human genomic DNA. , 1997, Journal of molecular biology.

[31]  D. Lancet,et al.  Sequence analysis in the olfactory receptor gene cluster on human chromosome 17: recombinatorial events affecting receptor diversity. , 1996, Genomics.

[32]  P. Bernaola-Galván,et al.  Compositional segmentation and long-range fractal correlations in DNA sequences. , 1996, Physical review. E, Statistical physics, plasmas, fluids, and related interdisciplinary topics.

[33]  J. Thompson,et al.  Using CLUSTAL for multiple sequence alignments. , 1996, Methods in enzymology.

[34]  I. Connerton,et al.  Olfactory receptor-encoding genes and pseudogenes are expressed in humans. , 1996, Gene.

[35]  H. Breer,et al.  Two classes of olfactory receptors in xenopus laevis , 1995, Neuron.

[36]  D. Ledbetter,et al.  Olfactory receptor gene cluster on human chromosome 17: possible duplication of an ancestral receptor repertoire. , 1994, Human molecular genetics.

[37]  G M Shepherd,et al.  Emerging principles of molecular signal processing by mitral/tufted cells in the olfactory bulb. , 1994, Seminars in cell biology.

[38]  G. Bernardi,et al.  The isochore organization of the human genome and its evolutionary history--a review. , 1993, Gene.

[39]  E. Seidemann,et al.  Probability model for molecular recognition in biological receptor repertoires: significance to the olfactory system. , 1993, Proceedings of the National Academy of Sciences of the United States of America.

[40]  A. Chess,et al.  The family of genes encoding odorant receptors in the channel catfish , 1993, Cell.

[41]  A. Townsend-Nicholson,et al.  Novel G protein-coupled receptors: a gene family of putative human olfactory receptor sequences. , 1992, Brain research. Molecular brain research.

[42]  S. Schiffmann,et al.  Expression of members of the putative olfactory receptor gene family in mammalian germ cells , 1992, Nature.

[43]  R. Axel,et al.  A novel multigene family may encode odorant receptors: A molecular basis for odor recognition , 1991, Cell.

[44]  John S. Kauer,et al.  Contributions of topography and parallel processing to odor coding in the vertebrate olfactory pathway , 1991, Trends in Neurosciences.

[45]  Wen-Hsiung Li,et al.  Fundamentals of molecular evolution , 1990 .

[46]  M. J. Coon,et al.  The P450 superfamily: updated listing of all genes and recommended nomenclature for the chromosomal loci. , 1989, DNA.

[47]  C. Milstein,et al.  The Dynamic Nature of the Antibody Repertoire , 1988, Immunological reviews.

[48]  Jan L. A. van Rijckevorsek,et al.  Component and correspondence analysis: dimension reduction by functional approximation , 1988 .

[49]  D. Lancet Vertebrate olfactory reception. , 1986, Annual review of neuroscience.

[50]  Dayhoff Mo,et al.  The origin and evolution of protein superfamilies. , 1976 .