Complete sequence and gene map of a human major histocompatibility complex

Here we report the first complete sequence and gene map of a human major histocompatibility complex (MHC), a region on chromosome 6 which is essential to the immune system (reviewed in ref. 1). When it was discovered over 50 years ago the region was thought to specify histocompatibility genes, but their nature has been resolved only in the last two decades. Although many of the 224 identified gene loci (128 predicted to be expressed) are still of unknown function, we estimate that about 40% of the expressed genes have immune system function. Over 50% of the MHC has been sequenced twice, in different haplotypes, giving insight into the extraordinary polymorphism and evolution of this region. Several genes, particularly of the MHC class II and III regions, can be traced by sequence similarity and synteny to over 700 million years ago, dearly predating the emergence of the adaptive immune system some 400 million years ago. The sequence is expected to be invaluable for the identification of many common disease loci. In the past, the search for these loci has been hampered by the complexity of high gene density and linkage disequilibrium.

[1]  C. Auffray,et al.  The chicken B locus is a minimal essential major histocompatibility complex , 1999, Nature.

[2]  Francisco Antequera,et al.  Initiation of DNA replication at CpG islands in mammalian chromosomes , 1998, The EMBO journal.

[3]  N. Takahata,et al.  Molecular clock and recombination in primate Mhc genes , 1999, Immunological reviews.

[4]  Jerzy K. Kulski,et al.  Genomics of the major histocompatibility complex: haplotypes, duplication, retroviruses and disease , 1999, Immunological reviews.

[5]  P. Dijkwel,et al.  On the nature of replication origins in higher eukaryotes. , 1995, Current opinion in genetics & development.

[6]  J. Klein Natural history of the major histocompatibility complex , 1986 .

[7]  S Beck,et al.  Gene organisation, sequence variation and isochore structure at the centromeric boundary of the human MHC. , 1999, Journal of molecular biology.

[8]  T Gojobori,et al.  Precise switching of DNA replication timing in the GC content transition area in the human major histocompatibility complex , 1997, Molecular and cellular biology.

[9]  H. Lehrach,et al.  Linkage of TATA-binding protein and proteasome subunit C5 genes in mice and humans reveals synteny conserved between mammals and invertebrates. , 1997, Genomics.

[10]  T. Ikemura,et al.  Three genes in the human MHC class III region near the junction with the class II: gene for receptor of advanced glycosylation end products, PBX2 homeobox gene and a notch homolog, human counterpart of mouse mammary tumor gene int-3. , 1994, Genomics.

[11]  R. Wolff,et al.  A 1.1-Mb transcript map of the hereditary hemochromatosis locus. , 1997, Genome research.

[12]  W. Bodmer,et al.  Evolutionary Significance of the HL-A System , 1972, Nature.

[13]  M. McGinnis,et al.  HLA-DQA1 allele and suballele typing using noncoding sequence polymorphisms. Application to 4AOHW cell panel typing. , 1993, Human immunology.

[14]  S Beck,et al.  Large-scale sequence comparisons reveal unusually high levels of variation in the HLA-DQB1 locus in the class II region of the human MHC. , 1998, Journal of molecular biology.

[15]  T. Miyata,et al.  Comparison and evolution of human immunoglobulin VH segments located in the 3' 0.8-megabase region. Evidence for unidirectional transfer of segmental gene sequences. , 1994, The Journal of biological chemistry.

[16]  M. Kasahara The chromosomal duplication model of the major histocompatibility complex , 1999, Immunological reviews.

[17]  S. Weissman,et al.  Evolving views of the major histocompatibility complex. , 1997, Blood.

[18]  D. Geraghty,et al.  The complete genomic sequence of 424,015 bp at the centromeric end of the HLA class I region: gene content and polymorphism. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[19]  A Ando,et al.  A boundary of long-range G + C% mosaic domains in the human MHC locus: pseudoautosomal boundary-like sequence exists near the boundary. , 1995, Genomics.