Genome duplications and other features in 12 Mb of DNA sequence from human chromosome 16p and 16q.

Several publicly funded large-scale sequencing efforts have been initiated with the goal of completing the first reference human genome sequence by the year 2005. Here we present the results of analysis of 11.8 Mb of genomic sequence from chromosome 16. The apparent gene density varies throughout the region, but the number of genes predicted (84) suggests that this is a gene-poor region. This result may also suggest that the total number of human genes is likely to be at the lower end of published estimates. One of the most interesting aspects of this region of the genome is the presence of highly homologous, recently duplicated tracts of sequence distributed throughout the p-arm. Such duplications have implications for mapping and gene analysis as well as the predisposition to recurrent chromosomal structural rearrangements associated with genetic disease.

[1]  E. Eichler,et al.  Masquerading repeats: paralogous pitfalls of the human genome. , 1998, Genome research.

[2]  G. Germino,et al.  Gene conversion is a likely cause of mutation in PKD1. , 1998, Human molecular genetics.

[3]  M. Mattéi,et al.  A large polymorphic repeat in the pericentromeric region of human chromosome 15q contains three partial gene duplications. , 1998, Human molecular genetics.

[4]  M. L. Le Beau,et al.  Inversion of chromosome 16 and uncommon rearrangements of the CBFB and MYH11 genes in therapy-related acute myeloid leukemia: rare events related to DNA-topoisomerase II inhibitors? , 1998, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[5]  G C Overton,et al.  Analysis of EST-driven gene annotation in human genomic sequence. , 1998, Genome research.

[6]  H. Jacob,et al.  EbEST: an automated tool using expressed sequence tags to delineate gene structure. , 1998, Genome research.

[7]  P. Green,et al.  Base-calling of automated sequencer traces using phred. I. Accuracy assessment. , 1998, Genome research.

[8]  P Green,et al.  Base-calling of automated sequencer traces using phred. II. Error probabilities. , 1998, Genome research.

[9]  J. Iovanna,et al.  Cloning and Expression of the Rat p8 cDNA, a New Gene Activated in Pancreas during the Acute Phase of Pancreatitis, Pancreatic Development, and Regeneration, and Which Promotes Cellular Growth* , 1997, The Journal of Biological Chemistry.

[10]  M. Adams,et al.  A tool for analyzing and annotating genomic sequences. , 1997, Genomics.

[11]  R. Wilson,et al.  High throughput fingerprint analysis of large-insert clones. , 1997, Genome research.

[12]  J. Rubin,et al.  Fluorescence in situ hybridization analysis of keratinocyte growth factor gene amplification and dispersion in evolution of great apes and humans. , 1997, Proceedings of the National Academy of Sciences of the United States of America.

[13]  A. C. Chinault,et al.  Homologous recombination of a flanking repeat gene cluster is a mechanism for a common contiguous gene deletion syndrome , 1997, Nature Genetics.

[14]  K. Lindpaintner,et al.  Mapping of both autosomal recessive and dominant variants of pseudoxanthoma elasticum to chromosome 16p13.1. , 1997, Human molecular genetics.

[15]  N. W. Davis,et al.  The complete genome sequence of Escherichia coli K-12. , 1997, Science.

[16]  A. Rosenthal,et al.  Genomic Organization of Two Novel Genes on Human Xq28: Compact Head to Head Arrangement ofIDHγ andTRAPδ Is Conserved in Rat and Mouse , 1997 .

[17]  L. Sandkuijl,et al.  A locus for autosomal recessive pseudoxanthoma elasticum, with penetrance of vascular symptoms in carriers, maps to chromosome 16p13.1. , 1997, Genome research.

[18]  E. Eichler,et al.  Interchromosomal duplications of the adrenoleukodystrophy locus: a phenomenon of pericentromeric plasticity. , 1997, Human molecular genetics.

[19]  Ying Xu,et al.  Inferring Gene Structures in Genomic Sequences Using Pattern Recognition and Expressed Sequence Tags , 1997, ISMB.

[20]  Thomas L. Madden,et al.  PowerBLAST: a new network BLAST application for interactive or automated sequence analysis and annotation. , 1997, Genome research.

[21]  S. Karlin,et al.  Prediction of complete gene structures in human genomic DNA. , 1997, Journal of molecular biology.

[22]  S Minoshima,et al.  One-megabase sequence analysis of the human immunoglobulin lambda gene locus. , 1997, Genome research.

[23]  R. Moyzis,et al.  Genomic structure and complete nucleotide sequence of the Batten disease gene, CLN3. , 1997, Genomics.

[24]  D. Paslier,et al.  Human Chromosomal Fragile Site FRA16B Is an Amplified AT-Rich Minisatellite Repeat , 1997, Cell.

[25]  T. Kinzy,et al.  Conservation and Diversity of Eukaryotic Translation Initiation Factor eIF3* , 1997, The Journal of Biological Chemistry.

[26]  B. Dutrillaux,et al.  Emergence and scattering of multiple neurofibromatosis (NF1)-related sequences during hominoid evolution suggest a process of pericentromeric interchromosomal transposition. , 1997, Human molecular genetics.

[27]  Edward C. Uberbacher,et al.  Automated Gene Identification in Large-Scale Genomic Sequences , 1997, J. Comput. Biol..

[28]  A. Smit,et al.  The origin of interspersed repeats in the human genome. , 1996, Current opinion in genetics & development.

[29]  R. Davey,et al.  The anthracycline resistance-associated (ara) gene, a novel gene associated with multidrug resistance in a human leukaemia cell line. , 1996, British Journal of Cancer.

[30]  B. Barrell,et al.  Life with 6000 Genes , 1996, Science.

[31]  E. Mardis,et al.  Generation and analysis of 280,000 human expressed sequence tags. , 1996, Genome research.

[32]  K. O. Elliston,et al.  Toward the development of a gene index to the human genome: an assessment of the nature of high-throughput EST sequence data. , 1996, Genome research.

[33]  E. Eichler,et al.  Duplication of a gene-rich cluster between 16p11.1 and Xq28: a novel pericentromeric-directed mechanism for paralogous genome evolution. , 1996, Human molecular genetics.

[34]  C. Wijmenga,et al.  Identification of the chimeric protein product of the CBFB‐MYH11 fusion gene in inv(16) leukemia cells , 1996 .

[35]  H. Gainer,et al.  Identification of a novel protein containing two C2 domains selectively expressed in the rat brain and kidney , 1996, FEBS letters.

[36]  N. Nomura,et al.  Prediction of the coding sequences of unidentified human genes. VI. The coding sequences of 80 new genes (KIAA0201-KIAA0280) deduced by analysis of cDNA clones from cell line KG-1 and brain. , 1996, DNA research : an international journal for rapid publication of reports on genes and genomes.

[37]  R. Fleischmann,et al.  The Minimal Gene Complement of Mycoplasma genitalium , 1995, Science.

[38]  N. Doggett,et al.  An integrated physical map of human chromosome 16. , 1995, Nature.

[39]  R. Fleischmann,et al.  Initial assessment of human gene diversity and expression patterns based upon 83 million nucleotides of cDNA sequence. , 1995, Nature.

[40]  R. Fleischmann,et al.  Whole-genome random sequencing and assembly of Haemophilus influenzae Rd. , 1995, Science.

[41]  J. Hughes,et al.  The polycystic kidney disease 1 (PKD1) gene encodes a novel protein with multiple cell recognition domains , 1995, Nature Genetics.

[42]  Owen White,et al.  TIGR Assembler: A New Tool for Assembling Large Shotgun Sequencing Projects , 1995 .

[43]  Morris Schambelan,et al.  Liddle's syndrome: heritable human hypertension caused by mutations in the β subunit of the epithelial sodium channel , 1994, Cell.

[44]  Yin Xu,et al.  An Improved System for Exon Recognition and Gene Modeling in Human DNA Sequence , 1994, ISMB.

[45]  F. Matsuda,et al.  Recent translocation of variable and diversity segments of the human immunoglobulin heavy chain from chromosome 14 to chromosomes 15 and 16. , 1994, Genomics.

[46]  M. Adams,et al.  How many genes in the human genome? , 1994, Nature Genetics.

[47]  D. Le Paslier,et al.  Implications of FRA16A structure for the mechanism of chromosomal fragile site genesis. , 1994, Science.

[48]  Siep Thomas,et al.  THE POLYCYSTIC KIDNEY-DISEASE-1 GENE ENCODES A 14-KB TRANSCRIPT AND LIES WITHIN A DUPLICATED REGION ON CHROMOSOME-16 , 1994 .

[49]  N. Carter,et al.  Human immunoglobulin VH and D segments on chromosomes 15q11.2 and 16p11.2. , 1994, Human molecular genetics.

[50]  N. Samani,et al.  Chromosomal assignment of the human SA gene to 16p13.11 and demonstration of its expression in the kidney. , 1994, Biochemical and biophysical research communications.

[51]  D. Ward,et al.  In situ hybridization mapping of human chromosome 16: evidence for a high frequency of repetitive DNA sequences. , 1994, Cytogenetics and cell genetics.

[52]  Stylianos E. Antonarakis,et al.  Inversions disrupting the factor VIII gene are a common cause of severe haemophilia A , 1993, Nature Genetics.

[53]  D. Guo,et al.  Chromosomal assignment of human and rat hypertension candidate genes: type 1 angiotensin II receptor genes and the SA gene , 1993, Journal of hypertension.

[54]  J. Attwood,et al.  Fine genetic mapping of the Batten disease locus (CLN3) by haplotype analysis and demonstration of allelic association with chromosome 16p microsatellite loci. , 1993, Genomics.

[55]  R. Stallings,et al.  Refined physical mapping of chromosome 16-specific low-abundance repetitive DNA sequences. , 1993, Cytogenetics and cell genetics.

[56]  R Berger,et al.  A new gene, BCM, on chromosome 16 is fused to the interleukin 2 gene by a t(4;16)(q26;p13) translocation in a malignant T cell lymphoma. , 1992, The EMBO journal.

[57]  D. Ward,et al.  Chromosome 16-specific repetitive DNA sequences that map to chromosomal regions known to undergo breakage/rearrangement in leukemia cells. , 1992, Genomics.

[58]  Y. Murakami,et al.  Mapping of the humanGSPT1 gene, a human homolog of the yeastGST1 gene, to chromosomal band 16p13.1 , 1992, Somatic cell and molecular genetics.

[59]  A. Hagemeijer,et al.  Extensive cross-homology between the long and the short arm of chromosome 16 may explain leukemic inversions and translocations. , 1992, Blood.

[60]  J. Lalouel,et al.  A chimaeric llβ-hydroxylase/aldosterone synthase gene causes glucocorticoid-remediable aldosteronism and human hypertension , 1992, Nature.

[61]  L. Liotta,et al.  Cloning and characterization of a novel human cDNA that has DNA similarity to the conserved region of the collagenase gene family. , 1992, Genomics.

[62]  A. Kerlavage,et al.  Complementary DNA sequencing: expressed sequence tags and human genome project , 1991, Science.

[63]  H. Zachau,et al.  Structural features of transposed human VK genes and implications for the mechanism of their transpositions. , 1990, Nucleic acids research.

[64]  J. Nathans,et al.  Molecular genetics of inherited variation in human color vision. , 1986, Science.

[65]  N. Maeda,et al.  Duplication within the haptoglobin Hp2 gene , 1984, Nature.

[66]  S. Povey,et al.  Immunoglobulin heavy chain genes in humans are located on chromosome 14 , 1981 .