Large-scale sequencing of two regions in human chromosome 7q22: analysis of 650 kb of genomic sequence around the EPO and CUTL1 loci reveals 17 genes.

We have sequenced and annotated two genomic regions located in the Giemsa negative band q22 of human chromosome 7. The first region defined by the erythropoietin (EPO) locus is 228 kb in length and contains 13 genes. Whereas 3 genes (GNB2, EPO, PCOLCE) were known previously on the mRNA level, we have been able to identify 10 novel genes using a newly developed automatic annotation tool RUMMAGE-DP, which comprises >26 different programs mainly for exon prediction, homology searches, and compositional and repeat analysis. For precise annotation we have also resequenced ESTs identified to the region and assembled them to build large cDNAs. In addition, we have investigated the differential splicing of genes. Using these tools we annotated 4 of the 10 genes as a zonadhesin, a transferrin homolog, a nucleoporin-like gene, and an actin gene. Two genes showed weak similarity to an insulin-like receptor and a neuronal protein with a leucine-rich amino-terminal domain. Four predicted genes (CDS1-CDS4) CDS that have been confirmed on the mRNA level showed no similarity to known proteins and a potential function could not be assigned. The second region in 7q22 defined by the CUTL1 (CCAAT displacement protein and its splice variant) locus is 416 kb in length and contains three known genes, including PMSL12, APS, CUTL1, and a novel gene (CDS5). The CUTL1 locus, consisting of two splice variants (CDP and CASP), occupies >300 kb. Based on the G, C profile an isochore switch can be defined between the CUTL1 gene and the APS and PMSL12 genes.

[1]  S. Scherer,et al.  PMS2-related genes flank the rearrangement breakpoints associated with Williams syndrome and other diseases on human chromosome 7. , 1997, Genomics.

[2]  A. Stagg,et al.  CASP, a novel, highly conserved alternative-splicing product of the CDP/cut/cux gene, lacks cut-repeat and homeo DNA-binding domains, and interacts with full-length CDP in vitro. , 1997, Gene.

[3]  A. Yoshimura,et al.  Cloning and characterization of APS, an adaptor molecule containing PH and SH2 domains that is tyrosine phosphorylated upon B-cell receptor stimulation , 1997, Oncogene.

[4]  U. Surti,et al.  Two discrete regions of deletion at 7q in uterine leiomyomas , 1997, Genes, chromosomes & cancer.

[5]  S. Scherer,et al.  Loss of heterozygosity and reduced expression of the CUTL1 gene in uterine leiomyomas , 1997, Oncogene.

[6]  S. Karlin,et al.  Prediction of complete gene structures in human genomic DNA. , 1997, Journal of molecular biology.

[7]  G. Lienhard,et al.  The 60-kDa Phosphotyrosine Protein in Insulin-treated Adipocytes Is a New Member of the Insulin Receptor Substrate Family* , 1997, The Journal of Biological Chemistry.

[8]  D. Garbers,et al.  Chromosome localization of the mouse zonadhesin gene and the human zonadhesin gene (ZAN). , 1997, Genomics.

[9]  S. Scherer,et al.  Molecular cytogenetic delineation of deletions and translocations involving chromosome band 7q22 in myeloid leukemias. , 1997, Blood.

[10]  D. Richardson,et al.  The molecular mechanisms of the metabolism and transport of iron in normal and neoplastic cells. , 1997, Biochimica et biophysica acta.

[11]  Michael Ruogu Zhang,et al.  Identification of protein coding regions in the human genome by quadratic discriminant analysis. , 1997, Proceedings of the National Academy of Sciences of the United States of America.

[12]  R Staden,et al.  The staden sequence analysis package , 1996, Molecular biotechnology.

[13]  S. Agrawal,et al.  Enzymatic labeling of nucleic acids , 1996, Molecular biotechnology.

[14]  Jerzy Jurka,et al.  Censor - a Program for Identification and Elimination of Repetitive Elements From DNA Sequences , 1996, Comput. Chem..

[15]  S. Scherer,et al.  Fine mapping of the human and mouse genes for the type I procollagen COOH-terminal proteinase enhancer protein. , 1996, Genomics.

[16]  K. Kinzler,et al.  Genomic organization of the human PMS2 gene family. , 1995, Genomics.

[17]  D. Garbers,et al.  A Sperm Membrane Protein That Binds in a Species-specific Manner to the Egg Extracellular Matrix Is Homologous to von Willebrand Factor (*) , 1995, The Journal of Biological Chemistry.

[18]  B. Cullen,et al.  Identification of a novel cellular cofactor for the Rev/Rex class of retroviral regulatory proteins , 1995, Cell.

[19]  Michael R. Green,et al.  A human nucleoporin-like protein that specifically interacts with HIV Rev , 1995, Nature.

[20]  D. S. Prestridge Predicting Pol II promoter sequences using transcription factor binding sites. , 1995, Journal of molecular biology.

[21]  S. Cross,et al.  CpG islands and genes. , 1995, Current opinion in genetics & development.

[22]  J. Deisenhofer,et al.  A structural basis of the interactions between leucine-rich repeats and protein ligands , 1995, Nature.

[23]  H. Prydz,et al.  Evaluation of the exon predictions of the GRAIL software. , 1994, Genomics.

[24]  R. Eddy,et al.  Type I procollagen COOH-terminal proteinase enhancer protein: identification, primary structure, and chromosomal localization of the cognate human gene (PCOLCE). , 1994, The Journal of biological chemistry.

[25]  E. Mardis High-throughput detergent extraction of M13 subclones for fluorescent DNA sequencing. , 1994, Nucleic acids research.

[26]  D. Dufort,et al.  The human cut homeodomain protein represses transcription from the c-myc promoter , 1994, Molecular and cellular biology.

[27]  X. Huang,et al.  An algorithm for identifying regions of a DNA sequence that satisfy a content requirement , 1994, Comput. Appl. Biosci..

[28]  R. Durbin,et al.  2.2 Mb of contiguous nucleotide sequence from chromosome III of C. elegans , 1994, Nature.

[29]  F. Zimmermann,et al.  Sequence and function analysis of a 4·3 kb fragment of Saccharomyces cerevisiae chromosome II including three open reading frames , 1993, Yeast.

[30]  Amos Bairoch,et al.  The PROSITE dictionary of sites and patterns in proteins, its current status , 1993, Nucleic Acids Res..

[31]  S. Scherer,et al.  Refined localization and yeast artificial chromosome (YAC) contig--mapping of genes and DNA segments in the 7q21-q32 region. , 1993, Human molecular genetics.

[32]  S. Scherer,et al.  Regional localization of the CCAAT displacement protein gene (CUTL1) to 7q22 by analysis of somatic cell hybrids. , 1993, Genomics.

[33]  P. Borst Transferrin receptor, antigenic variation and the prospect of a trypanosome vaccine. , 1991, Trends in genetics : TIG.

[34]  A. Mould,et al.  Procollagen type I C-proteinase enhancer is a naturally occurring connective tissue glycoprotein. , 1990, Biochemical and biophysical research communications.

[35]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[36]  D. T. Jones,et al.  Chromosomal localization of genes encoding guanine nucleotide-binding protein subunits in mouse and human. , 1988, Proceedings of the National Academy of Sciences of the United States of America.

[37]  B. Birren,et al.  Distinct forms of the beta subunit of GTP-binding regulatory proteins identified by molecular cloning. , 1987, Proceedings of the National Academy of Sciences of the United States of America.

[38]  C. H. Lin,et al.  Cloning and expression of the human erythropoietin gene. , 1985, Proceedings of the National Academy of Sciences of the United States of America.

[39]  L. Kedes,et al.  Evolution of the functional human beta-actin gene and its multi-pseudogene family: conservation of noncoding regions and chromosomal dispersion of pseudogenes , 1985, Molecular and cellular biology.

[40]  H. Sussman,et al.  Similarities between the transferrin receptor proteins on human reticulocytes and human placentae. , 1981, The Journal of biological chemistry.

[41]  H. Birnboim,et al.  A rapid alkaline extraction procedure for screening recombinant plasmid DNA. , 1979, Nucleic acids research.

[42]  R. Radloff,et al.  A dye-buoyant-density method for the detection and isolation of closed circular duplex DNA: the closed circular DNA in HeLa cells. , 1967, Proceedings of the National Academy of Sciences of the United States of America.

[43]  A. de la Chapelle,et al.  Mutations predisposing to hereditary nonpolyposis colorectal cancer. , 1997, Advances in cancer research.

[44]  Peter Beighton,et al.  de la Chapelle, A. , 1997 .

[45]  X Huang,et al.  Fast comparison of a DNA sequence with a protein sequence database. , 1996, Microbial & comparative genomics.

[46]  Y Xu,et al.  Recognizing exons in genomic sequence using GRAIL II. , 1994, Genetic engineering.

[47]  M H Skolnick,et al.  A probabilistic model for detecting coding regions in DNA sequences. , 1994, IMA journal of mathematics applied in medicine and biology.

[48]  M. Craxton Cosmid sequencing. , 1993, Methods in molecular biology.

[49]  T. Dexter,et al.  Erythropoietin and myeloid colony stimulating factors. , 1992, Trends in biotechnology.

[50]  J. Huppert Somatic cell hybrids. , 1983, Folia biologica.