Heart-specific genes revealed by expressed sequence tag (EST) sampling

BackgroundCardiovascular diseases are the primary cause of death worldwide; the identification of genes specifically expressed in the heart is thus of major biomedical interest. We carried out a comprehensive analysis of gene-expression profiles using expressed sequence tags (ESTs) to identify genes overexpressed in the human adult heart. The initial set of genes expressed in the heart was constructed by clustering and assembling ESTs from heart cDNA libraries. Expression profiles were then generated for each gene by counting their cognate ESTs in all libraries. Differential expression was assessed by applying a previously published statistical procedure to these profiles.ResultsWe identified 35 cardiac-specific genes overexpressed in the heart, some of which displayed significant coexpression. Some genes had no previously recognized cardiac function. Of the 35 genes, 32 were mapped back onto the human genome sequence. According to Online Mendelian Inheritance in Man (OMIM), five genes were previously known as heart-disease genes and one gene was located in the locus of a bleeding disorder. Analysis of the promoter regions of this collection of genes provides the first list of putative regulatory elements associated with differential cardiac expression.ConclusionThis study shows that ESTs are still a powerful tool to identify differentially expressed genes. We present a list of genes specifically expressed in the human heart, one of which is a candidate for a bleeding disorder. In addition, we provide the first set of putative regulatory elements, the combination of which appears correlated with heart-specific gene expression.

[1]  M. K. Atilla,et al.  A Further Study of Seminal Plasma: Lactate Dehydrogenase and Lactate Dehydrogenase-X Activities and Diluted Semen Absorbance , 1997, European journal of clinical chemistry and clinical biochemistry : journal of the Forum of European Clinical Chemistry Societies.

[2]  Ronald W. Davis,et al.  Quantitative Monitoring of Gene Expression Patterns with a Complementary DNA Microarray , 1995, Science.

[3]  J. Collado-Vides,et al.  Extracting regulatory sites from the upstream region of yeast genes by computational analysis of oligonucleotide frequencies. , 1998, Journal of molecular biology.

[4]  Xin Chen,et al.  TRANSFAC: an integrated system for gene expression regulation , 2000, Nucleic Acids Res..

[5]  D. Stekel,et al.  The comparison of gene expression from multiple cDNA libraries. , 2000, Genome research.

[6]  S. Gygi,et al.  Correlation between Protein and mRNA Abundance in Yeast , 1999, Molecular and Cellular Biology.

[7]  I. Jonassen,et al.  Predicting gene regulatory elements in silico on a genomic scale. , 1998, Genome research.

[8]  J. Seilhamer,et al.  A comparison of selected mRNA and protein abundances in human liver , 1997, Electrophoresis.

[9]  A. Orth,et al.  Large-scale analysis of the human and mouse transcriptomes , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[10]  Tatsuhiko Tsunoda,et al.  Estimating transcription factor bindability on DNA , 1999, Bioinform..

[11]  J. Claverie,et al.  The significance of digital gene expression profiles. , 1997, Genome research.

[12]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[13]  Jacques van Helden,et al.  Regulatory Sequence Analysis Tools , 2003, Nucleic Acids Res..

[14]  C. Lee,et al.  Identification of differentially expressed genes in cardiac hypertrophy by analysis of expressed sequence tags. , 2000, Genomics.

[15]  X. Huang,et al.  A contig assembly program based on sensitive detection of fragment overlaps. , 1992, Genomics.

[16]  S. Bortoluzzi,et al.  The human adult skeletal muscle transcriptional profile reconstructed by a novel computational approach. , 2000, Genome research.

[17]  C. Pilarsky,et al.  Exhaustive mining of EST libraries for genes differentially expressed in normal and tumour tissues. , 1999, Nucleic acids research.

[18]  S. Tsui,et al.  A catalogue of genes in the cardiovascular system as identified by expressed sequence tags. , 1994, Proceedings of the National Academy of Sciences of the United States of America.

[19]  M. Boguski,et al.  dbEST — database for “expressed sequence tags” , 1993, Nature Genetics.

[20]  H. Blum,et al.  [DNA chip technology]. , 1999, Deutsche medizinische Wochenschrift.

[21]  C. Watson,et al.  DNA chip technolgy , 1999 .

[22]  S. Bortoluzzi,et al.  A computational reconstruction of the adult human heart transcriptional profile. , 2000, Journal of molecular and cellular cardiology.

[23]  J. Felsenstein,et al.  A simulation comparison of phylogeny algorithms under equal and unequal evolutionary rates. , 1994, Molecular biology and evolution.

[24]  Philipp Bucher,et al.  The Eukaryotic Promoter Database, EPD: new entry types and links to gene expression data , 2002, Nucleic Acids Res..

[25]  Robert R. Sokal,et al.  A statistical method for evaluating systematic relationships , 1958 .

[26]  Michael Gribskov,et al.  Methods and Statistics for Combining Motif Match Scores , 1998, J. Comput. Biol..

[27]  J. Jurka Repbase update: a database and an electronic journal of repetitive elements. , 2000, Trends in genetics : TIG.

[28]  S. Bortoluzzi,et al.  Detecting differentially expressed genes in multiple tag sampling experiments: comparative evaluation of statistical tests. , 2001, Human molecular genetics.

[29]  T. Werner,et al.  Regulatory context is a crucial part of gene function. , 2002, Trends in genetics : TIG.

[30]  J. Claverie Computational methods for the identification of differential and coordinated gene expression. , 1999, Human molecular genetics.

[31]  Kousaku Okubo,et al.  Large scale cDNA sequencing for analysis of quantitative and qualitative aspects of gene expression , 1992, Nature Genetics.