DNA, diseases and databases: disastrously deficient.

Recent progress in disease genetics and genome-related medicine has been substantial, with vast amounts of data being generated. However, this progress has not been matched by adequate database projects that gather and organize these data to enable their useful exploitation. This research area is complex, entailing core databases, locus-specific databases, national mutation databases, genotype-phenotype databases and patient databases--and much work is required to develop and properly integrate these various resources. To promote this, we present a timely overview of the field, emphasize its over-riding importance and discuss the disastrously deficient progress made so far. Many factors contribute to this slow progress (e.g. technological hurdles, publication requirements, the short-sighted and popularist research system). A lack of targeted funding is arguably the most fundamental problem, but one that can be solved.

[1]  Toshihiro Tanaka The International HapMap Project , 2003, Nature.

[2]  Hans-Dieter Pohlenz,et al.  PhenomicDB: a multi-species genotype/phenotype database for comparative phenomics , 2005, Bioinform..

[3]  J. Hall A clinician's plea , 2003, Nature Genetics.

[4]  Jeroen Aerssens,et al.  Data mining of public SNP databases for the selection of intragenic SNPs , 2002, Human mutation.

[5]  E. Lander The New Genomics: Global Views of Biology , 1996, Science.

[6]  Y. Kan,et al.  The same beta-globin gene mutation is present on nine different beta-thalassemia chromosomes in a Sardinian population. , 1987, Proceedings of the National Academy of Sciences of the United States of America.

[7]  R G H Cotton,et al.  The HUGO Mutation Database Initiative , 1998, The Pharmacogenomics Journal.

[8]  Sharon Marsh,et al.  SNP databases and pharmacogenetics: great start, but a long way to go , 2002, Human mutation.

[9]  Jian Su,et al.  Recognizing Names in Biomedical Texts: a Machine Learning Approach , 2004 .

[10]  D. Labie,et al.  Common haplotype dependency of high G gamma-globin gene expression and high Hb F levels in beta-thalassemia and sickle cell anemia patients. , 1985, Proceedings of the National Academy of Sciences of the United States of America.

[11]  Gregory D. Schuler,et al.  Database resources of the National Center for Biotechnology Information: update , 2004, Nucleic acids research.

[12]  Yan P. Yuan,et al.  HGVbase: a human sequence variation database emphasizing data quality and a broad spectrum of data sources , 2002, Nucleic Acids Res..

[13]  C R Scriver,et al.  PAHdb: A locus‐specific knowledgebase , 2000, Human mutation.

[14]  Ourania Horaitis,et al.  The challenge of documenting mutation across the genome: The human genome variation society approach , 2004, Human mutation.

[15]  G. Patrinos,et al.  Recording human globin gene variation. , 2004, Hemoglobin.

[16]  George P Patrinos,et al.  Hellenic National Mutation Database: a prototype database for mutations leading to inherited disorders in the Hellenic population , 2005, Human mutation.

[17]  R. Gerlai Phenomics: fiction or the future? , 2002, Trends in Neurosciences.

[18]  V. McKusick Mendelian inheritance in man , 1971 .

[19]  Pertti Aula,et al.  Database for the mutations of the Finnish disease heritage , 2002, Human mutation.

[20]  A F Brown,et al.  MuStaR™ and other software for locus‐specific mutation databases , 2000, Human mutation.

[21]  Webb Miller,et al.  Improvements in the HbVar database of human hemoglobin variants and thalassemia mutations for population and sequence variation studies , 2004, Nucleic Acids Res..

[22]  P. Stenson,et al.  Human Gene Mutation Database (HGMD®): 2003 update , 2003, Human mutation.

[23]  J. Long,et al.  Current limitations of SNP data from the public domain for studies of complex disorders: a test for ten candidate genes for obesity and osteoporosis , 2004, BMC Genetics.

[24]  A. Cuticchia,et al.  Arab genetic disease database (AGDDB): A population‐specific clinical and mutation database , 2002, Human mutation.

[25]  N E Morton,et al.  Genetic epidemiology of single-nucleotide polymorphisms. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[26]  S. Lewis,et al.  The generic genome browser: a building block for a model organism system database. , 2002, Genome research.

[27]  R. Altman,et al.  PharmGKB: the pharmacogenetics and pharmacogenomics knowledge base. , 2005, Methods in molecular biology.

[28]  Russ B. Altman,et al.  PharmGKB: the Pharmacogenetics Knowledge Base , 2002, Nucleic Acids Res..

[29]  Andrew P Feinberg,et al.  An integrated epigenetic and genetic approach to common human disease. , 2004, Trends in genetics : TIG.

[30]  G. Patrinos,et al.  Aγ‐haplotypes: A new group of genetic markers for thalassemic mutations inside the 5′ regulatory region of the human Aγ‐globin gene , 2001 .

[31]  Limsoon Wong,et al.  Accomplishments and challenges in literature data mining for biology , 2002, Bioinform..

[32]  Jocelyn Kaiser,et al.  Population Databases Boom, From Iceland to the U.S. , 2002, Science.

[33]  D. Fredman,et al.  HGVbase: a curated resource describing human DNA variation and phenotype relationships , 2004, Nucleic Acids Res..

[34]  C. Sabatti,et al.  The Human Phenome Project , 2003, Nature Genetics.

[35]  L. Stein Creating a bioinformatics nation , 2002, Nature.

[36]  E. Lander,et al.  On the allelic spectrum of human disease. , 2001, Trends in genetics : TIG.

[37]  Ourania Horaitis,et al.  Time for a unified system of mutation description and reporting: a review of locus-specific mutation databases. , 2002, Genome research.

[38]  Alan F. Scott,et al.  Online Mendelian Inheritance in Man (OMIM), a knowledgebase of human genes and genetic disorders , 2002, Nucleic Acids Res..

[39]  J. Naylor,et al.  Mendelian inheritance in man: A catalog of human genes and genetic disorders , 1996 .

[40]  C. R. Scriver,et al.  After the genome—the phenome? , 2004, Journal of Inherited Metabolic Disease.

[41]  P. Stenson,et al.  Human Gene Mutation Database (HGMD , 2003 .

[42]  George P Patrinos,et al.  HbVar: A relational database of human hemoglobin variants and thalassemia mutations at the globin gene server , 2002, Human mutation.

[43]  Alan F. Scott,et al.  Online Mendelian Inheritance in Man (OMIM), a knowledgebase of human genes and genetic disorders , 2004, Nucleic Acids Res..

[44]  Jung-Hsien Chiang,et al.  GIS: a biomedical text-mining system for gene information discovery , 2004, Bioinform..

[45]  Russ B Altman,et al.  PharmGKB: the pharmacogenetics and pharmacogenomics knowledge base. , 2005, Methods in molecular biology.