Data integration standards in model organisms: from genotype to phenotype in the laboratory mouse

Abstract: The tremendous progress of the genome sequencing centers, combined with computational advances in algorithms for genome assembly and gene model prediction, provides the research community with valuable new resources in the form of complete, or nearly complete, genome sequences for a wide variety of organisms that serve as platforms for investigating biological systems. The challenge facing the bioinformatics community is how to integrate the rapidly emerging genomic data with experimental data, such as gene expression, protein interactions, cell processes and systems characteristics under select perturbations. Data integration is key to understanding biological systems at all levels because it brings together disparate types of data in formats that support effective data mining, pattern detection and hypothesis generation. Databases for model organisms are valuable sources of integrated data from the level of the genome to that of the phenotype. They promote data integration through the development and implementation of nomenclature standards, controlled vocabularies and ontologies that allow data from different organisms to be compared and contrasted.
