Conceptual Modeling of Human Genome: Integration Challenges

While Information Systems (IS) principles have been successfully applied to the design, implementation and management of a diverse set of domains, the Bioinformatics domain in general, and the Genomic one in particular, often lacks a rigorous IS background based on a precise Conceptual Model in which the relevant concepts of the domain are properly defined. Instead, current genomic data repositories focus on the solution space, in the form of diverse, ad hoc databases that tend to be hard to manage, evolve and interconnect. Conceptual Modeling as a central strategy is therefore far removed from the ontologies of current biological data sources, which are heterogeneous, imprecise and too often inconsistent with one another. To address this problem, this chapter introduces the latest version of a concrete Conceptual Schema for the Human Genome (CSHG). Taking a holistic perspective, the CSHG focuses on the different genomic views that must be integrated and emphasizes the value of the approach for dealing appropriately with the challenge of correctly interpreting the human genome.
