Design and development of a genomic information system to manage breast cancer data

The heterogeneity and dispersion of genomic data that currently exists between the multiple genomic databases is a big problem for the geneticists when they look for information for their genomic diagnosis. This problem is especially important when it is referred to genetic data about breast cancer due to the large amount of data available about this disease caused by the high incidence in society. The work in this Doctoral Thesis expects to solve this problem by analyzing all available information about breast cancer in the databases and integrating it into an information system created following the Conceptual Modeling rules. With this information system the data about breast cancer will be stored in one efficient and well-structured database, making it easier for geneticists searching for information about this disease.

[1]  Toshio Kojima,et al.  The phenotype and genotype experiment object model (PaGE‐OM): a robust data structure for information related to DNA variation , 2009, Human mutation.

[2]  P. Stenson,et al.  The Human Gene Mutation Database: 2008 update , 2009, Genome Medicine.

[3]  Alan F. Scott,et al.  Online Mendelian Inheritance in Man (OMIM), a knowledgebase of human genes and genetic disorders , 2002, Nucleic Acids Res..

[4]  Anne E. Trefethen,et al.  Toward interoperable bioscience data , 2012, Nature Genetics.

[5]  Oscar Pastor,et al.  Conceptual Modeling of Human Genome Mutations - A Dichotomy Between what we Have and What we Should Have , 2010, BIOINFORMATICS.

[6]  Mingming Jia,et al.  COSMIC (the Catalogue of Somatic Mutations in Cancer): a resource to investigate acquired mutations in human cancer , 2009, Nucleic Acids Res..

[7]  Mingming Jia,et al.  COSMIC: mining complete cancer genomes in the Catalogue of Somatic Mutations in Cancer , 2010, Nucleic Acids Res..

[8]  Elizabeth M. Smigielski,et al.  dbSNP: the NCBI database of genetic variation , 2001, Nucleic Acids Res..

[9]  Ian N M Day,et al.  dbSNP in the detail and copy number complexities , 2010, Human mutation.

[10]  Csilla Szabo,et al.  The Breast Cancer Information Core: Database design, structure, and scope , 2000, Human mutation.

[11]  V. McKusick Mendelian Inheritance in Man and Its Online Version, OMIM , 2007, The American Journal of Human Genetics.

[12]  P. Stenson,et al.  Human Gene Mutation Database—A biomedical information and research resource , 2000, Human mutation.

[13]  Ewa Deelman,et al.  New tools and methods for direct programmatic access to the dbSNP relational database , 2010, Nucleic Acids Res..

[14]  C. Sallée,et al.  Locus‐specific databases: from ethical principles to practice , 2005, Human mutation.

[15]  Carole A. Goble,et al.  Conceptual modelling of genomic information , 2000, Bioinform..

[16]  Oscar Pastor,et al.  Enforcing Conceptual Modeling to improve the understanding of human genome , 2010, 2010 Fourth International Conference on Research Challenges in Information Science (RCIS).

[17]  Oliver Hofmann,et al.  ISA software suite: supporting standards-compliant experimental annotation and enabling curation at the community level , 2010, Bioinform..

[18]  I. Fokkema,et al.  LOVD: Easy creation of a locus‐specific sequence variation database using an “LSDB‐in‐a‐box” approach , 2005, Human mutation.

[19]  Chris Mungall,et al.  Genome-Wide Analysis of Human Disease Alleles Reveals That Their Locations Are Correlated in Paralogous Proteins , 2008, PLoS Comput. Biol..

[20]  Oscar Pastor,et al.  Conceptual Modeling Meets the Human Genome , 2008, ER.

[21]  J. Vadgama,et al.  BRCA1 and BRCA2 gene mutation analysis: visit to the Breast Cancer Information Core (BIC). , 1999, Oncology research.

[22]  M Krawczak,et al.  The human gene mutation database , 1998, Nucleic Acids Res..

[23]  Mingming Jia,et al.  Data mining using the Catalogue of Somatic Mutations in Cancer BioMart , 2011, Database J. Biol. Databases Curation.