A preventive approach to the quality of imported data in the context of clinical proteomics

In the biomedical field, proteomics faces a growing number of data sources and very large data volumes, driven by the spread of so-called high-throughput technologies. Because the data come from heterogeneous sources, their representation and content are heterogeneous as well. The data may also turn out to be incorrect, which leads to errors in the conclusions drawn from proteomic experiments. Our approach aims to guarantee the initial quality of the data at the moment they are imported into an information system dedicated to proteomics. It is based on coupling models that represent the sources and the proteomic system with ontologies used as mediators between those models. The checks we propose guarantee the validity of value domains, the semantics, and the consistency of the data at import time.
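As an illustration only, the three kinds of import-time checks named above (value domains, semantics, consistency) can be sketched in Python. This is a minimal sketch under stated assumptions, not the implementation described in the paper: the field names, the concepts, the permitted values, and the function `check_record` are all hypothetical, and the mediating ontology is reduced to a plain dictionary that maps source-specific field names to shared concepts.

```python
# Hypothetical mediating ontology: source-specific field names -> shared concept.
ONTOLOGY = {
    "tumour_stage": "TumorStage",
    "stade_tumoral": "TumorStage",
    "prot_conc": "ProteinConcentration",
}

# Hypothetical target model: one value-domain predicate per concept.
TARGET_DOMAINS = {
    "TumorStage": lambda v: v in {"T1", "T2", "T3", "T4"},
    "ProteinConcentration": lambda v: isinstance(v, (int, float)) and v >= 0,
}


def check_record(record):
    """Return a list of error messages for one source record (empty if it passes)."""
    errors = []
    seen = {}  # concept -> first value encountered for that concept
    for field, value in record.items():
        # Semantic check: the source field must map to a concept of the ontology.
        concept = ONTOLOGY.get(field)
        if concept is None:
            errors.append(f"semantics: no ontology concept for field '{field}'")
            continue
        # Consistency check: two fields mapped to the same concept must agree.
        if concept in seen:
            if seen[concept] != value:
                errors.append(f"consistency: conflicting values for {concept}")
            continue
        seen[concept] = value
        # Value-domain check: the value must belong to the target model's domain.
        if not TARGET_DOMAINS[concept](value):
            errors.append(f"domain: {value!r} is not valid for {concept}")
    return errors
```

Under these assumptions, a record is imported only when `check_record` returns an empty list; any message it returns identifies which of the three checks rejected the record before it reaches the target system.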
