An automated reasoning framework for translational research

In this paper we propose a novel approach to the design and implementation of knowledge-based decision support systems for translational research, specifically tailored to the analysis and interpretation of data from high-throughput experiments. Our approach is based on a general epistemological model of the scientific discovery process that provides a well-founded framework for integrating experimental data with preexisting knowledge and with automated inference tools. In order to demonstrate the usefulness and power of the proposed framework, we present its application to Genome-Wide Association Studies, and we use it to reproduce a portion of the initial analysis performed on the well-known WTCCC dataset. Finally, we describe a computational system we are developing, aimed at assisting translational research. The system, based on the proposed model, will be able to automatically plan and perform knowledge discovery steps, to keep track of the inferences performed, and to explain the obtained results.

[1]  M Stefanelli,et al.  NEOANEMIA: a knowledge-based system emulating diagnostic reasoning. , 1990, Computers and biomedical research, an international journal.

[2]  Mark M Iles,et al.  What Can Genome-Wide Association Studies Tell Us about the Genetics of Common Disease , 2008, PLoS genetics.

[3]  Tao Xu,et al.  Pegasys: software for executing and integrating analyses of biological sequences , 2004, BMC Bioinformatics.

[4]  Simon C. Potter,et al.  Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls , 2007, Nature.

[5]  Riccardo Bellazzi,et al.  Multi-Criteria Decision Making Approaches for Quality Control of Genome-Wide Association Studies , 2009, Summit on translational bioinformatics.

[6]  Riccardo Bellazzi,et al.  Therapy Planning by Combining Ai and Decision Theoretic Techniques , 1989, AIME.

[7]  E. Shortliffe Clinical decision-support systems , 1990 .

[8]  H. Simon,et al.  Models of Discovery : and other topics in the methods of science , 1977 .

[9]  Martijn J. Schuemie,et al.  Structuring and extracting knowledge for the support of hypothesis generation in molecular biology , 2009, BMC Bioinformatics.

[10]  Mark D. Wilkinson,et al.  BioMOBY: An Open Source Biological Web Services Proposal , 2002, Briefings Bioinform..

[11]  P. Argos,et al.  SRS: information retrieval system for molecular biology data banks. , 1996, Methods in enzymology.

[12]  L. Magnani Abduction, Reason, and Science. Process of Discovery and Explanation , 2001 .

[13]  D. Reich,et al.  Principal components analysis corrects for stratification in genome-wide association studies , 2006, Nature Genetics.

[14]  M Stefanelli,et al.  Medical diagnostic reasoning: Epistemological modeling as a strategy for design of computer-based consultation programs , 1993, Theoretical medicine.

[15]  Martin Senger,et al.  BioMoby extensions to the Taverna workflow management and enactment software , 2006, BMC Bioinformatics.

[16]  Anna L. Gloyn,et al.  Type 2 Diabetes Susceptibility Gene TCF7L2 and Its Role in β-Cell Function , 2009, Diabetes.

[17]  Heiko Schoof,et al.  BioMOBY Successfully Integrates Distributed Heterogeneous Bioinformatics Web Services. The PlaNet Exemplar Case1 , 2005, Plant Physiology.

[18]  Liz Sonenberg,et al.  Keeping the patient asleep and alive: Towards a computational cognitive model of disturbance management in anaesthesia , 2007, Cognitive Systems Research.

[19]  Manuel A. R. Ferreira,et al.  PLINK: a tool set for whole-genome association and population-based linkage analyses. , 2007, American journal of human genetics.

[20]  Damian Smedley,et al.  BioMart – biological queries made easy , 2009, BMC Genomics.

[21]  Ken E. Whelan,et al.  The Automation of Science , 2009, Science.

[22]  T. Jin,et al.  The Wnt signaling pathway effector TCF7L2 and type 2 diabetes mellitus. , 2008, Molecular endocrinology.

[23]  Olivier Poch,et al.  Knowledge-based expert systems and a proof-of-concept case study for multiple sequence alignment construction and analysis , 2009, Briefings Bioinform..

[24]  Angelo Nuzzo,et al.  Phenotypic and genotypic data integration and exploration through a web-service architecture , 2009, BMC Bioinformatics.

[25]  C. Peirce,et al.  Philosophical Writings of Peirce , 1955 .

[26]  Ezio Bartocci,et al.  BioWMS: a web-based Workflow Management System for bioinformatics , 2007, BMC Bioinformatics.

[27]  Matthew R. Pocock,et al.  Taverna: a tool for the composition and enactment of bioinformatics workflows , 2004, Bioinform..

[28]  T. Oinn,et al.  Soaplab - a unified Sesame door to analysis tools , 2003 .

[29]  Blaz Zupan,et al.  Orange: From Experimental Machine Learning to Interactive Data Mining , 2004, PKDD.

[30]  Paolo Romano,et al.  Automation of in-silico data analysis processes through workflow management systems , 2007, Briefings Bioinform..

[31]  Angelo Nuzzo,et al.  A Dynamic Query System for Supporting Phenotype Mining in Genetic Studies , 2007, MedInfo.

[32]  M. Ramoni,et al.  An epistemological framework for medical knowledge-based systems , 1992, IEEE Trans. Syst. Man Cybern..

[33]  G. Schuler,et al.  Entrez: molecular biology database and retrieval system. , 1996, Methods in enzymology.

[34]  Angelo Nuzzo,et al.  Genephony: a knowledge management tool for genome-wide research , 2009, BMC Bioinformatics.