Distributed Problem Solving Environment Dedicated to DNA Sequence Annotation

Genomic sequence analysis draws on techniques from several fields to extract biologically relevant objects (genes, regulatory signals, etc.) from raw DNA sequences. Analysis methods from domains such as statistics, sequence alignment, and pattern matching have been developed and gathered into program libraries. These libraries are meant to help biologists manipulate data, but in practice they have proved too difficult to use, because handling them efficiently requires background knowledge. The ImaGene system was proposed to address this problem. ImaGene is built on a generic task model that makes it possible to model DNA sequence analysis methods and to execute them using existing shell scripts, binaries, and specialized libraries. The system lets users manipulate these methods and presents a synthesis of the results in a cartographic interface, where the biologist can assess their biological relevance and annotate DNA sequences. To take advantage of existing analysis methods and specialized libraries and to simplify access to them, we have extended ImaGene so that it can use distributed methods and binaries. The chosen solution transfers task code from server sites to client sites; unlike the approach adopted in Java, however, the data is processed at the server site when necessary. Distributing the system has brought to light several problems, such as managing different versions of methods and handling tasks that have recently been decomposed into new sub-tasks.
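
The distribution scheme summarized above can be illustrated with a minimal sketch. It is not ImaGene's implementation: the `Task`, `Server`, and `Client` classes, the `server_only` flag, the integer version numbers, and the toy analysis methods are all hypothetical names introduced here. The sketch assumes that a client fetches task definitions from a server, refreshes its local copy when a newer version is published, runs tasks locally by default, and sends the data to the server only for tasks marked as server-side.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Optional


@dataclass
class Task:
    """Hypothetical task: either an elementary method or a list of sub-tasks."""
    name: str
    version: int
    run: Optional[Callable[[str], dict]] = None
    subtasks: List[str] = field(default_factory=list)
    server_only: bool = False  # assumption: such tasks are computed at the server


class Server:
    """Hypothetical server site holding the latest version of each task."""

    def __init__(self) -> None:
        self._tasks: Dict[str, Task] = {}

    def publish(self, task: Task) -> None:
        current = self._tasks.get(task.name)
        if current is None or task.version > current.version:
            self._tasks[task.name] = task

    def fetch(self, name: str) -> Task:
        return self._tasks[name]

    def execute(self, name: str, sequence: str) -> dict:
        return self._tasks[name].run(sequence)


class Client:
    """Hypothetical client site that caches transferred task code."""

    def __init__(self, server: Server) -> None:
        self.server = server
        self.cache: Dict[str, Task] = {}

    def _resolve(self, name: str) -> Task:
        remote = self.server.fetch(name)
        cached = self.cache.get(name)
        if cached is None or cached.version < remote.version:
            self.cache[name] = remote  # replace a stale local copy
        return self.cache[name]

    def solve(self, name: str, sequence: str) -> dict:
        task = self._resolve(name)
        if task.subtasks:  # composite task: solve each sub-task and merge results
            results: dict = {}
            for sub in task.subtasks:
                results.update(self.solve(sub, sequence))
            return results
        if task.server_only:  # the data goes to the server, not the code to the client
            return self.server.execute(name, sequence)
        return task.run(sequence)


def gc_content(seq: str) -> dict:
    return {"gc": (seq.count("G") + seq.count("C")) / len(seq)}


def find_start(seq: str) -> dict:
    return {"atg_pos": seq.find("ATG")}


server = Server()
server.publish(Task("gc_content", 1, run=gc_content))
server.publish(Task("find_start", 1, run=find_start, server_only=True))
server.publish(Task("annotate", 1, subtasks=["gc_content", "find_start"]))

client = Client(server)
result = client.solve("annotate", "ATGCGC")

# Publishing a newer version makes the client refresh its cached copy on next use.
server.publish(Task("gc_content", 2, run=lambda s: {"gc": round(gc_content(s)["gc"], 2)}))
result_v2 = client.solve("annotate", "ATGCGC")
```

In this sketch the version check happens on every call, which keeps the client simple at the cost of one round trip per task. A real system would likely need a finer policy, particularly when a task is re-decomposed into sub-tasks that the client has never seen.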
