A Generic Solution for Automated Collecting and Integration of Biological Data from Web Sources

We present here a work dealing with automated collecting and integration of data along a user-defined scenario. Aspects such as query construction, query submission, parsing of returned document, filtering of desired data and storing them in a structured document have been considered as well as the chaining between the various steps of the scenario. Automation of the process allows to refresh the data in a time-saving manner in order to take into account the frequent changes in source contents. A configuration module distinct from the execution module allows to modify the scenario steps according to user preferences and/or source changes.

[1]  Marie-Dominique Devignes,et al.  Collecte et intégration de données biologiques hétérogènes sur le web , 2002, Ingénierie des Systèmes d Inf..

[2]  Louiqa Raschid,et al.  Optimized seamless integration of biomolecular data , 2001, Proceedings 2nd Annual IEEE International Symposium on Bioinformatics and Bioengineering (BIBE 2001).

[3]  Val Tannen,et al.  K2/Kleisli and GUS: Experiments in integrated access to genomic data sources , 2001, IBM Syst. J..