Towards ontology-based semantic web from data-intensive web: A reverse engineering approach

The World Wide Web is a great success story in terms of both its number of users and the amount of information it now offers. However, most of this information has to be interpreted by humans; machine support is rather limited. The next generation of the web, the semantic web, seeks to make information more usable by machines by introducing a more rigorous structure based on ontologies. In this context, we propose a novel and integrated approach for the semi-automated migration of data-intensive web sites into an ontology-based semantic web, thereby making web content machine-understandable. Our approach is based on the idea that semantics can be extracted from the structures and instances of HTML forms, which are the most convenient interface for communicating with relational databases on the current web. These semantics are then exploited to help build the ontology.
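As a rough illustration of what "extracting the structure of an HTML form" could look like in practice, the following minimal sketch collects the named fields of each form in a page using Python's standard `html.parser` module. It is not the extraction procedure of the paper: the class `FormStructureParser`, the helper `extract_form_structures`, the sample form, and the reading of a form name as a candidate concept with its fields as candidate properties are all assumptions made for this example.

```python
from html.parser import HTMLParser


class FormStructureParser(HTMLParser):
    """Collects, for each <form>, the names and kinds of its data fields.

    Hypothetical helper for illustration only; not taken from the paper.
    """

    def __init__(self):
        super().__init__()
        self.forms = []       # one dict per completed form
        self._current = None  # form currently being parsed, if any

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "form":
            self._current = {"name": attrs.get("name") or "", "fields": []}
        elif tag in ("input", "select", "textarea") and self._current is not None:
            # Buttons carry no data, so they are skipped.
            if attrs.get("type") in ("submit", "reset", "button"):
                return
            kind = (attrs.get("type") or "text") if tag == "input" else tag
            self._current["fields"].append((attrs.get("name") or "", kind))

    def handle_endtag(self, tag):
        if tag == "form" and self._current is not None:
            self.forms.append(self._current)
            self._current = None


def extract_form_structures(html):
    """Return a list of {"name": ..., "fields": [(field_name, kind), ...]}."""
    parser = FormStructureParser()
    parser.feed(html)
    parser.close()
    return parser.forms


# Assumed sample page: a search form over a hypothetical "Book" relation.
SAMPLE = """
<form name="Book" action="/search">
  <input type="text" name="title">
  <input type="text" name="author">
  <select name="category"><option>Fiction</option></select>
  <input type="submit" value="Search">
</form>
"""

structures = extract_form_structures(SAMPLE)
for form in structures:
    print(form["name"], form["fields"])
```

Under the assumptions above, a later step might treat each form name as a candidate ontology concept and each field as a candidate property, to be validated semi-automatically against the form instances and the underlying database.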
