Re-engineering Web Application towards Linked Data: a Model-Based Approach

Nowadays, the World Wide Web is a vast network of information resources held in documents such as HTML and PHP pages. Most of these resources are intended for consumption, whether by humans or by machines (i.e., programs). The Web keeps growing with the emergence of new technologies such as Web services, mobile applications and Web applications. These technologies manipulate data in multiple formats, often in ways that are hidden from users or unusable by them. In parallel, there is a growing demand for direct access to data on the Web, a need that the current Web cannot meet. One answer is to move to the Semantic Web by re-engineering classic Web applications into RDF Linked Data, since in the Semantic Web data are represented in RDF, which makes them directly available to users. In this paper, we propose a model-based approach to transform Web applications into semantic ones. This is done by extracting data from the Web applications and transforming them into RDF. The data are extracted from HTML files, in particular from tables, lists and links. The re-engineering process is supported by model-driven engineering. It takes as input: (1) the input file containing the data to transform, extracted from the Web application; (2) two meta-models, namely a source meta-model (for HTML) and a target meta-model (for RDF); and (3) the transformation rules. Executing this process generates an output file in RDF, one of the most important technologies of the Semantic Web, also known as the Web of Data.
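To make the HTML-to-RDF step concrete, the following minimal sketch extracts the rows of an HTML table and emits one RDF triple per cell in N-Triples syntax. It is an illustration only and not the model-driven toolchain of the paper: the meta-models and transformation rules are replaced here by a hand-written mapping, and the `http://example.org/` namespace, the `TableExtractor` class and the `table_to_ntriples` function are assumed names introduced for the example. It also assumes a single, non-nested table whose first row holds the column headers.

```python
from html.parser import HTMLParser
from urllib.parse import quote

# Hypothetical namespace, chosen only for this illustration.
BASE = "http://example.org/"


class TableExtractor(HTMLParser):
    """Collects the cell text of every <tr> as a list of strings."""

    def __init__(self):
        super().__init__()
        self.rows = []     # completed rows
        self._row = None   # row being read, or None outside <tr>
        self._cell = None  # text chunks of the cell being read

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._cell = []

    def handle_data(self, data):
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._cell is not None:
            self._row.append("".join(self._cell).strip())
            self._cell = None
        elif tag == "tr" and self._row is not None:
            self.rows.append(self._row)
            self._row = None


def table_to_ntriples(html):
    """Map each body row to a subject and each header to a predicate."""
    parser = TableExtractor()
    parser.feed(html)
    parser.close()
    if not parser.rows:
        return []
    header, *body = parser.rows
    triples = []
    for i, row in enumerate(body, start=1):
        subject = f"<{BASE}row/{i}>"
        for name, value in zip(header, row):
            slug = quote(name.lower().replace(" ", "_"))
            predicate = f"<{BASE}prop/{slug}>"
            # Escape characters that are not allowed raw in a literal.
            literal = (value.replace("\\", "\\\\")
                            .replace('"', '\\"')
                            .replace("\n", "\\n"))
            triples.append(f'{subject} {predicate} "{literal}" .')
    return triples


if __name__ == "__main__":
    sample = (
        "<table>"
        "<tr><th>Name</th><th>City</th></tr>"
        "<tr><td>Ada</td><td>London</td></tr>"
        "</table>"
    )
    for line in table_to_ntriples(sample):
        print(line)
```

In this sketch each row becomes a subject and each column header a predicate, which is only one possible mapping; lists and links would need analogous rules of their own.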
