Schema matching technique for heterogeneous web database

The Search engine, as one of the most popular approaches of information finding tools, has been developed for finding information from internet user. In the web search, the information is in unstructured and structured format. Structured format is in the form of semantic and schema heterogeneity. Several shortcomings are associated with existing Web database in relation to the schema. In the commercial world, the various schemas matching techniques are developed. Though they are helpful in searching information up to some extent, but still they are semi-automatic and require various steps to achieve its objective. The purpose of schema matching is to automatically recognize similar elements of the assorted schemas and generate mapping expression to formulate on a single schema. We can handle the problem by two things such as: a. We construct either name of schema element and structure of the schema or formal ontology and mapping process. b. Searching mechanisms are used to narrow down our search and display the list of matched documents. Formal ontology and Mapping Process are needed for schema matching. In this context an Instance Based search system is proposed for finding more accurate data. In order to implement the transformation code, different kind of language (HTML, XML, RDF) is already used. This paper addresses the heterogeneity problems between the schemas. The problem is tackled by developing mapping language, which is able to handle with schema heterogeneity problems. These mapping strategies constraints are expressed by the language such as the source to target and target dependencies. It displays the list of matched documents through which user can get an accurate and effective information from the raised query.

[1]  Cecil Eng Huang Chua,et al.  Instance-based attribute identification in database integration , 2003, The VLDB Journal.

[2]  Catriel Beeri,et al.  A Proof Procedure for Data Dependencies , 1984, JACM.

[3]  Ronald Fagin,et al.  Data exchange: semantics and query answering , 2005, Theor. Comput. Sci..

[4]  Marko Smiljanic,et al.  XML schema matching : balancing efficiency and effectiveness by means of clustering , 2006 .

[5]  Jeffrey F. Naughton,et al.  On schema matching with opaque column names and data values , 2003, SIGMOD '03.

[6]  Erhard Rahm,et al.  Generic Schema Matching with Cupid , 2001, VLDB.

[7]  Wenfei Fan,et al.  Putting context into schema matching , 2006, VLDB.

[8]  Clement T. Yu,et al.  WISE-Integrator: An Automatic Integrator of Web Search Interfaces for E-Commerce , 2003, VLDB.

[9]  Felix Naumann,et al.  Schema matching using duplicates , 2005, 21st International Conference on Data Engineering (ICDE'05).

[10]  Erhard Rahm,et al.  COMA - A System for Flexible Combination of Schema Matching Approaches , 2002, VLDB.

[11]  Paolo Papotti,et al.  Scalable data exchange with functional dependencies , 2010, Proc. VLDB Endow..

[12]  Chris Clifton,et al.  Semantic Integration in Heterogeneous Databases Using Neural Networks , 1994, VLDB.

[13]  Ronald Fagin,et al.  Towards a theory of schema-mapping optimization , 2008, PODS.

[14]  Wei-Ying Ma,et al.  Instance-based Schema Matching for Web Databases by Domain-specific Query Probing , 2004, VLDB.