An Algorithm for Extracting Referential Integrity Relations Using Similarity during RDB-to-XML Translation

XML is rapidly becoming technologies for information exchange and representation. It causes many research issues such as semantic modeling methods, conversion for interoperability with other models, and so on. Especially, the most important issue in practical area is how to achieve the interoperability between XML model and relational database model. Until now, many methods have been proposed to achieve it. However, several problems still remain. Most of all, existing methods do not consider implicit referential integrity relations, so it causes loss of information. This paper proposes an algorithm for extracting referential integrity relations during RDB to XML translation. The key point of our method is how to find implicit referential integrity relations among columns which have different names to represent the same semantic. To resolve it, we define an enhanced extraction algorithm which based on a widely used ontology, WordNet. The proposed algorithm can reduce an extraction time among comparison columns in RDB tables and prevent loss of information.

[1]  Dongwon Lee,et al.  NeT & CoT: translating relational schemas to XML schemas using semantic constraints , 2002, CIKM '02.

[2]  Andrew W. Moore,et al.  Internet traffic classification using bayesian analysis techniques , 2005, SIGMETRICS '05.

[3]  George A. Miller,et al.  WordNet: A Lexical Database for English , 1995, HLT.

[4]  Gerhard Goos,et al.  Computer Science Today: Recent Trends and Developments , 1995 .

[5]  Renata Teixeira,et al.  Traffic classification on the fly , 2006, CCRV.

[6]  Graeme Hirst,et al.  Semantic distance in WordNet: An experimental, application-oriented evaluation of five measures , 2004 .

[7]  Jinhyung Kim,et al.  An Algorithm for Automatic Inference of Referential Integrities During Translation from Relational Database to XML Schema , 2005, CIS.

[8]  Wesley W. Chu,et al.  Effective Schema Conversions between XML and Relational Models , 2002 .

[9]  Wim Dehaene,et al.  A mixed abstraction level co-simulation case study using SystemC for system on chip verification , 2003, 2003 Design, Automation and Test in Europe Conference and Exhibition.

[10]  Anthony McGregor,et al.  Flow Clustering Using Machine Learning Techniques , 2004, PAM.

[11]  Sebastian Zander,et al.  A preliminary performance comparison of five machine learning algorithms for practical IP traffic flow classification , 2006, CCRV.

[12]  Dongwon Lee,et al.  Nesting-Based Relational-to-XML Schema Translation , 2001, International Workshop on the Web and Databases.