A DTD or XML schema in its current textual form commonly lacks clarity and readability, therefore erroneous, poor quality design and usage are inevitable. A canonical conceptual model for XML documents will provide an effective mean of designing XML documents. The DTD is an early standard for XML and used in legacy systems widely. This paper presents an approach for converting a DTD to a conceptual model. By using the conceptual model, users can restructure the existed XML documents, improve its quality, and integrate various XML documents. In our approach, the conceptual model is represented using standard UML class notation, annotated with a few additional constraints, and the core of the converting process is a set of transformation rules that consider not only the DTD structure but also its semantics in order to generate the corresponding elements, especially, the inheritance structure can be extracted from the DTD structure.
[1]
David J. DeWitt,et al.
Relational Databases for Querying XML Documents: Limitations and Opportunities
,
1999,
VLDB.
[2]
Alexandra Poulovassilis,et al.
A Semantic Approach to Integrating XML and Structured Data Sources
,
2001,
CAiSE.
[3]
Andrew Goodchild,et al.
UML and XML Schema
,
2002,
Australasian Database Conference.
[4]
Rainer Eckstein,et al.
XML Conceptual Modeling Using UML
,
2000,
ER.
[5]
Ivar Jacobson,et al.
The unified modeling language reference manual
,
2010
.
[6]
Arnaud Sahuguet.
Everything You Ever Wanted to Know About DTDs, But Were Afraid to Ask
,
2000,
WebDB.