An X-ray on web-available XML schemas

XML has established itself as the most widely used standard for representing Web data. An XML schema can serve purposes similar to those of a database schema. Several languages exist for writing XML schemas, such as DTD and XSD. In this paper, we provide a general view, an X-ray, of Web-available XSD files by identifying which XSD constructs are used most and least frequently. Furthermore, we provide an evolutionary perspective by comparing results from XSD files collected in 2005 and in 2008, which also lets us draw some conclusions about apparent trends in XSD usage. The results of this study provide relevant information for developers of XML applications, tools, and algorithms in which the schema plays a distinguished role.
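To make the idea of measuring construct usage concrete, the following is a minimal sketch of how occurrences of XSD constructs could be tallied for a single schema file. It is an illustration only, not the authors' tooling: the function name `count_xsd_constructs` and the `SAMPLE` schema are hypothetical, and the sketch assumes that a "construct" corresponds to an element in the XML Schema namespace.

```python
import xml.etree.ElementTree as ET
from collections import Counter

# Namespace URI of the W3C XML Schema language.
XSD_NS = "http://www.w3.org/2001/XMLSchema"


def count_xsd_constructs(xsd_text: str) -> Counter:
    """Count occurrences of each element in the XSD namespace (hypothetical helper)."""
    root = ET.fromstring(xsd_text)
    prefix = "{" + XSD_NS + "}"
    counts = Counter()
    for node in root.iter():
        # ElementTree stores qualified names as "{namespace}local".
        if isinstance(node.tag, str) and node.tag.startswith(prefix):
            counts[node.tag[len(prefix):]] += 1
    return counts


# Hypothetical sample schema, used only to exercise the function.
SAMPLE = """<xs:schema xmlns:xs="http://www.w3.org/2001/XMLSchema">
  <xs:element name="book">
    <xs:complexType>
      <xs:sequence>
        <xs:element name="title" type="xs:string"/>
        <xs:element name="year" type="xs:gYear"/>
      </xs:sequence>
    </xs:complexType>
  </xs:element>
</xs:schema>"""

counts = count_xsd_constructs(SAMPLE)
for name, n in counts.most_common():
    print(name, n)
```

Summing such counters over a whole collection of schemas, once per collection year, would give the kind of frequency comparison the abstract describes, although the study itself may define and measure constructs differently.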
