Soft Cardinality Constraints on XML Data - How Exceptions Prove the Business Rule

We introduce soft cardinality constraints which need to be satisfied on average only, and thus permit violations in a controlled manner. Starting from a highly expressive but intractable class, we establish a fragment that is maximal with respect to both expressivity and efficiency. More precisely, we characterise the associated implication problem axiomatically and develop a low-degree polynomial time decision algorithm. Any increase in expressivity of our fragment results in coNP-hardness of the implication problem. Finally, we extensively test the performance of our algorithm. The performance evaluation provides first-hand evidence that reasoning about expressive notions of soft cardinality constraints on XML data is practically efficient and scales well. Our results unleash soft cardinality constraints on real-world XML practice, where a little more semantics makes applications a lot more effective in contexts where exceptions to common rules may occur.

[1]  Wenfei Fan,et al.  Keys for XML , 2002, Comput. Networks.

[2]  Dan Suciu,et al.  Database and XML Technologies , 2004, Lecture Notes in Computer Science.

[3]  Sven Hartmann,et al.  Efficient reasoning about a robust XML key fragment , 2009, TODS.

[4]  Wenfei Fan,et al.  On the Complexity of Verifying Consistency of XML Specifications , 2008, SIAM J. Comput..

[5]  Sven Hartmann,et al.  Numerical constraints on XML data , 2010, Inf. Comput..

[6]  Sven Hartmann,et al.  Soft Constraints and Heuristic Constraint Correction in Entity-Relationship Modelling , 2001, Semantics in Databases.

[7]  Tok Wang Ling,et al.  Conceptual Modeling - ER 2011, 30th International Conference, ER 2011, Brussels, Belgium, October 31 - November 3, 2011. Proceedings , 2011, ER.

[8]  Pradeep K. Khosla,et al.  Mechanisms for detecting and handling timing errors , 1997, Commun. ACM.

[9]  Cong Yu,et al.  XML schema refinement through redundancy detection and normalization , 2008, The VLDB Journal.

[10]  Bernhard Thalheim,et al.  Semantics in Databases , 2001, Semantics in Databases.

[11]  Sven Hartmann,et al.  Performance Analysis of Algorithms to Reason about XML Keys , 2012, DEXA.

[12]  Wenfei Fan,et al.  Keys with Upward Wildcards for XML , 2001, DEXA.

[13]  Frank Dignum,et al.  The Role of Deontic Logic in the Specification of Information Systems , 1998, Logics for Databases and Information Systems.

[14]  Wenfei Fan,et al.  Conditional Functional Dependencies for Data Cleaning , 2007, 2007 IEEE 23rd International Conference on Data Engineering.

[15]  Kenneth N. Brown Soft Consistencies for Weighted CSPs , 2003 .

[16]  Angelo Montanari,et al.  From Entity Relationship to XML Schema: a Graph-Theoretic Approach (Extended Abstract) , 2009, SEBD.

[17]  Alun D. Preece,et al.  A Semantic web approach to handling soft constraints in virtual organisations , 2008, Electron. Commer. Res. Appl..

[18]  Sven Hartmann,et al.  A Precious Class of Cardinality Constraints for Flexible XML Data Processing , 2011, ER.

[19]  Bernhard Thalheim Integrity Constraints in (Conceptual) Database Models , 2008, The Evolution of Conceptual Modeling.

[20]  Wenfei Fan,et al.  What's Hard about XML Schema Constraints? , 2002, DEXA.

[21]  Roland H. Kaschek,et al.  On the evolution of conceptual modeling , 2008, The Evolution of Conceptual Modeling.