Approximate Reasoning about Generalized Conditional Independence with Complete Random Variables

The implication problem of conditional statements about the independence of finitely many sets of random variables is studied in the presence of controlled uncertainty. Uncertainty refers to the possibility of missing data. As a control mechanism random variables can be declared complete, in which case data on these random variables cannot be missing. While the implication of conditional independence statements is not axiomatizable, a finite Horn axiomatization is established for the expressive class of saturated conditional independence statements under controlled uncertainty. Complete random variables allow us to balance the expressivity of sets of saturated statements with the efficiency of deciding their implication. This ability can soundly approximate reasoning in the absence of missing data. Delobel’s class of full first-order hierarchical database decompositions are generalized to the presence of controlled uncertainty, and their implication problem shown to be equivalent to that of saturated conditional independence.

[1]  G. Grätzer General Lattice Theory , 1978 .

[2]  Moninder Singh,et al.  Learning Bayesian Networks from Incomplete Data , 1997, AAAI/IAAI.

[3]  Sebastian Link,et al.  Reasoning about Saturated Conditional Independence Under Uncertainty: Axioms, Algorithms, and Levesque's Situations to the Rescue , 2013, AAAI.

[4]  Joseph Y. Halpern Reasoning about uncertainty , 2003 .

[5]  David Maxwell Chickering,et al.  Efficient Approximations for the Marginal Likelihood of Bayesian Networks with Hidden Variables , 1997, Machine Learning.

[6]  Gustavo E. A. P. A. Batista,et al.  An analysis of four missing data treatment methods for supervised learning , 2003, Appl. Artif. Intell..

[7]  Adnan Darwiche,et al.  Modeling and Reasoning with Bayesian Networks , 2009 .

[8]  Christian Herrmann Corrigendum to "On the undecidability of implications between embedded multivalued database dependencies" [Inform. and Comput. 122(1995) 221-235] , 2006, Inf. Comput..

[9]  Sven Hartmann,et al.  The implication problem of data dependencies over SQL table definitions: Axiomatic, algorithmic and logical characterizations , 2012, TODS.

[10]  Kamran Parsaye-Ghomi,et al.  Inferences involving embedded multivalued dependencies and transitive dependencies , 1980, SIGMOD '80.

[11]  Daniel Kahneman,et al.  Probabilistic reasoning , 1993 .

[12]  Dan Wu,et al.  On the implication problem for probabilistic conditional independency , 2000, IEEE Trans. Syst. Man Cybern. Part A.

[13]  Padhraic Smyth,et al.  From Data Mining to Knowledge Discovery in Databases , 1996, AI Mag..

[14]  Marc Gyssens,et al.  Logical and algorithmic properties of stable conditional independence , 2010, Int. J. Approx. Reason..

[15]  Sebastian Link,et al.  Independence in Database Relations , 2013, WoLLIC.

[16]  Dan Geiger,et al.  Logical and algorithmic properties of independence and their application to Bayesian networks , 1990, Annals of Mathematics and Artificial Intelligence.

[17]  Bo Thiesson,et al.  Accelerated Quantification of Bayesian Networks with Incomplete Data , 1995, KDD.

[18]  Christian Herrmann On the Undecidability of Implications Between Embedded Multivalued Database Dependencies , 1995, Inf. Comput..

[19]  S. Sullivant Gaussian conditional independence relations have no finite complete characterization , 2007, 0704.2847.

[20]  Jeff Z. Pan,et al.  An Argument-Based Approach to Using Multiple Ontologies , 2009, SUM.

[21]  Sven Hartmann,et al.  Probabilistic Conditional Independence under Schema Certainty and Uncertainty , 2012, SUM.

[22]  Claude Delobel,et al.  Normalization and hierarchical dependencies in the relational data model , 1978, TODS.

[23]  Paolo Atzeni,et al.  Functional Dependencies and Constraints on Null Values in Database Relations , 1986, Inf. Control..

[24]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[25]  Erich Grädel,et al.  Dependence and Independence , 2012, Stud Logica.

[26]  Y. Edmund Lien,et al.  On the Equivalence of Database Models , 1982, JACM.

[27]  J. Pearl,et al.  Logical and Algorithmic Properties of Conditional Independence and Graphical Models , 1993 .

[28]  Sebastian Link Propositional Reasoning about Saturated Conditional Probabilistic Independence , 2012, WoLLIC.

[29]  Carlo Zaniolo,et al.  Database relations with null values , 1982, J. Comput. Syst. Sci..

[30]  Alan Bundy,et al.  Towards Ontology Evolution in Physics , 2008, WoLLIC.

[31]  A. Dawid Conditional Independence in Statistical Theory , 1979 .

[32]  Marc Gyssens,et al.  On the conditional independence implication problem: A lattice-theoretic approach , 2008, Artif. Intell..

[33]  Judea Pearl,et al.  Probabilistic reasoning in intelligent systems - networks of plausible inference , 1991, Morgan Kaufmann series in representation and reasoning.

[34]  Zvi Galil,et al.  An Almost Linear-Time Algorithm for Computing a Dependency Basis in a Relational Database , 1982, JACM.

[35]  Linda C. van der Gaag,et al.  Stable Independence in Perfect Maps , 2005, UAI.

[36]  S. Lauritzen The EM algorithm for graphical association models with missing data , 1995 .

[37]  Nir Friedman,et al.  Learning Belief Networks in the Presence of Missing Values and Hidden Variables , 1997, ICML.

[38]  Sven Hartmann,et al.  Reasoning about functional and full hierarchical dependencies over partial relations , 2013, Inf. Sci..

[39]  Sebastian Link,et al.  Sound approximate reasoning about saturated conditional probabilistic independence under controlled uncertainty , 2013, J. Appl. Log..

[40]  Richard S. Zemel,et al.  Recommender Systems, Missing Data and Statistical Model Estimation , 2011, IJCAI.