Logics for Approximating Implication Problems of Saturated Conditional Independence

Random variables are declared complete whenever they must not admit missing data. Intuitively, the larger the set of complete random variables the closer the implication of saturated conditional independence statements is approximated. Two different notions of implication are studied. In the classical notion, a statement is implied jointly by a set of statements, the fixed set of random variables and its subset of complete random variables. For the notion of pure implication the set of random variables is left undetermined. A first axiomatization for the classical notion is established that distinguishes purely implied from classically implied statements. Axiomatic, algorithmic and logical characterizations of pure implication are established. The latter appeal to applications in which the existence of random variables is uncertain, for example, when statements are integrated from different sources, when random variables are unknown or when they shall remain hidden.

[1]  Y. Edmund Lien,et al.  On the Equivalence of Database Models , 1982, JACM.

[2]  J. Pearl,et al.  Logical and Algorithmic Properties of Conditional Independence and Graphical Models , 1993 .

[3]  Ronald Fagin,et al.  An Equivalence Between Relational Database Dependencies and a Fragment of Propositional Logic , 1981, JACM.

[4]  Nir Friedman,et al.  Learning Belief Networks in the Presence of Missing Values and Hidden Variables , 1997, ICML.

[5]  Sebastian Link,et al.  Saturated Conditional Independence with Fixed and Undetermined Sets of Incomplete Random Variables , 2014, UAI.

[6]  Sven Hartmann,et al.  Probabilistic Conditional Independence under Schema Certainty and Uncertainty , 2012, SUM.

[7]  David Maxwell Chickering,et al.  Efficient Approximations for the Marginal Likelihood of Bayesian Networks with Hidden Variables , 1997, Machine Learning.

[8]  Sebastian Link,et al.  Sound approximate reasoning about saturated conditional probabilistic independence under controlled uncertainty , 2013, J. Appl. Log..

[9]  Richard S. Zemel,et al.  Recommender Systems, Missing Data and Statistical Model Estimation , 2011, IJCAI.

[10]  Sven Hartmann,et al.  The implication problem of data dependencies over SQL table definitions: Axiomatic, algorithmic and logical characterizations , 2012, TODS.

[11]  Joseph Y. Halpern Reasoning about uncertainty , 2003 .

[12]  Sebastian Link,et al.  Appropriate inferences of data dependencies in relational databases , 2012, Annals of Mathematics and Artificial Intelligence.

[13]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[14]  Marco Schaerf,et al.  Tractable Reasoning via Approximation , 1995, Artif. Intell..

[15]  Christian Herrmann On the Undecidability of Implications Between Embedded Multivalued Database Dependencies , 1995, Inf. Comput..

[16]  S. Sullivant Gaussian conditional independence relations have no finite complete characterization , 2007, 0704.2847.

[17]  S. Lauritzen The EM algorithm for graphical association models with missing data , 1995 .

[18]  Chengqi Zhang,et al.  Cost-Sensitive Imputing Missing Values with Ordering , 2007, AAAI.

[19]  Nir Friedman,et al.  Probabilistic Graphical Models - Principles and Techniques , 2009 .

[20]  Sebastian Link Charting the completeness frontier of inference systems for multivalued dependencies , 2008, Acta Informatica.

[21]  Joachim Biskup Inferences of Multivalued Dependencies in Fixed and Undetermined Universes , 1980, Theor. Comput. Sci..

[22]  Marc Gyssens,et al.  Logical and algorithmic properties of stable conditional independence , 2010, Int. J. Approx. Reason..

[23]  Maurizio Lenzerini,et al.  The scientific legacy of Marco Cadoli in Artificial Intelligence , 2013, Intelligenza Artificiale.

[24]  Daniel Kahneman,et al.  Probabilistic reasoning , 1993 .

[25]  Dan Wu,et al.  On the implication problem for probabilistic conditional independency , 2000, IEEE Trans. Syst. Man Cybern. Part A.

[26]  Foster J. Provost,et al.  Handling Missing Values when Applying Classification Models , 2007, J. Mach. Learn. Res..

[27]  Moninder Singh,et al.  Learning Bayesian Networks from Incomplete Data , 1997, AAAI/IAAI.

[28]  A. Dawid Conditional Independence in Statistical Theory , 1979 .

[29]  Marc Gyssens,et al.  On the conditional independence implication problem: A lattice-theoretic approach , 2008, Artif. Intell..

[30]  Kamran Parsaye-Ghomi,et al.  Inferences involving embedded multivalued dependencies and transitive dependencies , 1980, SIGMOD '80.

[31]  Judea Pearl,et al.  Probabilistic reasoning in intelligent systems - networks of plausible inference , 1991, Morgan Kaufmann series in representation and reasoning.

[32]  Zvi Galil,et al.  An Almost Linear-Time Algorithm for Computing a Dependency Basis in a Relational Database , 1982, JACM.

[33]  Jeff Z. Pan,et al.  An Argument-Based Approach to Using Multiple Ontologies , 2009, SUM.

[34]  Sebastian Link Characterisations of multivalued dependency implication over undetermined universes , 2012, J. Comput. Syst. Sci..