Information Dependency and Its Applications

Independence is a basic concept of probability theory and statistics. In a lot of elds of sciences, dependency of di erent variables is gained lots of attention from scientists. A measure, named information dependency, is proposed to express the dependency of a group of random variables. This measure is de ned as the Kullback-Leibler divergence of a joint distribution with respect to a product-marginal distribution of these random variables. In the bivariate case, this measure is known as mutual information of two random variables. Thus, the measure information dependency has a strong relationship with the Information Theory. The thesis aims to give a thorough study of the information dependency from both mathematical and practical viewpoints. Concretely, we would like to research three following problems: i. Proving that the information dependency is a useful tool to express the dependency of a group of random variables by comparing it with other measures of dependency. ii. Studying the methods to estimate the information dependency based on the samples of a group of random variables. iii. Investigating how the Independent Component Analysis problem, an interesting problem in statistics, can be solved using information dependency.

[1]  Olivier Rioul A Simple Proof of the Entropy-Power Inequality via Properties of Mutual Information , 2007, 2007 IEEE International Symposium on Information Theory.

[2]  I. Vajda,et al.  Convex Statistical Distances , 2018, Statistical Inference for Engineers and Data Scientists.

[3]  A. Kraskov,et al.  Estimating mutual information. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[4]  Jean-François Cardoso High-Order Constrasts for Independent Component Analysis , 1999, Neural Comput..

[5]  Christian Jutten,et al.  Blind separation of sources, part I: An adaptive algorithm based on neuromimetic architecture , 1991, Signal Process..

[6]  Nikos D. Sidiropoulos,et al.  Batch and Adaptive PARAFAC-Based Blind Separation of Convolutive Speech Mixtures , 2010, IEEE Transactions on Audio, Speech, and Language Processing.

[7]  C. B. Bell Mutual Information and Maximal Correlation as Measures of Dependence , 1962 .

[8]  Moon,et al.  Estimation of mutual information using kernel density estimators. , 1995, Physical review. E, Statistical physics, plasmas, fluids, and related interdisciplinary topics.

[9]  Andrew B. Nobel Analysis of a complexity-based pruning scheme for classification trees , 2002, IEEE Trans. Inf. Theory.

[10]  Satishs Iyengar,et al.  Multivariate Models and Dependence Concepts , 1998 .

[11]  Shrikanth S. Narayanan,et al.  Nonproduct Data-Dependent Partitions for Mutual Information Estimation: Strong Consistency and Applications , 2010, IEEE Transactions on Signal Processing.

[12]  C. Spearman The proof and measurement of association between two things. , 2015, International journal of epidemiology.

[13]  M. D. Taylor Multivariate measures of concordance , 2007 .

[14]  Gerald S. Rogers,et al.  Mathematical Statistics: A Decision Theoretic Approach , 1967 .

[15]  Roger B. Nelsen,et al.  Nonparametric measures of multivariate association , 1996 .

[16]  Nils Blomqvist,et al.  On a Measure of Dependence Between two Random Variables , 1950 .

[17]  H. Gebelein Das statistische Problem der Korrelation als Variations‐ und Eigenwertproblem und sein Zusammenhang mit der Ausgleichsrechnung , 1941 .

[18]  P. J. Green,et al.  Density Estimation for Statistics and Data Analysis , 1987 .

[19]  Athanasios C. Micheas,et al.  Measuring stochastic dependence using φ -divergence , 2006 .

[20]  Igor Vajda,et al.  Asymptotically Sufficient Partitions and Quantizations , 2006, IEEE Transactions on Information Theory.

[21]  J. Kent Information gain and a general measure of correlation , 1983 .

[22]  M. Sklar Fonctions de repartition a n dimensions et leurs marges , 1959 .

[23]  Igor Vajda,et al.  Estimation of the Information by an Adaptive Partitioning of the Observation Space , 1999, IEEE Trans. Inf. Theory.

[24]  Barnabás Póczos,et al.  On the Estimation of alpha-Divergences , 2011, AISTATS.

[25]  Reza Modarres,et al.  Measures of Dependence , 2011, International Encyclopedia of Statistical Science.

[26]  Sang Joon Kim,et al.  A Mathematical Theory of Communication , 2006 .

[27]  I. Vajda Theory of statistical inference and information , 1989 .

[28]  G. Lugosi,et al.  Consistency of Data-driven Histogram Methods for Density Estimation and Classification , 1996 .

[29]  Terrence J. Sejnowski,et al.  Independent Component Analysis Using an Extended Infomax Algorithm for Mixed Sub-Gaussian and Super-Gaussian Sources , 1999, Neural Comput..

[30]  C. Quesenberry,et al.  A nonparametric estimate of a multivariate density function , 1965 .

[31]  S. Kotz,et al.  Correlation and dependence , 2001 .

[32]  Friedrich Schmid,et al.  Multivariate Extensions of Spearman's Rho and Related Statistics , 2007 .

[33]  A. Kolmogorov On Logical Foundations of Probability Theory , 1983 .

[34]  D. Slepian The one-sided barrier problem for Gaussian noise , 1962 .

[35]  Friedrich Schmid,et al.  Copula-Based Measures of Multivariate Association , 2010 .

[36]  Emmanuel Vincent,et al.  Validity of the Independence Assumption for the Separation of Instantaneous and Convolutive Mixtures of Speech and Music Sources , 2009, ICA.

[37]  W. Kruskal Ordinal Measures of Association , 1958 .

[38]  Matthew P. Wand,et al.  Kernel Smoothing , 1995 .

[39]  M. Kendall A NEW MEASURE OF RANK CORRELATION , 1938 .

[40]  Harry Joe,et al.  Multivariate concordance , 1990 .

[41]  E. H. Linfoot An Informational Measure of Correlation , 1957, Inf. Control..

[42]  K. Pearson On the Theory of Contingency and Its Relation to Association and Normal Correlation , 2013 .

[43]  L. Rüschendorf On the distributional transform, Sklar's theorem, and the empirical copula process , 2009 .

[44]  H. Lebesgue Sur l'intégration des fonctions discontinues , 1910 .

[45]  Shrikanth Narayanan,et al.  Information divergence estimation based on data-dependent partitions , 2010 .

[46]  Seungjin Choi,et al.  Independent Component Analysis , 2009, Handbook of Natural Computing.