A General Derivative Identity for the Conditional Expectation with Focus on the Exponential Family

Consider a pair of random vectors $(\mathrm{X}, \mathrm{Y})$ and the conditional expectation operator $\mathbb{E}[\mathrm{X} \mid \mathrm{Y}=\mathrm{y}]$. This work studies analytic properties of the conditional expectation by characterizing various derivative identities. The paper consists of two parts. In the first part of the paper, a general derivative identity for the conditional expectation is derived. Specifically, for the Markov chain $\mathrm{U} \leftrightarrow \mathrm{X} \leftrightarrow \mathrm{Y}$, a compact expression for the Jacobian matrix of $\mathbb{E}[\mathrm{U} \mid \mathrm{Y}=\mathrm{y}]$ is derived. In the second part of the paper, the main identity is specialized to the exponential family. Moreover, via various choices of the random vector U, the new identity is used to recover and generalize several known identities and derive some new ones. As a first example, a connection between the Jacobian of $\mathbb{E}[\mathrm{X} \mid \mathrm{Y}=\mathrm{y}]$ and the conditional variance is established. As a second example, a recursive expression between higher order conditional expectations is found, which is shown to lead to a generalization of the Tweedy’s identity. Finally, as a third example, it is shown that the k-th order derivative of the conditional expectation is proportional to the $(k+1)$-th order conditional cumulant.

[1]  Amin G. Jaffer,et al.  On Relations Between Detection and Estimation of Discrete Time Processes , 1972, Inf. Control..

[2]  Shlomo Shamai,et al.  On the Distribution of the Conditional Mean Estimator in Gaussian Noise , 2021, 2020 IEEE Information Theory Workshop (ITW).

[3]  W. Hager,et al.  and s , 2019, Shallow Water Hydraulics.

[4]  Shlomo Shamai,et al.  A General Derivative Identity for the Conditional Mean Estimator in Gaussian Noise and Some Applications , 2020, 2020 IEEE International Symposium on Information Theory (ISIT).

[5]  Xin Guo,et al.  On the optimality of conditional expectation as a Bregman predictor , 2005, IEEE Trans. Inf. Theory.

[6]  Amin G. Jaffer,et al.  A note on conditional moments of random signals in Gaussian noise (Corresp.) , 1972, IEEE Trans. Inf. Theory.

[7]  Daniel Pérez Palomar,et al.  Gradient of mutual information in linear vector Gaussian channels , 2006, IEEE Transactions on Information Theory.

[8]  Bradley Efron,et al.  Local False Discovery Rates , 2005 .

[9]  H. Robbins An Empirical Bayes Approach to Statistics , 1956 .

[10]  Wael Alghamdi,et al.  Polynomial Approximations of Conditional Expectations in Scalar Gaussian Channels , 2021, 2021 IEEE International Symposium on Information Theory (ISIT).

[11]  D. Blackwell Conditional Expectation and Unbiased Sequential Estimation , 1947 .

[12]  R. Esposito,et al.  On a Relation between Detection and Estimation in Decision Theory , 1968, Inf. Control..

[13]  Loren W. Nolte,et al.  Some geometric properties of the likelihood ratio (Corresp.) , 1971, IEEE Trans. Inf. Theory.

[14]  Shlomo Shamai,et al.  The Interplay Between Information and Estimation Measures , 2013, Found. Trends Signal Process..