Testing for Conditional Mean Independence with Covariates through Martingale Difference Divergence

As a crucial problem in statistics is to decide whether additional variables are needed in a regression model. We propose a new multivariate test to investigate the conditional mean independence of Y given X conditioning on some known effect Z, i.e., E(Y|X, Z) = E(Y|Z). Assuming that E(Y|Z) and Z are linearly related, we reformulate an equivalent notion of conditional mean independence through transformation, which is approximated in practice. We apply the martingale difference divergence (Shao and Zhang, 2014) to measure conditional mean dependence, and show that the estimation error from approximation is negligible, as it has no impact on the asymptotic distribution of the test statistic under some regularity assumptions. The implementation of our test is demonstrated by both simulations and a financial data example.

[1]  Runze Li,et al.  Feature Screening via Distance Correlation Learning , 2012, Journal of the American Statistical Association.

[2]  S. Chatterjee,et al.  Regression Analysis by Example , 1979 .

[3]  Quang Vuong,et al.  NONPARAMETRIC SIGNIFICANCE TESTING , 2000, Econometric Theory.

[4]  Tyler H. McCormick,et al.  An Expectation Conditional Maximization Approach for Gaussian Graphical Models , 2017, Journal of computational and graphical statistics : a joint publication of American Statistical Association, Institute of Mathematical Statistics, Interface Foundation of North America.

[5]  H. White,et al.  A Consistent Characteristic-Function-Based Test for Conditional Independence , 2003 .

[6]  Heping Zhang,et al.  Conditional Distance Correlation , 2015, Journal of the American Statistical Association.

[7]  Maria L. Rizzo,et al.  Partial Distance Correlation with Methods for Dissimilarities , 2013, 1310.2926.

[8]  R. Cook,et al.  Dimension reduction for conditional mean in regression , 2002 .

[9]  Ze Jin,et al.  Generalizing distance covariance to measure and test multivariate mutual dependence via complete and incomplete V-statistics , 2017, J. Multivar. Anal..

[10]  E. Fama,et al.  A Five-Factor Asset Pricing Model , 2014 .

[11]  Chih-Ling Tsai,et al.  Testing covariates in high-dimensional regression , 2014 .

[12]  Maria L. Rizzo,et al.  Measuring and testing dependence by correlation of distances , 2007, 0803.4101.

[13]  Ze Jin,et al.  Independent Component Analysis via Energy-based and Kernel-based Mutual Dependence Measures , 2018, 1805.06639.

[14]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[15]  Yang Feng,et al.  A Conditional Dependence Measure with Applications to Undirected Graphical Models , 2015 .

[16]  Vachik S. Dave,et al.  Feature Selection for Classification under Anonymity Constraint , 2015, Trans. Data Priv..

[17]  Thomas M. Stoker,et al.  Goodness-of-fit tests for kernel regression with an application to option implied volatilities , 2001 .

[18]  W. Sharpe CAPITAL ASSET PRICES: A THEORY OF MARKET EQUILIBRIUM UNDER CONDITIONS OF RISK* , 1964 .

[19]  Craig Hiemstra,et al.  Testing for Linear and Nonlinear Granger Causality in the Stock Price-Volume Relation , 1994 .

[20]  Yanlin Tang,et al.  Testing for the presence of significant covariates through conditional marginal regression , 2018 .

[21]  Xiaofeng Shao,et al.  Martingale Difference Correlation and Its Use in High-Dimensional Variable Screening , 2014 .

[22]  J. Mossin EQUILIBRIUM IN A CAPITAL ASSET MARKET , 1966 .

[23]  A. E. Hoerl,et al.  Ridge regression: biased estimation for nonorthogonal problems , 2000 .

[24]  Pierre Legendre,et al.  Comparison of permutation methods for the partial correlation and partial mantel tests , 2000 .

[25]  J. Bien,et al.  Hierarchical Sparse Modeling: A Choice of Two Group Lasso Formulations , 2015, 1512.01631.

[26]  Bin Chen,et al.  TESTING FOR THE MARKOV PROPERTY IN TIME SERIES , 2011, Econometric Theory.

[27]  J. Lintner THE VALUATION OF RISK ASSETS AND THE SELECTION OF RISKY INVESTMENTS IN STOCK PORTFOLIOS AND CAPITAL BUDGETS , 1965 .

[28]  E. Fama,et al.  Common risk factors in the returns on stocks and bonds , 1993 .

[29]  Samuel J. Clark,et al.  Bayesian Joint Spike-and-Slab Graphical Lasso , 2018, ICML.

[30]  Shouyang Wang,et al.  Granger Causality in Risk and Detection of Extreme Risk Spillover Between Financial Markets , 2009 .

[31]  Yanqin Fan,et al.  Consistent model specification tests : Omitted variables and semiparametric functional forms , 1996 .

[32]  Xiaohan Yan,et al.  Rare Feature Selection in High Dimensions , 2018, Journal of the American Statistical Association.

[33]  Feng Liang,et al.  Bayesian Regularization for Graphical Models With Unequal Shrinkage , 2018, Journal of the American Statistical Association.

[34]  Xiaofeng Shao,et al.  Partial martingale difference correlation , 2015 .

[35]  Jianqing Fan,et al.  High Dimensional Covariance Matrix Estimation in Approximate Factor Models , 2011, Annals of statistics.

[36]  Hamdi Raïssi,et al.  Testing linear causality in mean when the number of estimated parameters is high , 2011 .