Generalized nonparametric smoothing with mixed discrete and continuous data

The nonparametric smoothing technique with mixed discrete and continuous regressors is considered. It is generally admitted that it is better to smooth the discrete variables, which is similar to the smoothing technique for continuous regressors but using discrete kernels. However, such an approach might lead to a potential problem which is linked to the bandwidth selection for the continuous regressors due to the presence of the discrete regressors. Through the numerical study, it is found that in many cases, the performance of the resulting nonparametric regression estimates may deteriorate if the discrete variables are smoothed in the way previously addressed, and that a fully separate estimation without any smoothing of the discrete variables may provide significantly better results both for bias and variance. As a solution, it is suggested a simple generalization of the nonparametric smoothing technique with both discrete and continuous data to address this problem and to provide estimates with more robust performance. The asymptotic theory for the new nonparametric smoothing method is developed and the finite sample behavior of the proposed generalized approach is studied through extensive Monte-Carlo experiments as well an empirical illustration.

[1]  M. C. Jones,et al.  A comparison of local constant and local linear regression quantile estimators , 1997 .

[2]  Jeffrey S. Racine,et al.  Nonparametric estimation of distributions with categorical and continuous data , 2003 .

[3]  Léopold Simar,et al.  Local likelihood estimation of truncated regression and its partial derivatives: theory and application , 2008 .

[4]  R. Russell,et al.  Human Capital and Convergence: A Production-Frontier Approach , 2005 .

[5]  Qi Li,et al.  Efficient Estimation of Average Treatment Effects with Mixed Categorical and Continuous Data , 2009 .

[6]  Valentin Zelenyuk,et al.  Testing for (Efficiency) Catching-up , 2007 .

[7]  V. Zelenyuk,et al.  Technological Change , Technological Catch-up , and Capital Deepening : Relative Contributions to Growth and Convergence During 90 ’ s ∗ , 2004 .

[8]  María Dolores Martínez Miranda,et al.  The choice of smoothing parameter in nonparametric regression through Wild Bootstrap , 2004, Comput. Stat. Data Anal..

[9]  Léopold Simar,et al.  On Testing Equality of Distributions of Technical Efficiency Scores , 2006 .

[10]  W. Cleveland Robust Locally Weighted Regression and Smoothing Scatterplots , 1979 .

[11]  Qi Li,et al.  Nonparametric Econometrics: Theory and Practice , 2006 .

[12]  Jeffrey S. Racine,et al.  A Consistent Model Specification Test with Mixed Discrete and Continuous Data , 2006 .

[13]  Jianqing Fan,et al.  Generalized likelihood ratio statistics and Wilks phenomenon , 2001 .

[14]  Jeffrey S. Racine,et al.  Nonparametric Estimation of Conditional CDF and Quantile Functions With Mixed Categorical and Continuous Data , 2008 .

[15]  Jeffrey S. Racine,et al.  Nonparametric Estimation of Regression Functions in the Presence of Irrelevant Regressors , 2007, The Review of Economics and Statistics.

[16]  J. Hart,et al.  Testing the Significance of Categorical Predictor Variables in Nonparametric Regression Models , 2006 .

[17]  Jianqing Fan,et al.  Variable Bandwidth and Local Linear Regression Smoothers , 1992 .

[18]  Valentin Zelenyuk,et al.  Technological Change and Transition: Relative Contributions to Worldwide Growth During the 1990s , 2008 .

[19]  Aman Ullah,et al.  Nonparametric Econometrics: Introduction , 1999 .

[20]  Christopher F. Parmeter,et al.  Economies of Scope of Lending and Mobilizing Deposits in Microfinance Institutions: A Semiparametric Analysis , 2010 .

[21]  Qi Li,et al.  Categorical semiparametric varying‐coefficient models , 2013 .

[22]  Jianqing Fan Local Linear Regression Smoothers and Their Minimax Efficiencies , 1993 .

[23]  Thanasis Stengos,et al.  Intertemporal Pricing and Price Discrimination: A Semiparametric Hedonic Analysis of the Personal Computer Market , 2006 .

[24]  Qi Li,et al.  Cross-validation and the estimation of probability distributions with categorical data , 2006 .

[25]  M. Wand,et al.  Multivariate Locally Weighted Least Squares Regression , 1994 .

[26]  Adrian W. Bowman,et al.  Computational aspects of nonparametric smoothing with illustrations from the sm library , 2003, Comput. Stat. Data Anal..

[27]  R. R. Russell,et al.  Technological Change, Technological Catch-up, and Capital Deepening: Relative Contributions to Growth and Convergence , 2002 .

[28]  M. Frölich Non-Parametric Regression for Binary Dependent Variables , 2006 .

[29]  C. J. Stone,et al.  Consistent Nonparametric Regression , 1977 .

[30]  W. Cleveland,et al.  Locally Weighted Regression: An Approach to Regression Analysis by Local Fitting , 1988 .

[31]  O. Linton,et al.  Local nonlinear least squares: Using parametric information in nonparametric regression , 2000 .

[32]  Jeffrey S. Racine,et al.  CROSS-VALIDATED LOCAL LINEAR NONPARAMETRIC REGRESSION , 2004 .

[33]  Jianqing Fan Design-adaptive Nonparametric Regression , 1992 .

[34]  Jeffrey S. Racine,et al.  Nonparametric estimation of regression functions with both categorical and continuous data , 2004 .

[35]  B. Silverman,et al.  Weak and strong uniform consistency of kernel regression estimates , 1982 .

[36]  D. Henderson,et al.  The Impact of Homework on Student Achievement , 2006 .

[37]  Jianqing Fan,et al.  Local polynomial modelling and its applications , 1994 .

[38]  Jeffrey S. Racine,et al.  Cross-Validation and the Estimation of Conditional Probability Densities , 2004 .

[39]  Jianqing Fan,et al.  Nonparametric inference with generalized likelihood ratio tests , 2007 .

[40]  W. D. Walls,et al.  Screen wars, star wars, and sequels , 2009 .

[41]  Léopold Simar,et al.  Categorical data in local maximum likelihood: theory and applications to productivity analysis , 2015 .

[42]  D. Henderson A Test for Multimodality of Regression Derivatives with an Application to Nonparametric Growth Regressions , 2010 .

[43]  Jeffrey S. Racine,et al.  Growth and convergence: A profile of distribution dynamics and mobility , 2007 .

[44]  L. Simar,et al.  To Smooth or Not to Smooth? The Case of Discrete Variables in Nonparametric Regressions , 2011 .

[45]  Christopher F. Parmeter,et al.  Nonparametric estimation of a hedonic price function , 2007 .