RFCDE: Random Forests for Conditional Density Estimation

Random forests are a common nonparametric regression technique that performs well with mixed-type data and irrelevant covariates while remaining robust to monotonic variable transformations. Existing random forest implementations target regression or classification. We introduce the RFCDE package for fitting random forest models optimized for nonparametric conditional density estimation, including joint densities for multiple responses. This enables analysis of conditional probability distributions, which is useful for propagating uncertainty, and of joint distributions, which describe relationships between multiple responses and covariates. RFCDE is released under the MIT open-source license and can be accessed at this https URL. Both R and Python versions, which call a common C++ library, are available.
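
To make the idea of a forest-based conditional density estimate concrete, the following is a minimal, standard-library Python sketch. It is not the RFCDE API and does not call the package. It assumes one common construction: the leaves that a query point falls into define weights over the training responses, and a weighted Gaussian kernel density estimate is taken over those responses. The function names (`build_tree`, `fit_forest`, `conditional_density`), the synthetic data, and the bandwidth are all hypothetical, and the trees split on squared error as a stand-in rather than on a loss tailored to density estimation.

```python
import math
import random

def sse(z, idx):
    """Sum of squared errors of the responses z[i], i in idx, around their mean."""
    m = sum(z[i] for i in idx) / len(idx)
    return sum((z[i] - m) ** 2 for i in idx)

def build_tree(X, z, idx, rng, min_leaf=5, n_candidates=10):
    """Grow one tree; each leaf stores the training indices that fall in it."""
    best = None
    if len(idx) >= 2 * min_leaf:
        for _ in range(n_candidates):
            j = rng.randrange(len(X[0]))   # random covariate
            t = X[rng.choice(idx)][j]      # random observed threshold
            lo = [i for i in idx if X[i][j] <= t]
            hi = [i for i in idx if X[i][j] > t]
            if len(lo) < min_leaf or len(hi) < min_leaf:
                continue
            # Stand-in split criterion (assumption): squared error, not a CDE loss.
            score = sse(z, lo) + sse(z, hi)
            if best is None or score < best[0]:
                best = (score, j, t, lo, hi)
    if best is None:
        return {"leaf": idx}
    _, j, t, lo, hi = best
    return {"var": j, "thr": t,
            "lo": build_tree(X, z, lo, rng, min_leaf, n_candidates),
            "hi": build_tree(X, z, hi, rng, min_leaf, n_candidates)}

def fit_forest(X, z, n_trees, rng):
    """Fit each tree on a bootstrap resample of the training indices."""
    n = len(X)
    return [build_tree(X, z, [rng.randrange(n) for _ in range(n)], rng)
            for _ in range(n_trees)]

def find_leaf(tree, x):
    """Route x down the tree and return the training indices in its leaf."""
    while "leaf" not in tree:
        tree = tree["lo"] if x[tree["var"]] <= tree["thr"] else tree["hi"]
    return tree["leaf"]

def forest_weights(forest, x, n):
    """Weight each training point by how often it shares a leaf with x."""
    w = [0.0] * n
    for tree in forest:
        leaf = find_leaf(tree, x)
        for i in leaf:
            w[i] += 1.0 / len(leaf)
    total = sum(w)
    return [wi / total for wi in w]

def conditional_density(forest, x, z_train, z_grid, bandwidth=0.2):
    """Weighted Gaussian KDE of the training responses, evaluated on z_grid."""
    w = forest_weights(forest, x, len(z_train))
    norm = bandwidth * math.sqrt(2.0 * math.pi)
    pairs = [(wi, zi) for wi, zi in zip(w, z_train) if wi > 0.0]
    return [sum(wi * math.exp(-0.5 * ((zg - zi) / bandwidth) ** 2)
                for wi, zi in pairs) / norm
            for zg in z_grid]

# Synthetic data (assumption): the response depends on the first covariate only;
# the second covariate is irrelevant noise.
rng = random.Random(0)
X = [[rng.random(), rng.random()] for _ in range(300)]
z = [4.0 * x[0] + rng.gauss(0.0, 0.3) for x in X]

forest = fit_forest(X, z, n_trees=20, rng=rng)
z_grid = [-3.0 + 0.05 * k for k in range(201)]
dens_lo = conditional_density(forest, [0.1, 0.5], z, z_grid)
dens_hi = conditional_density(forest, [0.9, 0.5], z, z_grid)
```

Each returned list is a full density over `z_grid` rather than a single point prediction, so quantities such as intervals or tail probabilities can be read off it. That is the kind of uncertainty propagation the abstract refers to.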
