Flexible Bayesian Nonlinear Model Configuration

Regression models are used in a wide range of applications, providing a powerful scientific tool for researchers from different fields. Linear, or simple parametric, models are often not sufficient to describe complex relationships between input variables and a response. Such relationships can be better described through flexible approaches such as neural networks, but this results in less interpretable models and potential overfitting. Alternatively, specific parametric nonlinear functions can be used, but the specification of such functions is in general complicated. In this paper, we introduce a flexible approach for the construction and selection of highly flexible nonlinear parametric regression models. Nonlinear features are generated hierarchically, similarly to deep learning, but with additional flexibility in the types of features that can be considered. This flexibility, combined with variable selection, allows us to find a small set of important features and thereby more interpretable models. Within the space of possible functions, a Bayesian approach is taken, introducing priors for functions based on their complexity. A genetically modified mode jumping Markov chain Monte Carlo (GMJMCMC) algorithm is adopted to perform Bayesian inference and estimate posterior probabilities for model averaging. In various applications, we illustrate how our approach is used to obtain meaningful nonlinear models. Additionally, we compare its predictive performance with several machine learning algorithms.
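Since the abstract only sketches the hierarchical feature-generation idea, the following is a minimal illustration of what it can look like in practice. This is a hedged sketch, not the authors' implementation: the transformation set G, the function new_generation, and all other names are illustrative assumptions. New candidate features are built from existing ones as nonlinear modifications g(F_j), multiplications F_j * F_k, or projections g(alpha_0 + sum_j alpha_j F_j), and each generation can reuse features produced in earlier generations.

```python
import numpy as np

# Pool of nonlinear transformations (an assumed example set, not the paper's).
G = {
    "sigmoid": lambda z: 1.0 / (1.0 + np.exp(-z)),
    "sin": np.sin,
    "cbrt": np.cbrt,
}

def new_generation(features, rng, n_new=5):
    """Generate candidate features from an existing pool.

    `features` is a list of (name, column) pairs. Each new feature is either
    a nonlinear modification g(F_j), a multiplication F_j * F_k, or a
    projection g(alpha_0 + sum_j alpha_j * F_j) with random weights.
    """
    out = []
    for _ in range(n_new):
        kind = rng.choice(["modification", "multiplication", "projection"])
        g_name, g = list(G.items())[rng.integers(len(G))]
        if kind == "modification":
            name, col = features[rng.integers(len(features))]
            out.append((f"{g_name}({name})", g(col)))
        elif kind == "multiplication":
            (n1, c1), (n2, c2) = (features[i] for i in rng.integers(len(features), size=2))
            out.append((f"{n1}*{n2}", c1 * c2))
        else:  # projection over a random subset of existing features
            idx = rng.choice(len(features), size=min(2, len(features)), replace=False)
            alpha = rng.normal(size=len(idx) + 1)
            z = alpha[0] + sum(a * features[i][1] for a, i in zip(alpha[1:], idx))
            name = f"{g_name}(lin({','.join(features[i][0] for i in idx)}))"
            out.append((name, g(z)))
    return out

rng = np.random.default_rng(1)
X = rng.normal(size=(100, 3))
pool = [(f"x{j}", X[:, j]) for j in range(X.shape[1])]
pool += new_generation(pool, rng)  # generation 1: features of the inputs
pool += new_generation(pool, rng)  # generation 2: features of features
print([name for name, _ in pool])
```

One plausible reading of the complexity-based prior mentioned above, stated here as an assumption rather than a quotation from the paper, is to penalize each included feature by a measure c(F_j) of how many operations were used to build it:

p(m) \propto \prod_{j:\, \gamma_j = 1} a^{c(F_j)}, \qquad 0 < a < 1,

where \gamma_j indicates whether feature F_j enters model m, so deeper, more elaborate features must earn their inclusion through a correspondingly better fit.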
