Sequential adaptive design for jump regression estimation

Selecting input data or design points for statistical models has been of great interest in sequential design and active learning. In this paper, we present a new strategy of selecting the design points for a regression model when the underlying regression function is discontinuous. Two main motivating examples are (1) compressed material imaging with the purpose of accelerating the imaging speed and (2) design for regression analysis over a phase diagram in chemistry. In both examples, the underlying regression functions have discontinuities, so many of the existing design optimization approaches cannot be applied for the two examples because they mostly assume a continuous regression function. There are some studies for estimating a discontinuous regression function from its noisy observations, but all noisy observations are typically provided in advance in these studies. In this paper, we develop a design strategy of selecting the design points for regression analysis with discontinuities. We first review the existing approaches relevant to design optimization and active learning for regression analysis and discuss their limitations in handling a discontinuous regression function. We then present our novel design strategy for a regression analysis with discontinuities: some statistical properties with a fixed design will be presented first, and then these properties will be used to propose a new criterion of selecting the design points for the regression analysis. Sequential design of experiments with the new criterion will be presented with numerical examples.

[1]  P. Qiu Jump Surface Estimation, Edge Detection, and Image Restoration , 2007 .

[2]  Peihua Qiu,et al.  The Local Piecewisely Linear Kernel Smoothing Procedure for Fitting Jump Regression Surfaces , 2004, Technometrics.

[3]  Jianqing Fan,et al.  Local polynomial modelling and its applications , 1994 .

[4]  P. Mykland,et al.  Nonlinear Experiments: Optimal Design and Inference Based on Likelihood , 1993 .

[5]  Peihua Qiu,et al.  Discontinuous regression surfaces fitting , 1998 .

[6]  Stergios B. Fotopoulos,et al.  All of Nonparametric Statistics , 2007, Technometrics.

[7]  David A. Cohn,et al.  Active Learning with Statistical Models , 1996, NIPS.

[8]  M. Rosenblatt,et al.  Multivariate k-nearest neighbor density estimates , 1979 .

[9]  G. Wahba Spline models for observational data , 1990 .

[10]  Emmanuel J. Candès,et al.  On the Fundamental Limits of Adaptive Sensing , 2011, IEEE Transactions on Information Theory.

[11]  Thomas J. Santner,et al.  The Design and Analysis of Computer Experiments , 2003, Springer Series in Statistics.

[12]  Lawrence Carin,et al.  Active Learning and Basis Selection for Kernel-Based Linear Models: A Bayesian Perspective , 2010, IEEE Transactions on Signal Processing.

[13]  Hovav A. Dror,et al.  Sequential Experimental Designs for Generalized Linear Models , 2008 .

[14]  Andreas Krause,et al.  Efficient Informative Sensing using Multiple Robots , 2014, J. Artif. Intell. Res..

[15]  Matthew Malloy,et al.  Near-Optimal Adaptive Compressed Sensing , 2012, IEEE Transactions on Information Theory.

[16]  Rahul Rao,et al.  Autonomy in materials research: a case study in carbon nanotube growth , 2016 .

[17]  Dale L. Zimmerman,et al.  Optimal network design for spatial prediction, covariance parameter estimation, and empirical prediction , 2006 .

[18]  Andreas Krause,et al.  Near-Optimal Sensor Placements in Gaussian Processes: Theory, Efficient Algorithms and Empirical Studies , 2008, J. Mach. Learn. Res..

[19]  P. Qiu Image processing and jump regression analysis , 2005 .

[20]  M. Stein,et al.  Spatial sampling design for prediction with estimated parameters , 2006 .

[21]  Sonja Kuhnt,et al.  Design and analysis of computer experiments , 2010 .

[22]  Adam D. Bull Spatially-adaptive sensing in nonparametric regression , 2012, 1207.0327.

[23]  L. Carin,et al.  Applying compressive sensing to TEM video: a substantial frame rate increase on any camera , 2015, Advanced Structural and Chemical Imaging.

[24]  Ambuj Tewari,et al.  Active Learning for Non-Parametric Regression Using Purely Random Trees , 2018, NeurIPS.

[25]  W. Yao,et al.  Sequential design for nonparametric inference , 2012 .

[26]  H. Chernoff Sequential Analysis and Optimal Design , 1987 .

[27]  D. Wiens,et al.  Robust sequential designs for nonlinear regression , 2002 .

[28]  Peihua Qiu,et al.  Jump-preserving surface reconstruction from noisy data , 2009 .

[29]  M. Wand,et al.  Multivariate Locally Weighted Least Squares Regression , 1994 .

[30]  Andreas Krause,et al.  Near-optimal sensor placements in Gaussian processes , 2005, ICML.

[31]  B. Yandell,et al.  Jump Detection in Regression Surfaces , 1997 .