论文信息 - Unsupervised Sequential Sensor Acquisition

Unsupervised Sequential Sensor Acquisition

In many security and healthcare systems a sequence of sensors/tests are used for detection and diagnosis. Each test outputs a prediction of the latent state, and carries with it inherent costs. Our objective is to learn strategies for selecting tests to optimize accuracy & costs. Unfortunately it is often impossible to acquire in-situ ground truth annotations and we are left with the problem of unsupervised sensor selection (USS). We pose USS as a version of stochastic partial monitoring problem with an unusual reward structure (even noisy annotations are unavailable). Unsurprisingly no learner can achieve sublinear regret without further assumptions. To this end we propose the notion of weak-dominance. This is a condition on the joint probability distribution of test outputs and latent state and says that whenever a test is accurate on an example, a later test in the sequence is likely to be accurate as well.We empirically verify that weak dominance holds on real datasets and prove that it is a maximal condition for achieving sublinear regret. We reduce USS to a special case of multi-armed bandit problem with side information and develop polynomial time algorithms that achieve sublinear regret.

[1] D. Teneketzis,et al. Asymptotically Efficient Adaptive Allocation Schemes for Controlled I.I.D. Processes: Finite Paramet , 1988 .

[2] Dan Roth,et al. Learning cost-sensitive active classifiers , 2002, Artif. Intell..

[3] Nathan R. Sturtevant,et al. Learning when to stop thinking and do something! , 2009, ICML '09.

[4] Shie Mannor,et al. From Bandits to Experts: On the Value of Side-Observations , 2011, NIPS.

[5] Venkatesh Saligrama,et al. Supervised Sequential Classification Under Budget Constraints , 2013, AISTATS.

[6] Noga Alon,et al. From Bandits to Experts: A Tale of Domination and Independence , 2013, NIPS.

[7] Venkatesh Saligrama,et al. Multi-stage classifier design , 2012, Machine Learning.

[8] Russell Greiner,et al. Online Learning with Costly Features and Labels , 2013, NIPS.

[9] Csaba Szepesvári,et al. Partial Monitoring - Classification, Regret Bounds, and Algorithms , 2014, Math. Oper. Res..

[10] Koby Crammer,et al. Prediction with Limited Advice and Multiarmed Bandits with Paid Observations , 2014, ICML.

[11] Noga Alon,et al. Online Learning with Feedback Graphs: Beyond Bandits , 2015, COLT.

[12] Yifan Wu,et al. Online Learning with Gaussian Payoffs and Side Observations , 2015, NIPS.

[13] J. Soto,et al. Effect of an Institutional Triaging Algorithm on the Use of Multidetector CT for Patients with Blunt Abdominopelvic Trauma over an 8-year Period. , 2017, Radiology.