论文信息 - Adaptive Ensemble Learning with Confidence Bounds for Personalized Diagnosis

Adaptive Ensemble Learning with Confidence Bounds for Personalized Diagnosis

With the advances in the field of medical informatics, automated clinical decision support systems are becoming the de facto standard in personalized diagnosis. In order to establish high accuracy and confidence in personalized diagnosis, massive amounts of distributed, heterogeneous, correlated and highdimensional patient data from different sources such as wearable sensors, mobile applications, Electronic Health Record (EHR) databases etc. need to be processed. This requires learning both locally and globally due to privacy constraints and/or distributed nature of the multi-modal medical data. In the last decade, a large number of meta-learning techniques have been proposed in which local learners make online predictions based on their locally-collected data instances, and feed these predictions to an ensemble learner, which fuses them and issues a global prediction. However, most of these works do not provide performance guarantees or, when they do, these guarantees are asymptotic. None of these existing works provide confidence estimates about the issued predictions or rate of learning guarantees for the ensemble learner. In this paper, we provide a systematic ensemble learning method called Hedged Bandits, which comes with both long run (asymptotic) and short run (rate of learning) performance guarantees. Moreover, we show that our proposed method outperforms all existing ensemble learning techniques, even in the presence of concept drift.

[1] D.P.L. Simons. Consumer Electronics Opportunities in Remote and Home Healthcare , 2008, 2008 Digest of Technical Papers - International Conference on Consumer Electronics.

[2] John Kelley,et al. WhozThat? evolving an ecosystem for context-aware mobile social networks , 2008, IEEE Network.

[3] William Nick Street,et al. Breast Cancer Diagnosis and Prognosis Via Linear Programming , 1995, Oper. Res..

[4] Edward J. Delp,et al. An overview of multimedia content protection in consumer electronics devices , 2001, Signal Process. Image Commun..

[5] Indre Zliobaite,et al. Learning under Concept Drift: an Overview , 2010, ArXiv.

[6] Marcus A. Maloof,et al. Using additive expert ensembles to cope with concept drift , 2005, ICML.

[7] Yoav Freund,et al. A decision-theoretic generalization of on-line learning and an application to boosting , 1997, EuroCOLT.

[8] Yoav Freund,et al. A decision-theoretic generalization of on-line learning and an application to boosting , 1995, EuroCOLT.

[9] Xin Yao,et al. DDD: A New Ensemble Approach for Dealing with Concept Drift , 2012, IEEE Transactions on Knowledge and Data Engineering.

[10] Marcus A. Maloof,et al. Dynamic Weighted Majority: An Ensemble Method for Drifting Concepts , 2007, J. Mach. Learn. Res..

[11] Bhavani M. Thuraisingham,et al. Integrating Novel Class Detection with Classification for Concept-Drifting Data Streams , 2009, ECML/PKDD.

[12] Yukun Cao,et al. An intelligent fuzzy-based recommendation system for consumer electronic products , 2007, Expert Syst. Appl..

[13] Piotr J. Slomka,et al. IMPROVED ACCURACY OF MYOCARDIAL PERFUSION SPECT FOR DETECTION OF CORONARY ARTERY DISEASE BY MACHINE LEARNING IN A LARGE POPULATION , 2013 .

[14] Avrim Blum,et al. Empirical Support for Winnow and Weighted-Majority Algorithms: Results on a Calendar Scheduling Domain , 2004, Machine Learning.

[15] Manfred K. Warmuth,et al. The weighted majority algorithm , 1989, 30th Annual Symposium on Foundations of Computer Science.

[16] Vladimir Vapnik,et al. Principles of Risk Minimization for Learning Theory , 1991, NIPS.

[17] Mihaela van der Schaar,et al. Ensemble of distributed learners for online classification of dynamic data streams , 2013, IEEE Transactions on Signal and Information Processing over Networks.

[18] William Nick Street,et al. A streaming ensemble algorithm (SEA) for large-scale classification , 2001, KDD '01.

[19] Salvatore J. Stolfo,et al. The application of AdaBoost for distributed, scalable and on-line learning , 1999, KDD '99.