Ensemble Learning Frameworks for the Discovery of Multi-component Quantitative Models in Biomedical Applications

Increasing availability of multi-scale physiological data opens new horizons for quantitative modeling in biomedical applications. However, practical limitations of existing approaches include both the low accuracy of the simplified analytical models and empirical expert-defined rules and the insufficient interpretability and stability of the pure data-driven models. Recently it was shown that generic boosting-based frameworks can be successfully used to address these challenges of quantitative modeling in financial applications. Boosting and similar ensemble learning techniques are capable of discovering robust multi-component meta-models from a collection of existing and well-understood base models. Accuracy and stability of such interpretable ensembles of complementary models are often significantly higher than those of the single models. Here we establish the plausibility that this ensemble learning approach can overcome such challenges also in biomedical applications.

[1]  Robert E. Schapire,et al.  Design and analysis of efficient learning algorithms , 1992, ACM Doctoral dissertation award ; 1991.

[2]  James J. Chen,et al.  Ensemble methods for classification of patients for personalized medicine with high-dimensional data , 2007, Artif. Intell. Medicine.

[3]  Abdennaceur Kachouri,et al.  HRV analysis using wavelet package transform and Least Square Support Vector Machine , .

[4]  Jesse Davis,et al.  Machine Learning for Personalized Medicine : Will This Drug Give Me a Heart Attack ? , 2008 .

[5]  Alan Julian Izenman,et al.  Modern Multivariate Statistical Techniques , 2008 .

[6]  Jacques Bélair,et al.  Dynamical disease : mathematical analysis of human illness , 1995 .

[7]  John H. Maindonald,et al.  Modern Multivariate Statistical Techniques: Regression, Classification and Manifold Learning , 2009 .

[8]  Karel Hana,et al.  FRACTAL AND MULTIFRACTAL PROPERTIES OF HEARTBEAT INTERVAL SERIES IN EXTREMAL STATES OF THE HUMAN ORGANISM , 2003 .

[9]  G. Rätsch Robust Boosting via Convex Optimization , 2001 .

[10]  Martin P. Paulus,et al.  Increased amygdala activation is related to heart rate during emotion processing in adolescent subjects , 2007, Neuroscience Letters.

[11]  Pere Caminal,et al.  Methods derived from nonlinear dynamics for analysing heart rate variability , 2009, Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences.

[12]  V. C. Veera Reddy,et al.  ANN FOR CLASSIFICATION OF CARDIAC ARRHYTHMIAS , 2008 .

[13]  George D. Magoulas,et al.  Machine Learning in Medical Applications , 2001, Machine Learning and Its Applications.

[14]  C. A. Perry,et al.  Knowledge bases in medicine: a review. , 1990, Bulletin of the Medical Library Association.

[15]  V. Gavrishchaka BOOSTING-BASED FRAMEWORK FOR PORTFOLIO STRATEGY DISCOVERY AND OPTIMIZATION , 2006 .

[16]  Gunnar Rätsch,et al.  Robust Boosting via Convex Optimization: Theory and Applications , 2007 .

[17]  T. Wood Brain and body in sport and exercise , 2006, British Journal of Sports Medicine.

[18]  Shimon Edelman,et al.  Representation and recognition in vision , 1999 .

[19]  Shimon Ullman,et al.  Single-example Learning of Novel Classes using Representation by Similarity , 2005, BMVC.

[20]  James R. Evans,et al.  Introduction to Quantitative EEG and Neurofeedback , 2011 .

[21]  Stephen W Porges,et al.  Electroencephalogram and heart rate regulation to familiar and unfamiliar people in children with autism spectrum disorders. , 2009, Child development.

[22]  Pierre Baldi,et al.  Bioinformatics - the machine learning approach (2. ed.) , 2000 .

[23]  Radford M. Neal Pattern Recognition and Machine Learning , 2007, Technometrics.

[24]  Alan Julian Izenman,et al.  Modern Multivariate Statistical Techniques: Regression, Classification, and Manifold Learning , 2008 .

[25]  Valeriy V. Gavrishchaka,et al.  Boosting-Based Frameworks in Financial Modeling: Application to Symbolic Volatility Forecasting , 2006 .

[26]  V. V. Gavrishchaka,et al.  Discovery Of Multi-component PortfolioStrategies With Continuous Tuning To TheChanging Market Micro-regimes UsingInput-dependent Boosting , 2008 .

[27]  Evian Gordon,et al.  Integrative Neuroscience , 2003 .

[28]  John A. Springer,et al.  Drug safety assurance through clinical genotyping: near-term considerations for a system-wide implementation of personalized medicine. , 2008, Personalized medicine.