Synergy of physics-based reasoning and machine learning in biomedical applications: towards unlimited deep learning with limited data

ABSTRACT Technological advancements enable collecting vast data, i.e., Big Data, in science and industry including biomedical field. Increased computational power allows expedient analysis of collected data using statistical and machine-learning approaches. Historical data incompleteness problem and curse of dimensionality diminish practical value of pure data-driven approaches, especially in biomedicine. Advancements in deep learning (DL) frameworks based on deep neural networks (DNN) improved accuracy in image recognition, natural language processing, and other applications yet severe data limitations and/or absence of transfer-learning-relevant problems drastically reduce advantages of DNN-based DL. Our earlier works demonstrate that hierarchical data representation can be alternatively implemented without NN, using boosting-like algorithms for utilization of existing domain knowledge, tolerating significant data incompleteness, and boosting accuracy of low-complexity models within the classifier ensemble, as illustrated in physiological-data analysis. Beyond obvious use in initial-factor selection, existing simplified models are effectively employed for generation of realistic synthetic data for later DNN pre-training. We review existing machine learning approaches, focusing on limitations caused by training-data incompleteness. We outline our hybrid framework that leverages existing domain-expert models/knowledge, boosting-like model combination, DNN-based DL and other machine learning algorithms for drastic reduction of training-data requirements. Applying this framework is illustrated in context of analyzing physiological data. GRAPHICAL ABSTRACT

[1]  Andrey Kazennov,et al.  The cornucopia of meaningful leads: Applying deep adversarial autoencoders for new molecule development in oncology , 2016, Oncotarget.

[2]  A. Yang,et al.  Heart Rate Variability in Psychiatric Disorders , 2010 .

[3]  Zhenyi Yang,et al.  Advantages of Hybrid Deep Learning Frameworks in Applications with Limited Data , 2018, International Journal of Machine Learning and Computing.

[4]  Alberto Cano,et al.  A survey on graphic processing unit computing for large‐scale data mining , 2018, WIREs Data Mining Knowl. Discov..

[5]  Madalena Costa,et al.  Multiscale entropy analysis of biological signals. , 2005, Physical review. E, Statistical, nonlinear, and soft matter physics.

[6]  J. Elder The Generalization Paradox of Ensembles , 2003 .

[7]  K. Kaski,et al.  Dynamics of market correlations: taxonomy and portfolio analysis. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[8]  Valeriy V. Gavrishchaka,et al.  Optimization of the neural-network geomagnetic model for forecasting large-amplitude substorm events , 2001 .

[9]  V. Gavrishchaka,et al.  Multimoment convecting flux tube model of the polar wind system with return current and microprocesses , 2007 .

[10]  W. Horton,et al.  A low‐dimensional energy‐conserving state space model for substorm dynamics , 1996 .

[11]  Geoffrey E. Hinton,et al.  Reducing the Dimensionality of Data with Neural Networks , 2006, Science.

[12]  Yan Liu,et al.  Distilling Knowledge from Deep Networks with Applications to Healthcare Domain , 2015, ArXiv.

[13]  Yoshihiro Miyake,et al.  Relationship between fractal property of gait cycle and severity of Parkinson's disease , 2011, 2011 IEEE/SICE International Symposium on System Integration (SII).

[14]  George Cybenko,et al.  Approximation by superpositions of a sigmoidal function , 1989, Math. Control. Signals Syst..

[15]  C. Maranas,et al.  Recent advances in the reconstruction of metabolic models and integration of omics data. , 2014, Current opinion in biotechnology.

[16]  Jeffrey M. Hausdorff,et al.  Dynamic markers of altered gait rhythm in amyotrophic lateral sclerosis. , 2000, Journal of applied physiology.

[17]  Arpan Kumar Kar,et al.  Big Data Analytics: A Review on Theoretical Contributions and Tools Used in Literature , 2017, Global Journal of Flexible Systems Management.

[18]  Zongxu Pan,et al.  Transfer Learning with Deep Convolutional Neural Network for SAR Target Classification with Limited Labeled Data , 2017, Remote. Sens..

[19]  Danuta Makowiec,et al.  Scale Invariant Properties in Heart Rate Signals , 2006 .

[20]  Heart Rate Variability Standards of Measurement , Physiological Interpretation , and Clinical Use Task Force of the European Society of Cardiology the North American Society of Pacing Electrophysiology , 2022 .

[21]  J. A. Stewart,et al.  Nonlinear Time Series Analysis , 2015 .

[22]  Valeriy V. Gavrishchaka,et al.  Ensemble Decomposition Learning for Optimal Utilization of Implicitly Encoded Knowledge in Biomedical Applications , 2011 .

[23]  Jeffrey M. Hausdorff,et al.  Altered fractal dynamics of gait: reduced stride-interval correlations with aging and Huntington's disease. , 1997, Journal of applied physiology.

[24]  Ronald M. Summers,et al.  Deep Convolutional Neural Networks for Computer-Aided Detection: CNN Architectures, Dataset Characteristics and Transfer Learning , 2016, IEEE Transactions on Medical Imaging.

[25]  Pere Caminal,et al.  Methods derived from nonlinear dynamics for analysing heart rate variability , 2009, Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences.

[26]  Geoffrey E. Hinton,et al.  Learning representations by back-propagating errors , 1986, Nature.

[27]  V. Gavrishchaka BOOSTING-BASED FRAMEWORK FOR PORTFOLIO STRATEGY DISCOVERY AND OPTIMIZATION , 2006 .

[28]  J. Friedman Greedy function approximation: A gradient boosting machine. , 2001 .

[29]  Valeriy V. Gavrishchaka,et al.  Ensemble Learning Frameworks for the Discovery of Multi-component Quantitative Models in Biomedical Applications , 2010, 2010 Second International Conference on Computer Modeling and Simulation.

[30]  Adrienne L. Fairhall,et al.  Analysis of Neuronal Spike Trains, Deconstructed , 2016, Neuron.

[31]  Olga V. Senyukova,et al.  Generic Ensemble-Based Representation of Global Cardiovascular Dynamics for Personalized Treatment Discovery and Optimization , 2016, ICCCI.

[32]  J. Friedman Special Invited Paper-Additive logistic regression: A statistical view of boosting , 2000 .

[33]  Yoav Freund,et al.  A decision-theoretic generalization of on-line learning and an application to boosting , 1995, EuroCOLT.

[34]  Tom Chau,et al.  An empirical examination of detrended fluctuation analysis for gait data. , 2010, Gait & posture.

[35]  Spiros C. Denaxas,et al.  Big biomedical data and cardiovascular disease research: opportunities and challenges. , 2015, European heart journal. Quality of care & clinical outcomes.

[36]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[37]  Euclid Seeram,et al.  Big Data: The Next Era of Informatics and Data Science in Medical Imaging- A Literature Review , 2018 .

[38]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[39]  Ila Rani Fiete,et al.  Learning and coding in biological neural networks , 2003 .

[40]  Gunnar Rätsch,et al.  Robust Boosting via Convex Optimization: Theory and Applications , 2007 .

[41]  Karel Hana,et al.  FRACTAL AND MULTIFRACTAL PROPERTIES OF HEARTBEAT INTERVAL SERIES IN EXTREMAL STATES OF THE HUMAN ORGANISM , 2003 .

[42]  Valeriy V. Gavrishchaka,et al.  Universal Multi-complexity Measures for Physiological State Quantification in Intelligent Diagnostics and Monitoring Systems , 2013 .

[43]  Olga V. Senyukova,et al.  Multi-expert evolving system for objective psychophysiological monitoring and fast discovery of effective personalized therapies , 2017, 2017 Evolving and Adaptive Intelligent Systems (EAIS).

[44]  Yoshua Bengio,et al.  Why Does Unsupervised Pre-training Help Deep Learning? , 2010, AISTATS.

[45]  Robert E. Schapire,et al.  Design and analysis of efficient learning algorithms , 1992, ACM Doctoral dissertation award ; 1991.

[46]  Jeffrey M. Hausdorff,et al.  Maturation of gait dynamics: stride-to-stride variability and its temporal organization in children. , 1999, Journal of applied physiology.

[47]  Yoshua Bengio,et al.  Generative Adversarial Nets , 2014, NIPS.

[48]  V. Tikhomirov On the Representation of Continuous Functions of Several Variables as Superpositions of Continuous Functions of one Variable and Addition , 1991 .

[49]  A. Akıncı,et al.  Heart rate variability in diabetic children: Sensitivity of the time- and frequency-domain methods , 1993, Pediatric Cardiology.

[50]  Valeriy V. Gavrishchaka,et al.  Robust Algorithmic Detection of Cardiac Pathologies from Short Periods of RR Data , 2013 .

[51]  Stephanie T. Lanza,et al.  Sensitivity and Specificity of Information Criteria , 2018, bioRxiv.

[52]  Radford M. Neal Pattern Recognition and Machine Learning , 2007, Technometrics.

[53]  Teuvo Kohonen,et al.  Self-Organization and Associative Memory , 1988 .

[54]  Valeriy V. Gavrishchaka,et al.  Boosting-Based Frameworks in Financial Modeling: Application to Symbolic Volatility Forecasting , 2006 .

[55]  R. Scott Evans,et al.  Automated detection of physiologic deterioration in hospitalized patients , 2015, J. Am. Medical Informatics Assoc..

[56]  B. Ripley,et al.  Pattern Recognition , 1968, Nature.

[57]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[58]  Marios Anthimopoulos,et al.  Multi-source Transfer Learning with Convolutional Neural Networks for Lung Pattern Analysis , 2016, IEEE journal of biomedical and health informatics.

[59]  Shimon Ullman,et al.  Single-example Learning of Novel Classes using Representation by Similarity , 2005, BMVC.

[60]  Eric R. Ziegel,et al.  The Elements of Statistical Learning , 2003, Technometrics.

[61]  Leslie G. Valiant,et al.  A theory of the learnable , 1984, STOC '84.

[62]  Rasmus Agren THESIS FOR THE DEGREE OF DOCTOR OF PHILOSOPHY On metabolic networks and multi-omics integration , 2013 .

[63]  G. Breithardt,et al.  Heart rate variability: standards of measurement, physiological interpretation and clinical use. Task Force of the European Society of Cardiology and the North American Society of Pacing and Electrophysiology. , 1996 .

[64]  Surya Ganguli,et al.  Exact solutions to the nonlinear dynamics of learning in deep linear neural networks , 2013, ICLR.

[65]  Ji Feng,et al.  Deep Forest: Towards An Alternative to Deep Neural Networks , 2017, IJCAI.

[66]  Valeriy V. Gavrishchaka,et al.  Support Vector Machine as an Efficient Framework for Stock Market Volatility Forecasting , 2006, Comput. Manag. Sci..

[67]  Paul J. Werbos,et al.  Backpropagation Through Time: What It Does and How to Do It , 1990, Proc. IEEE.

[68]  Jacques Bélair,et al.  Dynamical disease : mathematical analysis of human illness , 1995 .

[69]  Valeriy V. Gavrishchaka,et al.  Volatility forecasting from multiscale and high-dimensional market data , 2003, Neurocomputing.

[70]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[71]  Valeriy Gavrishchaka,et al.  Multi-complexity ensemble measures for gait time series analysis: application to diagnostics, monitoring and biometrics. , 2015, Advances in experimental medicine and biology.

[72]  Dong Yu,et al.  Deep Learning: Methods and Applications , 2014, Found. Trends Signal Process..

[73]  Tianqi Chen,et al.  XGBoost: A Scalable Tree Boosting System , 2016, KDD.

[74]  H. Stanley,et al.  Quantification of scaling exponents and crossover phenomena in nonstationary heartbeat time series. , 1995, Chaos.

[75]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[76]  W. Scott,et al.  Heart rate variability after acute traumatic brain injury in children , 2000, Critical care medicine.

[77]  Sergey Plis,et al.  Deep Learning Applications for Predicting Pharmacological Properties of Drugs and Drug Repurposing Using Transcriptomic Data. , 2016, Molecular pharmaceutics.

[78]  Florian Metze,et al.  Extracting deep bottleneck features using stacked auto-encoders , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[79]  Mathias Baumert,et al.  Heart Rate Variability, Blood Pressure Variability, and Baroreflex Sensitivity in Overtrained Athletes , 2006, Clinical journal of sport medicine : official journal of the Canadian Academy of Sport Medicine.

[80]  H. Stanley,et al.  Model for complex heart rate dynamics in health and diseases. , 2005, Physical review. E, Statistical, nonlinear, and soft matter physics.