Big Data and Machine Learning Meet the Health Sciences

Big data and machine learning are gaining traction in health sciences research. They might provide predictive models for both clinical practice and public health systems. Big data is a broad term used to denote volumes of large and complex measurements. Beyond genomics and other “omic” fields, big data includes administrative, molecular, clinical, environmental, sociodemographic, and even social media information. Machine learning, also known as pattern recognition, represents a range of techniques used to analyze big data by identifying patterns of interaction among features. Compared with traditional statistical methods that provide primarily average group-level results, machine learning algorithms allow predictions and stratification of clinical outcomes at the level of an individual subject. In the present chapter, we provide a concise historical perspective of some important events in health sciences and the analytical methods used to find causes and treatment of illnesses. The overall aim is to understand why big data and machine learning have recently become promising methods to define, predict, and treat illnesses, and how they can transform the way we conceptualize care in health sciences.

[1]  E. Finkelstein,et al.  Development and Validation of a Deep Learning System for Diabetic Retinopathy and Related Eye Diseases Using Retinal Images From Multiethnic Populations With Diabetes , 2017, JAMA.

[2]  Nasser M. Nasrabadi,et al.  Pattern Recognition and Machine Learning , 2006, Technometrics.

[3]  B. Mwangi,et al.  The impact of machine learning techniques in the study of bipolar disorder: A systematic review , 2017, Neuroscience & Biobehavioral Reviews.

[4]  R. Kessler,et al.  How well can post‐traumatic stress disorder be predicted from pre‐trauma risk factors? An exploratory study in the WHO World Mental Health Surveys , 2014, World psychiatry : official journal of the World Psychiatric Association.

[5]  T. Insel Digital Phenotyping: Technology for a New Science of Behavior. , 2017, JAMA.

[6]  R. Cho,et al.  Treatment response prediction and individualized identification of first-episode drug-naïve schizophrenia using brain functional connectivity , 2018, Molecular Psychiatry.

[7]  Marcia K. Johnson,et al.  Cross-trial prediction of treatment outcome in depression: a machine learning approach. , 2016, The lancet. Psychiatry.

[8]  E. Gotuzzo,et al.  The greatest steps towards the discovery of Vibrio cholerae. , 2014, Clinical microbiology and infection : the official publication of the European Society of Clinical Microbiology and Infectious Diseases.

[9]  M Susser,et al.  Choosing a future for epidemiology: I. Eras and paradigms. , 1996, American journal of public health.

[10]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[11]  B. Mwangi,et al.  Volumetric brain magnetic resonance imaging predicts functioning in bipolar disorder: A machine learning approach. , 2018, Journal of psychiatric research.

[12]  A. Turing On Computable Numbers, with an Application to the Entscheidungsproblem. , 1937 .

[13]  B. Mwangi,et al.  Individualized identification of euthymic bipolar disorder using the Cambridge Neuropsychological Test Automated Battery (CANTAB) and machine learning. , 2016, Journal of affective disorders.

[14]  A. M. Turing,et al.  Computing Machinery and Intelligence , 1950, The Philosophy of Artificial Intelligence.

[15]  Ives Cavalcante Passos,et al.  Big data analytics and machine learning: 2015 and beyond. , 2016, The lancet. Psychiatry.

[16]  Demis Hassabis,et al.  Mastering the game of Go without human knowledge , 2017, Nature.

[17]  Ives Cavalcante Passos,et al.  Peripheral biomarker signatures of bipolar disorder and schizophrenia: A machine learning approach , 2017, Schizophrenia Research.

[18]  T. Greenhalgh,et al.  Evidence based medicine: a movement in crisis? , 2014, BMJ : British Medical Journal.

[19]  Sander Klous,et al.  We are Big Data , 2016 .

[20]  A. Caspi,et al.  Role of Genotype in the Cycle of Violence in Maltreated Children , 2002, Science.

[21]  Sander Klous,et al.  We are Big Data: The Future of the Information Society , 2016 .

[23]  Khader M Hasan,et al.  Individualized Prediction and Clinical Staging of Bipolar Disorders using Neuroanatomical Biomarkers. , 2016, Biological psychiatry. Cognitive neuroscience and neuroimaging.

[24]  Eun-Young Kim Patient Will See You Now: The Future of Medicine is in Your Hands , 2015, Healthcare Informatics Research.

[25]  A. Detsky,et al.  Evidence-based medicine. A new approach to teaching the practice of medicine. , 1992, JAMA.

[26]  A. Duffy,et al.  Changing the bipolar illness trajectory. , 2017, The lancet. Psychiatry.

[27]  Khader M. Hasan,et al.  Identification and individualized prediction of clinical phenotypes in bipolar disorders using neurocognitive data, neuroimaging scans and machine learning , 2017, NeuroImage.

[28]  Dimitris Mavridis,et al.  Comparative efficacy and tolerability of 15 antipsychotic drugs in schizophrenia: a multiple-treatments meta-analysis , 2013, The Lancet.

[29]  Z. Obermeyer,et al.  Predicting the Future - Big Data, Machine Learning, and Clinical Medicine. , 2016, The New England journal of medicine.

[30]  Deviant Children Grown Up; A Sociological and Psychiatric Study of Sociopathic Personality , 1967 .