Disaggregating asthma: Big investigation versus big data

&NA; We are facing a major challenge in bridging the gap between identifying subtypes of asthma to understand causal mechanisms and translating this knowledge into personalized prevention and management strategies. In recent years, “big data” has been sold as a panacea for generating hypotheses and driving new frontiers of health care; the idea that the data must and will speak for themselves is fast becoming a new dogma. One of the dangers of ready accessibility of health care data and computational tools for data analysis is that the process of data mining can become uncoupled from the scientific process of clinical interpretation, understanding the provenance of the data, and external validation. Although advances in computational methods can be valuable for using unexpected structure in data to generate hypotheses, there remains a need for testing hypotheses and interpreting results with scientific rigor. We argue for combining data‐ and hypothesis‐driven methods in a careful synergy, and the importance of carefully characterized birth and patient cohorts with genetic, phenotypic, biological, and molecular data in this process cannot be overemphasized. The main challenge on the road ahead is to harness bigger health care data in ways that produce meaningful clinical interpretation and to translate this into better diagnoses and properly personalized prevention and treatment plans. There is a pressing need for cross‐disciplinary research with an integrative approach to data science, whereby basic scientists, clinicians, data analysts, and epidemiologists work together to understand the heterogeneity of asthma.

[1]  A. Pickles,et al.  Joint modeling of parentally reported and physician-confirmed wheeze identifies children with persistent troublesome wheezing. , 2013, The Journal of allergy and clinical immunology.

[2]  B. Levy,et al.  Benefits to health care professionals and patients with diabetes of a novel blood glucose meter that provides pattern recognition and real-time automatic messaging compared to conventional paper logbooks , 2015 .

[3]  Iain Buchan,et al.  The Study Team for Early Life Asthma Research (STELAR) consortium ‘Asthma e-lab’: team science bringing data, methods and investigators together , 2015, Thorax.

[4]  C. Auffray,et al.  Personalized respiratory medicine: exploring the horizon, addressing the issues. Summary of a BRN-AJRCCM workshop held in Barcelona on June 12, 2014. , 2015, American journal of respiratory and critical care medicine.

[5]  Cosma Rohilla Shalizi,et al.  Philosophy and the practice of Bayesian statistics. , 2010, The British journal of mathematical and statistical psychology.

[6]  Jason H. Moore,et al.  Big Data analysis on autopilot? , 2013, BioData Mining.

[7]  S. Schneeweiss Learning from big health care data. , 2014, The New England journal of medicine.

[8]  A. Custovic,et al.  Environmental allergen exposure, sensitisation and asthma: from whole populations to individuals at risk , 2004, Thorax.

[9]  S. Goodman,et al.  Causal inference in public health. , 2013, Annual review of public health.

[10]  W. Busse,et al.  Biomarkers in asthmatic patients: Has their time come to direct treatment? , 2016, The Journal of allergy and clinical immunology.

[11]  A. Pickles,et al.  Assessing the association of early life antibiotic prescription with asthma exacerbations, impaired antiviral immunity, and genetic variants in 17q21: a population-based birth cohort study. , 2014, The Lancet. Respiratory medicine.

[12]  C. Laprise,et al.  The genetics of asthma and allergic diseases: pieces of the puzzle are starting to come together. , 2013, Current Opinion in Allergy and Clinical Immunology.

[13]  D. Strachan,et al.  Associations of wheezing phenotypes in the first 6 years of life with atopy, lung function and airway responsiveness in mid-childhood , 2008, Thorax.

[14]  A. Custovic,et al.  Role of the indoor environment in determining the severity of asthma. , 1998, Thorax.

[15]  David J. C. MacKay,et al.  Variational Gaussian process classifiers , 2000, IEEE Trans. Neural Networks Learn. Syst..

[16]  G. Anderson,et al.  Endotyping asthma: new insights into key pathogenic mechanisms in a complex, heterogeneous disease , 2008, The Lancet.

[17]  Thomas Hofmann,et al.  Unsupervised Learning by Probabilistic Latent Semantic Analysis , 2004, Machine Learning.

[18]  C. Kuehni,et al.  Asthma phenotypes in childhood: conceptual thoughts on stability and transition , 2016, European Respiratory Journal.

[19]  C. Mills,et al.  Patterns of IgE responses to multiple allergen components and clinical symptoms at age 11 years , 2015, The Journal of allergy and clinical immunology.

[20]  S. Holgate,et al.  Characterization of wheezing phenotypes in the first 10 years of life , 2003, Clinical and experimental allergy : journal of the British Society for Allergy and Clinical Immunology.

[21]  L. Cosmides,et al.  Are humans good intuitive statisticians after all? Rethinking some conclusions from the literature on judgment under uncertainty , 1996, Cognition.

[22]  Teri A. Manolio,et al.  Bringing genome-wide association findings into clinical use , 2013, Nature Reviews Genetics.

[23]  Suchi Saria,et al.  Subtyping: What It is and Its Role in Precision Medicine , 2015, IEEE Intelligent Systems.

[24]  M. Wjst,et al.  Genome-wide association studies in asthma: what they really told us about pathogenesis , 2013, Current opinion in allergy and clinical immunology.

[25]  Peter J. F. Lucas,et al.  Exploiting causal functional relationships in Bayesian network modelling for personalised healthcare , 2014, Int. J. Approx. Reason..

[26]  Adnan Custovic,et al.  Asthma endotypes: a new approach to classification of disease entities within the asthma syndrome. , 2011, The Journal of allergy and clinical immunology.

[27]  D. Postma,et al.  Patterns of Growth and Decline in Lung Function in Persistent Childhood Asthma. , 2016, The New England journal of medicine.

[28]  J. Simpson,et al.  Change in the manifestations of asthma and asthma-related traits in childhood: a latent transition analysis , 2015, European Respiratory Journal.

[29]  Heekuck Oh,et al.  Neural Networks for Pattern Recognition , 1993, Adv. Comput..

[30]  A. Woodcock,et al.  Effect of day care attendance on sensitization and atopic wheezing differs by Toll-like receptor 2 genotype in 2 population-based birth cohort studies. , 2011, The Journal of allergy and clinical immunology.

[31]  J. Winn,et al.  Multiple atopy phenotypes and their associations with asthma: similar findings from two birth cohorts , 2013, Allergy.

[32]  Andrew Thomas,et al.  WinBUGS - A Bayesian modelling framework: Concepts, structure, and extensibility , 2000, Stat. Comput..

[33]  B. Ripley,et al.  Pattern Recognition , 1968, Nature.

[34]  Bernhard Schölkopf,et al.  A tutorial on support vector regression , 2004, Stat. Comput..

[35]  Daniel J. Jackson,et al.  Cadherin-related family member 3, a childhood asthma susceptibility gene product, mediates rhinovirus C binding and replication , 2015, Proceedings of the National Academy of Sciences.

[36]  Nicola A Hanania,et al.  Lebrikizumab treatment in adults with asthma. , 2011, The New England journal of medicine.

[37]  Ben Goldacre,et al.  Big health data: the need to earn public trust , 2016, British Medical Journal.

[38]  Neil D. Lawrence,et al.  Gaussian Process Latent Variable Models for Visualisation of High Dimensional Data , 2003, NIPS.

[39]  S Matthysse,et al.  The genetic transmission of schizophrenia: application of Mendelian latent structure analysis to eye tracking dysfunctions in schizophrenia and affective disorder. , 1986, Journal of psychiatric research.

[40]  Stan Lipovetsky,et al.  Generalized Latent Variable Modeling: Multilevel,Longitudinal, and Structural Equation Models , 2005, Technometrics.

[41]  Ayne,et al.  ASTHMA AND WHEEZING IN THE FIRST SIX YEARS OF LIFE , 1995 .

[42]  Alberto Maffey,et al.  [Rhinovirus wheezing illness and genetic risk of childhood-onset asthma]. , 2013, Archivos argentinos de pediatria.

[43]  Cécile Viboud,et al.  Reassessing Google Flu Trends Data for Detection of Seasonal and Pandemic Influenza: A Comparative Epidemiological Study at Three Geographic Scales , 2013, PLoS Comput. Biol..

[44]  J. Berger The case for objective Bayesian analysis , 2006 .

[45]  D. Ownby Pediatric asthma and development of atopy , 2001, Current opinion in allergy and clinical immunology.

[46]  Matthew West,et al.  Bayesian factor regression models in the''large p , 2003 .

[47]  A. Custovic,et al.  Identification of Asthma Subtypes Using Clustering Methodologies , 2016, Pulmonary Therapy.

[48]  Carole A. Goble,et al.  Why Linked Data is Not Enough for Scientists , 2010, 2010 IEEE Sixth International Conference on e-Science.

[49]  Andrew Rambaut,et al.  Real-time digital pathogen surveillance — the time is now , 2015, Genome Biology.

[50]  Christopher M. Bishop,et al.  Model-based machine learning , 2013, Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences.

[51]  A. Custovic,et al.  Gene–environment interactions in the development of asthma and atopy , 2012, Expert review of respiratory medicine.

[52]  J. Sterne,et al.  Associations of Different Phenotypes of Wheezing Illness in Early Childhood with Environmental Variables Implicated in the Aetiology of Asthma , 2012, PLoS ONE.

[53]  I. Buchan,et al.  Trajectories of lung function during childhood. , 2014, American journal of respiratory and critical care medicine.

[54]  W. Morgan,et al.  Asthma and wheezing in the first six years of life. The Group Health Medical Associates. , 1995, The New England journal of medicine.

[55]  B. Muthén BEYOND SEM: GENERAL LATENT VARIABLE MODELING , 2002 .

[56]  Judy H. Cho,et al.  Finding the missing heritability of complex diseases , 2009, Nature.

[57]  H. Inskip,et al.  Validation of novel wheeze phenotypes using longitudinal airway function and atopic sensitization data in the first 6 years of life: Evidence from the Southampton Women's survey , 2013, Pediatric pulmonology.

[58]  I. Buchan,et al.  Evolution pathways of IgE responses to grass and mite allergens throughout childhood. , 2015, The Journal of allergy and clinical immunology.

[59]  A. Custovic,et al.  Methylation of IL‐2 promoter at birth alters the risk of asthma exacerbations during childhood , 2013, Clinical and experimental allergy : journal of the British Society for Allergy and Clinical Immunology.

[60]  J. Brehm,et al.  Genome-wide expression profiles identify potential targets for gene-environment interactions in asthma severity. , 2015, The Journal of allergy and clinical immunology.

[61]  P. Gergen,et al.  Preseasonal treatment with either omalizumab or an inhaled corticosteroid boost to prevent fall asthma exacerbations. , 2015, The Journal of allergy and clinical immunology.

[62]  M. Ege,et al.  Clinical and epidemiologic phenotypes of childhood asthma. , 2013, American journal of respiratory and critical care medicine.

[63]  P M Bentler,et al.  Structural equation models in medical research , 1992, Statistical methods in medical research.

[64]  Jeremy Wildfire,et al.  Seasonal risk factors for asthma exacerbations among inner-city children. , 2015, The Journal of allergy and clinical immunology.

[65]  W. Morgan,et al.  Long-term outcomes of early-onset wheeze and asthma. , 2012, The Journal of allergy and clinical immunology.

[66]  A. Lowe,et al.  Early-life risk factors for childhood wheeze phenotypes in a high-risk birth cohort. , 2014, The Journal of pediatrics.

[67]  D B Dunson,et al.  Commentary: practical advantages of Bayesian analysis of epidemiologic data. , 2001, American journal of epidemiology.

[68]  I. Buchan,et al.  Challenges in identifying asthma subgroups using unsupervised statistical learning techniques. , 2013, American journal of respiratory and critical care medicine.

[69]  D. Postma,et al.  Transient early wheeze and lung function in early childhood associated with chronic obstructive pulmonary disease genes. , 2014, The Journal of allergy and clinical immunology.

[70]  Thomas Blicher,et al.  A genome-wide association study identifies CDHR3 as a susceptibility locus for early childhood asthma with severe exacerbations , 2013, Nature Genetics.

[71]  Zhou Yong,et al.  ESP "Smart Flow" Integrates Quality and Control Data for Diagnostics and Optimization in Real Time , 2013 .

[72]  Ian D Pavord,et al.  Mepolizumab for severe eosinophilic asthma (DREAM): a multicentre, double-blind, placebo-controlled trial , 2012, The Lancet.

[73]  David Barber,et al.  Bayesian Classification With Gaussian Processes , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[74]  Kevin P. Murphy,et al.  Machine learning - a probabilistic perspective , 2012, Adaptive computation and machine learning series.

[75]  Magnus Rattray,et al.  Distinguishing Asthma Phenotypes Using Machine Learning Approaches , 2015, Current Allergy and Asthma Reports.

[76]  Iain Buchan,et al.  Developmental Profiles of Eczema , Wheeze , and Rhinitis : Two Population-Based Birth Cohort Studies , 2014 .

[77]  A. Woodcock,et al.  Endotoxin exposure, CD14, and allergic disease: an interaction between genes and the environment. , 2006, American journal of respiratory and critical care medicine.

[78]  A. Custovic,et al.  Characterizing wheeze phenotypes to identify endotypes of childhood asthma, and the implications for future management , 2013, Expert review of clinical immunology.

[79]  A. Custovic,et al.  Challenges in interpreting wheeze phenotypes: the clinical implications of statistical learning techniques. , 2014, American journal of respiratory and critical care medicine.

[80]  A. Simpson,et al.  The role of lipopolysaccharide in the development of atopy in humans , 2010, Clinical and experimental allergy : journal of the British Society for Allergy and Clinical Immunology.

[81]  John K Kruschke,et al.  Bayesian data analysis. , 2010, Wiley interdisciplinary reviews. Cognitive science.

[82]  Markus Svensén,et al.  Beyond atopy: multiple patterns of sensitization in relation to asthma in a birth cohort study. , 2010, American journal of respiratory and critical care medicine.

[83]  Julien Mairal,et al.  Optimization with Sparsity-Inducing Penalties , 2011, Found. Trends Mach. Learn..