Big data: the next challenge for statistics

This paper focuses on the pivotal role that statisticians are challenged to undertake in the Big Data era. Their traditional work of managing variability, complexity, and hidden information is indeed made extremely more complex by the enormous volume of a large variety of data that new technologies are able to provide at high velocity. In detail, the paper briefly discusses few paradigmatic cases of analysis of Big Data in which theoretical, methodological and computational aspects have been fruitfully integrated with specific competences from industry, biology, and finance.

[1]  Francesca Ieva,et al.  Semiparametric Bayesian models for clustering and classification in the presence of unbalanced in‐hospital survival , 2014 .

[2]  Francesca Ieva,et al.  Detecting and visualizing outliers in provider profiling via funnel plots and mixed effect models , 2014, Health Care Management Science.

[3]  Simone Vantini,et al.  Discovering Spatiotemporal Patterns of Urban Life from Mobile Data: an Exploration through Hierarchical Independent Component Analysis. , 2013 .

[4]  Rosalba Radice,et al.  A Semiparametric Bivariate Probit Model for Joint Modeling of Outcomes in STEMI Patients , 2014, Comput. Math. Methods Medicine.

[5]  Francesca Ieva,et al.  Process indicators and outcome measures in the treatment of acute myocardial infarction patients , 2012 .

[6]  Francesca Ieva,et al.  Hospital Clustering in the Treatment of Acute Myocardial Infarction Patients Via a Bayesian Semiparametric Approach , 2013, Statistical Models for Data Analysis.

[7]  Simone Vantini,et al.  Treelet Decomposition of Mobile Phone Data for Deriving City Usage and Mobility Pattern in the Milan Urban Region , 2015 .

[8]  Francesca Ieva,et al.  Designing and Mining a Multicenter Observational Clinical Registry Concerning Patients with Acute Coronary Syndromes , 2013 .

[9]  Francesca Ieva,et al.  Exploitation, integration and statistical analysis of the Public Health Database and STEMI Archive in the Lombardia region , 2010 .

[10]  Simone Vantini,et al.  Measuring Downsize Reputational Risk in the Oil & Gas Industry , 2015 .

[11]  Anna Maria Paganoni,et al.  Advances in complex data modeling and computational methods in statistics , 2015 .

[12]  Francesca Ieva,et al.  Performance assessment using mixed effects models: a case study on coronary patient care , 2012 .

[13]  Francesca Ieva,et al.  Mining Administrative Health Databases for Epidemiological Purposes: A Case Study on Acute Myocardial Infarctions Diagnoses , 2013 .