Automatic Discovery of the Statistical Types of Variables in a Dataset

Humboldt Research Fellowship for Postdoctoral Researchers, which funded this research during her stay at the Max Planck Institute for Software Systems. ATI Grant EP/N510129/1; EPSRC Grant EP/N014162/1; Google.
