The Data Linter: Lightweight Automated Sanity Checking for ML Data Sets
暂无分享,去创建一个
[1] Murray Hill,et al. Lint, a C Program Checker , 1978 .
[2] Jeffrey Heer,et al. Wrangler: interactive visual specification of data transformation scripts , 2011, CHI.
[3] Tova Milo,et al. Query-Oriented Data Cleaning with Oracles , 2015, SIGMOD Conference.
[4] Aaron Klein,et al. Efficient and Robust Automated Machine Learning , 2015, NIPS.
[5] Sanjay Krishnan,et al. ActiveClean: Interactive Data Cleaning For Statistical Modeling , 2016, Proc. VLDB Endow..
[6] D. Sculley,et al. What’s your ML test score? A rubric for ML production systems , 2016 .
[7] Theodore Johnson,et al. Exploratory Data Mining and Data Cleaning , 2003 .
[8] Kevin Leyton-Brown,et al. Auto-WEKA: combined selection and hyperparameter optimization of classification algorithms , 2012, KDD.
[9] W. B. Roberts,et al. Machine Learning: The High Interest Credit Card of Technical Debt , 2014 .
[10] Erhard Rahm,et al. Data Cleaning: Problems and Current Approaches , 2000, IEEE Data Eng. Bull..
[11] P. Flajolet,et al. HyperLogLog: the analysis of a near-optimal cardinality estimation algorithm , 2007 .
[12] Ihab F. Ilyas,et al. Data Cleaning: Overview and Emerging Challenges , 2016, SIGMOD Conference.
[13] Neoklis Polyzotis,et al. Data Management Challenges in Production Machine Learning , 2017, SIGMOD Conference.