Wisteria: Nurturing Scalable Data Cleaning Infrastructure
暂无分享,去创建一个
Sanjay Krishnan | Eugene Wu | Michael J. Franklin | Jiannan Wang | Daniel Haas | S. Krishnan | D. Haas | M. Franklin | Eugene Wu | Jiannan Wang
[1] Sunil Prabhakar,et al. ERACER: a database approach for statistical inference and data cleaning , 2010, SIGMOD Conference.
[2] Jeffrey Heer,et al. Wrangler: interactive visual specification of data transformation scripts , 2011, CHI.
[3] Jeffrey Heer,et al. Enterprise Data Analysis and Visualization: An Interview Study , 2012, IEEE Transactions on Visualization and Computer Graphics.
[4] Ahmed Eldawy,et al. NADEEF: a commodity data cleaning system , 2013, SIGMOD '13.
[5] Ruben Verborgh,et al. Using OpenRefine , 2013 .
[6] Michael Stonebraker,et al. Data Curation at Scale: The Data Tamer System , 2013, CIDR.
[7] Jeffrey F. Naughton,et al. Corleone: hands-off crowdsourcing for entity matching , 2014, SIGMOD Conference.
[8] Jennifer Widom,et al. CrowdFill: collecting structured data from the crowd , 2014, SIGMOD Conference.
[9] Zhe Chen,et al. Integrating spreadsheet data via accurate and low-effort extraction , 2014, KDD.
[10] Tim Kraska,et al. A sample-and-clean framework for fast and accurate query processing on dirty data , 2014, SIGMOD Conference.
[11] Ion Stoica,et al. The Power of Choice in Data-Aware Cluster Scheduling , 2014, OSDI.
[12] Tim Kraska,et al. Stale View Cleaning: Getting Fresh Answers from Stale Materialized Views , 2015, Proc. VLDB Endow..