Overton: A Data System for Monitoring and Improving Machine-Learned Products

We describe a system called Overton, whose main design goal is to support engineers in building, monitoring, and improving production machine learning systems. Key challenges engineers face are monitoring fine-grained quality, diagnosing errors in sophisticated applications, and handling contradictory or incomplete supervision data. Overton automates the life cycle of model construction, deployment, and monitoring by providing a set of novel high-level, declarative abstractions. Overton's vision is to shift developers to these higher-level tasks instead of lower-level machine learning tasks. In fact, using Overton, engineers can build deep-learning-based applications without writing any code in frameworks like TensorFlow. For over a year, Overton has been used in production to support multiple applications in both near-real-time applications and back-of-house processing. In that time, Overton-based applications have answered billions of queries in multiple languages and processed trillions of records, reducing errors 1.7-2.9 times versus production systems.

Christopher Ré | Feng Niu | Pallavi Gudipati | Charles Srisuwananukorn
