Paper information - A Case for Serverless Machine Learning

The scale and complexity of ML workflows make it hard to provision and manage resources, a burden for ML practitioners that hinders both their productivity and effectiveness. Encouragingly, however, serverless computing has recently emerged as a compelling solution to the general problem of data center resource management. This work analyzes the resource management problem in the specific context of ML workloads and explores a research direction that leverages serverless infrastructures to automate the management of resources for ML workflows. We make a case for a serverless machine learning framework specialized for both serverless infrastructures and ML workflows, and argue that either specialization in isolation is insufficient.

Joao Carreira
