The Case for Unifying Data Loading in Machine Learning Clusters
Amar Phanishayee | Abhay Venkatesh | Aarati Kakaraparthy | Shivaram Venkataraman
[1] Michael I. Jordan,et al. Breaking Locality Accelerates Block Gauss-Seidel , 2017, ICML.
[2] John Wilkes,et al. An introduction to disk drive modeling , 1994, Computer.
[3] Ameet Talwalkar,et al. Hyperband: A Novel Bandit-Based Approach to Hyperparameter Optimization , 2016, J. Mach. Learn. Res.
[4] Scott Shenker,et al. Disk-Locality in Datacenter Computing Considered Irrelevant , 2011, HotOS.
[5] Rina Panigrahy,et al. Design Tradeoffs for SSD Performance , 2008, USENIX ATC.
[6] Geoffrey E. Hinton,et al. ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.
[7] Haichen Shen,et al. TVM: An Automated End-to-End Optimizing Compiler for Deep Learning , 2018, OSDI.
[8] Luis Perez,et al. The Effectiveness of Data Augmentation in Image Classification using Deep Learning , 2017, ArXiv.
[9] Lenin Ravindranath,et al. Nectar: Automatic Management of Data and Computation in Datacenters , 2010, OSDI.
[10] Michael Chow,et al. DQBarge: Improving Data-Quality Tradeoffs in Large-Scale Internet Services , 2016, OSDI.
[11] Jeffrey Scott Vitter,et al. Random sampling with a reservoir , 1985, TOMS.
[12] Dan Williams,et al. Platform Storage Performance With 3D XPoint Technology , 2017, Proceedings of the IEEE.
[13] Tie-Yan Liu,et al. Convergence Analysis of Distributed Stochastic Gradient Descent with Shuffling , 2017, Neurocomputing.
[14] Jon Howell,et al. Flat Datacenter Storage , 2012, OSDI.
[15] Yuan Yu,et al. TensorFlow: A system for large-scale machine learning , 2016, OSDI.
[16] Pietro Perona,et al. Microsoft COCO: Common Objects in Context , 2014, ECCV.
[17] Stephen J. Wright,et al. Randomness and permutations in coordinate descent methods , 2018, Math. Program.
[18] Alex Krizhevsky,et al. Learning Multiple Layers of Features from Tiny Images , 2009 .
[19] Olatunji Ruwase,et al. HyperDrive: exploring hyperparameters with POP scheduling , 2017, Middleware.
[20] Luca Antiga,et al. Automatic differentiation in PyTorch , 2017 .