Performance Analysis of Google Colaboratory as a Tool for Accelerating Deep Learning Applications

Google Colaboratory (also known as Colab) is a cloud service based on Jupyter Notebooks that supports machine learning education and research. It provides a runtime fully configured for deep learning and free-of-charge access to a robust GPU. This paper presents a detailed analysis of Colaboratory regarding hardware resources, performance, and limitations. The analysis is carried out by using Colaboratory to accelerate deep learning applications for computer vision and other GPU-centric workloads. The chosen test cases are a parallel tree-based combinatorial search and two computer vision applications: object detection/classification and object localization/segmentation. The hardware backing the accelerated runtime is compared with a mainstream workstation and a robust Linux server equipped with 20 physical cores. Results show that the performance reached using this cloud service is equivalent to that of the dedicated testbeds, given similar resources. Thus, the service can be effectively exploited to accelerate not only deep learning but also other classes of GPU-centric applications. For instance, training a CNN on Colaboratory's accelerated runtime is faster than training it on the 20 physical cores of the Linux server. The GPU made available by Colaboratory may be sufficient for several profiles of researchers and students. However, these free-of-charge hardware resources are far from enough to solve demanding real-world problems and are not scalable. The most significant limitation found is the small number of CPU cores. Finally, several strengths and limitations of this cloud service are discussed, which may help potential users decide whether the service meets their needs.
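
The GPU-versus-CPU comparison summarized above can be reproduced in spirit with a few lines of Keras code in a Colab notebook. The sketch below is illustrative only: it assumes the TensorFlow/Keras stack that Colab typically preinstalls, and it uses MNIST with a toy CNN rather than the actual benchmark models and datasets evaluated in the paper.

```python
# Minimal sketch: verify that Colab's accelerated runtime exposes a GPU and
# time a small CNN training run. Model, dataset, and epoch count are
# illustrative choices, not the paper's benchmark configuration.
import time
import tensorflow as tf

# Confirm the accelerated runtime actually provides a GPU.
print("GPUs visible:", tf.config.list_physical_devices("GPU"))

# Small CNN on MNIST, just to contrast GPU vs. CPU wall-clock time.
(x_train, y_train), _ = tf.keras.datasets.mnist.load_data()
x_train = x_train[..., None].astype("float32") / 255.0

model = tf.keras.Sequential([
    tf.keras.layers.Conv2D(32, 3, activation="relu", input_shape=(28, 28, 1)),
    tf.keras.layers.MaxPooling2D(),
    tf.keras.layers.Conv2D(64, 3, activation="relu"),
    tf.keras.layers.MaxPooling2D(),
    tf.keras.layers.Flatten(),
    tf.keras.layers.Dense(10, activation="softmax"),
])
model.compile(optimizer="adam",
              loss="sparse_categorical_crossentropy",
              metrics=["accuracy"])

start = time.time()
model.fit(x_train, y_train, epochs=3, batch_size=128, verbose=2)
print(f"Training time: {time.time() - start:.1f} s")
```

Running the same notebook with the runtime type switched between "None" (CPU only) and "GPU" gives a rough sense of the speedup the accelerated runtime provides, although the paper's measurements were obtained with its own benchmark applications and dedicated testbeds.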
