Exascale Deep Learning to Accelerate Cancer Research

Deep learning, through the use of neural networks, has demonstrated a remarkable ability to automate many routine tasks when presented with sufficient training data. The neural network architecture (e.g., the number of layers, the types of layers, and the connections between layers) plays a critical role in determining what, if anything, the network is able to learn from the training data. The trend for neural network architectures, especially those trained on ImageNet, has been to grow ever deeper and more complex. The result has been ever-increasing accuracy on benchmark datasets at the cost of increased computational demands. In this paper we demonstrate that neural network architectures can be automatically generated and tailored to a specific application with dual objectives: accuracy of prediction and speed of prediction. Using MENNDL, an HPC-enabled software stack for neural architecture search, we generate a neural network that is 16× faster at inference than state-of-the-art networks while achieving comparable accuracy on a cancer pathology dataset. The speedup in inference is necessary because of the volume and velocity of cancer pathology data; specifically, the previous state-of-the-art networks are too slow for individual researchers without access to HPC systems to keep pace with the rate of data generation. Our new model enables researchers with modest computational resources to analyze newly generated data faster than it is collected.
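
The dual objective described above (accuracy of prediction and speed of prediction) can be illustrated with a small Pareto-front filter over candidate networks. This is a minimal sketch under stated assumptions, not MENNDL's actual selection procedure: the `Candidate` type, the network names, and all accuracy and latency values are hypothetical and chosen only for illustration.

```python
from typing import List, NamedTuple


class Candidate(NamedTuple):
    """A hypothetical candidate network with its two objective values."""
    name: str
    accuracy: float    # higher is better
    latency_ms: float  # inference time per sample; lower is better


def dominates(a: Candidate, b: Candidate) -> bool:
    """True if `a` is at least as good as `b` on both objectives
    and strictly better on at least one."""
    no_worse = a.accuracy >= b.accuracy and a.latency_ms <= b.latency_ms
    strictly_better = a.accuracy > b.accuracy or a.latency_ms < b.latency_ms
    return no_worse and strictly_better


def pareto_front(candidates: List[Candidate]) -> List[Candidate]:
    """Keep only the candidates that no other candidate dominates."""
    return [
        c for c in candidates
        if not any(dominates(other, c) for other in candidates)
    ]


# Hypothetical values, for illustration only.
candidates = [
    Candidate("net_a", accuracy=0.90, latency_ms=160.0),
    Candidate("net_b", accuracy=0.89, latency_ms=10.0),
    Candidate("net_c", accuracy=0.85, latency_ms=12.0),
    Candidate("net_d", accuracy=0.80, latency_ms=5.0),
]

front = pareto_front(candidates)
for c in front:
    print(f"{c.name}: accuracy={c.accuracy:.2f}, latency={c.latency_ms:.1f} ms")
```

In this toy example `net_c` is dropped because `net_b` is both more accurate and faster, while `net_b` stays on the front next to the slightly more accurate but much slower `net_a`. That is the kind of trade-off the abstract describes: comparable accuracy at a fraction of the inference cost.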
