Evaluating Generic Auto-ML Tools for Computational Pathology

Image analysis tasks in computational pathology are commonly solved using convolutional neural networks (CNNs). The selection of a suitable CNN architecture and hyperparameters is usually done through exploratory iterative optimization, which is computationally expensive and requires substantial manual work. The goal of this article is to evaluate how generic tools for neural network architecture search and hyperparameter optimization perform for common use cases in computational pathology. For this purpose, we evaluated one on-premises and one cloudbased tool for three di erent classi cation tasks for histological images: tissue classi cation, mutation prediction, and grading. We found that the default CNN architectures and parameterizations of the evaluated AutoML tools already yielded classi cation performance on par with the original publications. Hyperparameter optimization for these tasks did not substantially improve performance, despite the additional computational e ort. However, performance varied substantially between classi ers obtained from individual AutoML runs due to non-deterministic e ects. Generic CNN architectures and AutoML tools could thus be a viable alternative to manually optimizing CNN architectures and parametrizations. This would allow developers of software solutions for computational pathology to focus e orts on harder-to-automate tasks such as data curation.

[1]  Sergey Ioffe,et al.  Rethinking the Inception Architecture for Computer Vision , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[2]  Tsuyoshi Murata,et al.  {m , 1934, ACML.

[3]  Spyridon Bakas,et al.  Federated learning in medicine: facilitating multi-institutional collaborations without sharing patient data , 2020, Scientific Reports.

[4]  Taghi M. Khoshgoftaar,et al.  Survey on deep learning with class imbalance , 2019, J. Big Data.

[5]  Thomas de Quincey [C] , 2000, The Works of Thomas De Quincey, Vol. 1: Writings, 1799–1820.

[6]  Kaiyong Zhao,et al.  AutoML: A Survey of the State-of-the-Art , 2019, Knowl. Based Syst..

[7]  S. Imoto,et al.  HEAL: an automated deep learning framework for cancer histopathology image analysis , 2021, Bioinform..

[8]  Quoc V. Le,et al.  Efficient Neural Architecture Search via Parameter Sharing , 2018, ICML.

[9]  Stefano Marrone,et al.  Reproducibility of Deep CNN for Biomedical Image Processing Across Frameworks and Architectures , 2019, 2019 27th European Signal Processing Conference (EUSIPCO).

[10]  Georgios Feretzakis,et al.  Setting up an Easy-to-Use Machine Learning Pipeline for Medical Decision Support: A Case Study for COVID-19 Diagnosis Based on Deep Learning with CT Scans. , 2020, Studies in health technology and informatics.

[11]  Ming Y. Lu,et al.  Federated learning for computational pathology on gigapixel whole slide images , 2020, Medical Image Anal..

[12]  DeepMIB: User-friendly and open-source software for training of deep learning network for biological image segmentation , 2021, PLoS computational biology.

[13]  Daniel Nüst,et al.  Ten simple rules for writing Dockerfiles for reproducible data science , 2020, PLoS computational biology.

[14]  Chris Eliasmith,et al.  Hyperopt: a Python library for model selection and hyperparameter optimization , 2015 .

[15]  I. Zlobec,et al.  Classification of colorectal tissue images from high throughput tissue microarrays by ensemble deep learning methods , 2021, Scientific Reports.

[16]  P. Keane,et al.  Automated deep learning in ophthalmology: AI that can build AI , 2021, Current opinion in ophthalmology.

[17]  Thomas Kalinski,et al.  Lossless compression of JPEG2000 whole slide images is not required for diagnostic virtual microscopy. , 2011, American journal of clinical pathology.

[18]  P. Alam ‘S’ , 2021, Composites Engineering: An A–Z Guide.

[19]  Nico Karssemeijer,et al.  Stain Specific Standardization of Whole-Slide Histopathological Images , 2016, IEEE Transactions on Medical Imaging.

[20]  Alexandr A. Kalinin,et al.  Albumentations: fast and flexible image augmentations , 2018, Inf..

[21]  Constantine Bekas,et al.  BAGAN: Data Augmentation with Balancing GAN , 2018, ArXiv.

[22]  Andrew H. Beck,et al.  Computational pathology definitions, best practices, and recommendations for regulatory guidance: a white paper from the Digital Pathology Association , 2019, The Journal of pathology.

[23]  Jakob Nikolas Kather,et al.  Deep learning can predict microsatellite instability directly from histology in gastrointestinal cancer , 2019, Nature Medicine.

[24]  Frank Hutter,et al.  Neural Architecture Search: A Survey , 2018, J. Mach. Learn. Res..

[25]  Yaoliang Yu,et al.  Problems and Opportunities in Training Deep Learning Software Systems: An Analysis of Variance , 2020, 2020 35th IEEE/ACM International Conference on Automated Software Engineering (ASE).

[26]  W. Lee,et al.  Classification of pachychoroid disease on ultrawide-field indocyanine green angiography using auto-machine learning platform , 2020, British Journal of Ophthalmology.

[27]  Sherif Sakr,et al.  Automated Machine Learning: State-of-The-Art and Open Challenges , 2019, ArXiv.

[28]  Jakob Nikolas Kather,et al.  Deep learning in cancer pathology: a new generation of clinical biomarkers , 2020, British Journal of Cancer.

[29]  P. Alam ‘U’ , 2021, Composites Engineering: An A–Z Guide.

[30]  Shidan Wang,et al.  Pathology Image Analysis Using Segmentation Deep Learning Algorithms. , 2019, The American journal of pathology.

[31]  Jens Petersen,et al.  nnU-Net: a self-configuring method for deep learning-based biomedical image segmentation , 2020, Nature Methods.

[32]  Thomas J. Fuchs,et al.  Clinical-grade computational pathology using weakly supervised deep learning on whole slide images , 2019, Nature Medicine.

[33]  Bo Chen,et al.  MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications , 2017, ArXiv.

[34]  N. Razavian,et al.  Classification and mutation prediction from non–small cell lung cancer histopathology images using deep learning , 2018, Nature Medicine.

[35]  Vijay Vasudevan,et al.  Learning Transferable Architectures for Scalable Image Recognition , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[36]  Gaofeng Huang,et al.  Enhanced balancing GAN: minority-class image generation , 2020, Neural Computing and Applications.

[37]  Sherif Sakr,et al.  Automated Machine Learning: Techniques and Frameworks , 2019, eBISS.

[38]  Fabian Isensee,et al.  nnDetection: A Self-configuring Method for Medical Object Detection , 2021, MICCAI.

[39]  Daisuke Komura,et al.  Machine learning approaches for pathologic diagnosis , 2019, Virchows Archiv.

[40]  Shishir Chumber,et al.  Classification of chest radiographs using general purpose cloud-based automated machine learning: pilot study , 2021, Egyptian Journal of Radiology and Nuclear Medicine.

[41]  P. Alam ‘N’ , 2021, Composites Engineering: An A–Z Guide.

[42]  Samuel G. Finlayson,et al.  Code-free deep learning for multi-modality medical image classification , 2021, Nature Machine Intelligence.

[43]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[44]  Munish Puri,et al.  Automated Machine Learning Diagnostic Support System as a Computational Biomarker for Detecting Drug-Induced Liver Injury Patterns in Whole Slide Liver Pathology Images. , 2020, Assay and drug development technologies.

[45]  P. Alam,et al.  R , 1823, The Herodotus Encyclopedia.

[46]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[47]  Tony Lindsey,et al.  Automated Cardiovascular Pathology Assessment Using Semantic Segmentation and Ensemble Learning , 2020, Journal of Digital Imaging.

[48]  Claes Lundström,et al.  A Closer Look at Domain Shift for Deep Learning in Histopathology , 2019, ArXiv.

[49]  Ekaba Bisong,et al.  Building Machine Learning and Deep Learning Models on Google Cloud Platform: A Comprehensive Guide for Beginners , 2019 .

[50]  P. Alam ‘A’ , 2021, Composites Engineering: An A–Z Guide.

[51]  M. Rantalainen,et al.  Artificial intelligence as the next step towards precision pathology , 2020, Journal of internal medicine.

[52]  Shahaboddin Shamshirband,et al.  A Deep Learning Ensemble Approach for Diabetic Retinopathy Detection , 2019, IEEE Access.

[53]  A. Parwani Next generation diagnostic pathology: use of digital pathology and artificial intelligence tools to augment a pathological diagnosis , 2019, Diagnostic Pathology.

[54]  Hang Zhang,et al.  AutoGluon-Tabular: Robust and Accurate AutoML for Structured Data , 2020, ArXiv.

[55]  Francesco Ciompi,et al.  Neural Image Compression for Gigapixel Histopathology Image Analysis , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[56]  Lucas M Bachmann,et al.  Automated deep learning design for medical image classification by health-care professionals with no coding experience: a feasibility study. , 2019, The Lancet. Digital health.

[57]  Ameet Talwalkar,et al.  Random Search and Reproducibility for Neural Architecture Search , 2019, UAI.

[58]  P. Alam,et al.  H , 1887, High Explosives, Propellants, Pyrotechnics.

[59]  Colin B. Compas,et al.  Federated learning for predicting clinical outcomes in patients with COVID-19 , 2021, Nature Medicine.

[60]  Geert J. S. Litjens,et al.  Quantifying the effects of data augmentation and stain color normalization in convolutional neural networks for computational pathology , 2019, Medical Image Anal..

[61]  Marius Lindauer,et al.  Best Practices for Scientific Research on Neural Architecture Search , 2019, ArXiv.

[62]  Miss A.O. Penney (b) , 1974, The New Yale Book of Quotations.

[63]  Zhihui Li,et al.  A Comprehensive Survey of Neural Architecture Search , 2021, ACM Comput. Surv..

[64]  Ion Stoica,et al.  Tune: A Research Platform for Distributed Model Selection and Training , 2018, ArXiv.

[65]  Karl Rohr,et al.  Hyperparameter optimization for image analysis: application to prostate tissue images and live cell data of virus-infected cells , 2019, International Journal of Computer Assisted Radiology and Surgery.

[66]  Li Fei-Fei,et al.  ImageNet: A large-scale hierarchical image database , 2009, CVPR.

[67]  T. Hermanns,et al.  Automated Gleason grading of prostate cancer tissue microarrays via deep learning , 2018, Scientific Reports.

[68]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[69]  Jeroen van der Laak,et al.  Extending Unsupervised Neural Image Compression With Supervised Multitask Learning , 2020, MIDL.

[70]  Gustavo Carneiro,et al.  Deep Learning in Medical Image Analysis and Multimodal Learning for Clinical Decision Support , 2017, Lecture Notes in Computer Science.

[71]  D. Fan,et al.  Evaluation of the performance of traditional machine learning algorithms, convolutional neural network and AutoML Vision in ultrasound breast lesions classification: a comparative study. , 2021, Quantitative imaging in medicine and surgery.

[72]  Marius Lindauer,et al.  Auto-Pytorch: Multi-Fidelity MetaLearning for Efficient and Robust AutoDL , 2021, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[73]  Bo Chen,et al.  MnasNet: Platform-Aware Neural Architecture Search for Mobile , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[74]  Yan Zeng,et al.  A machine learning model for detecting invasive ductal carcinoma with Google Cloud AutoML Vision , 2020, Comput. Biol. Medicine.

[75]  Andrew Janowczyk,et al.  Deep learning for digital pathology image analysis: A comprehensive tutorial with selected use cases , 2016, Journal of pathology informatics.

[76]  P. Alam ‘G’ , 2021, Composites Engineering: An A–Z Guide.

[77]  A. Borkowski,et al.  Using Artificial Intelligence for COVID-19 Chest X-ray Diagnosis , 2020, medRxiv.

[78]  Jon Kleinberg,et al.  Transfusion: Understanding Transfer Learning for Medical Imaging , 2019, NeurIPS.

[79]  Stefano Marrone,et al.  On Reproducibility of Deep Convolutional Neural Networks Approaches , 2018, RRPR.

[80]  Alessandro Bria,et al.  Addressing class imbalance in deep learning for small lesion detection on medical images , 2020, Comput. Biol. Medicine.

[81]  Andrew A. Borkowski,et al.  Comparing Artificial Intelligence Platforms for Histopathologic Cancer Diagnosis. , 2019, Federal practitioner : for the health care professionals of the VA, DoD, and PHS.

[82]  M. Cui,et al.  Artificial intelligence and computational pathology , 2021, Laboratory Investigation.

[83]  Qingquan Song,et al.  Auto-Keras: An Efficient Neural Architecture Search System , 2018, KDD.

[84]  A. Madabhushi,et al.  Artificial intelligence in digital pathology — new tools for diagnosis and precision oncology , 2019, Nature Reviews Clinical Oncology.

[85]  Peter Stone,et al.  The Impact of Nondeterminism on Reproducibility in Deep Reinforcement Learning , 2018 .