MicrographCleaner: a python package for cryo-EM micrograph cleaning using deep learning

Cryo-EM Single Particle Analysis workflows require from tens of thousands of high-quality particle projections to unveil the three-dimensional structure of macromolecules. Conventional methods for automatic particle picking tend to suffer from high false-positive rates, hurdling the reconstruction process. One common cause of this problem is the presence of carbon and different types of high-contrast contaminations. In order to overcome this limitation, we have developed MicrographCleaner, a deep learning package designed to discriminate which regions of micrographs are suitable for particle picking and which are not in an automatic fashion. MicrographCleaner implements a U-net-like deep learning model trained on a manually curated dataset compiled from over five hundred micrographs. The benchmarking, carried out on about one hundred independent micrographs, shows that MicrographCleaner is a very efficient approach for micrograph preprocessing. MicrographCleaner (micrograph_cleaner_em) package is available at PyPI and Anaconda Cloud and also as a Scipion/Xmipp protocol. Source code is available at https://github.com/rsanchezgarc/micrograph_cleaner_em.

[1]  Radostin Danev,et al.  Volta phase plate data collection facilitates image processing and cryo-EM structure determination. , 2018, Journal of structural biology.

[2]  Thorsten Wagner,et al.  SPHIRE-crYOLO is a fast and accurate fully automated particle picker for cryo-EM , 2019, Communications Biology.

[3]  Mingxia Feng,et al.  Structure of Csx1-cOA4 complex reveals the basis of RNA decay in Type III-B CRISPR-Cas , 2019, Nature Communications.

[4]  Emmanuelle Gouillart,et al.  scikit-image: image processing in Python , 2014, PeerJ.

[5]  J. McLellan,et al.  Structure of the Respiratory Syncytial Virus Polymerase Complex , 2019, Cell.

[6]  Yuan Yu,et al.  TensorFlow: A system for large-scale machine learning , 2016, OSDI.

[7]  Markus Stabrin,et al.  SPHIRE-crYOLO is a fast and accurate fully automated particle picker for cryo-EM. , 2019 .

[8]  Satya Prakash,et al.  Cryo-EM structure and dynamics of eukaryotic DNA polymerase δ holoenzyme , 2019, Nature Structural & Molecular Biology.

[9]  Christopher Irving,et al.  Appion: an integrated, database-driven pipeline to facilitate EM image processing. , 2009, Journal of structural biology.

[10]  Takanori Nakane,et al.  Cryo-EM structure of the human L-type amino acid transporter 1 in complex with glycoprotein CD98hc , 2019, Nature Structural & Molecular Biology.

[11]  Li Fei-Fei,et al.  Perceptual Losses for Real-Time Style Transfer and Super-Resolution , 2016, ECCV.

[12]  Sjors H.W. Scheres,et al.  Semi-automated selection of cryo-EM particles in RELION-1.3 , 2015, Journal of structural biology.

[13]  Tian Xia,et al.  DeepPicker: a Deep Learning Approach for Fully Automated Particle Picking in Cryo-EM , 2016, Journal of structural biology.

[14]  Thomas Brox,et al.  U-Net: deep learning for cell counting, detection, and morphometry , 2018, Nature Methods.

[15]  Dimitry Tegunov,et al.  Real-time cryo–EM data pre-processing with Warp , 2019, Nature Methods.

[16]  Bonnie Berger,et al.  Positive-unlabeled convolutional neural networks for particle picking in cryo-electron micrographs , 2018, Nature Methods.

[17]  D. Barford,et al.  Structure of the inner kinetochore CCAN complex assembled onto a centromeric nucleosome , 2019, Nature.

[18]  Thomas Brox,et al.  U-Net: Convolutional Networks for Biomedical Image Segmentation , 2015, MICCAI.

[19]  C O S Sorzano,et al.  Scipion: A software framework toward integration, reproducibility and validation in 3D electron microscopy. , 2016, Journal of structural biology.

[20]  Renmin Han,et al.  PIXER: an automated particle-selection method based on segmentation using a deep neural network , 2019, BMC Bioinformatics.

[21]  Carlos Oscar S Sorzano,et al.  Deep Consensus, a deep learning-based approach for particle pruning in cryo-electron microscopy , 2018, IUCrJ.

[22]  Ardan Patwardhan,et al.  EMPIAR: a public archive for raw electron microscopy image data , 2016, Nature Methods.

[23]  D. Kriegman,et al.  Automatic particle selection: results of a comparative study. , 2004, Journal of structural biology.

[24]  Andrew B. Ward,et al.  EMHP: an accurate automated hole masking algorithm for single-particle cryo-EM image processing , 2017, bioRxiv.

[25]  Luca Viganò,et al.  Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) , 2015, IWSEC 2015.

[26]  Martín Abadi,et al.  TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems , 2016, ArXiv.

[27]  Yanan Zhu,et al.  A deep convolutional neural network approach to single-particle recognition in cryo-electron microscopy , 2016, BMC Bioinformatics.

[28]  C O S Sorzano,et al.  Particle quality assessment and sorting for automatic and semiautomatic particle-picking techniques. , 2013, Journal of structural biology.

[29]  Tomohiro Nishizawa,et al.  Cryo-EM structures capture the transport cycle of the P4-ATPase flippase , 2019, Science.

[30]  G. Demo,et al.  Principles for enhancing virus capsid capacity and stability from a thermophilic virus capsid structure , 2018, Nature Communications.