Advances in Modern Information Technologies for Data Analysis in CRYO-EM and XFEL Experiments

A new approach to organizing data pipelines in cryo-electron microscopy (Cryo-EM) and X-ray free-electron laser (XFEL) experiments is presented. The approach builds on progress in information technologies (IT) brought about by containerization techniques, and it separates the user's work at the application level from the work of IT experts at the system and middleware levels. A user has to perform only two simple operations: pack the application packages into containers and write a workflow that describes the data processing logic in a standard format. Several examples of containerized workflows are discussed for Cryo-EM and XFEL experiments that study the spatial structure of single biological nanoobjects (viruses, macromolecules, etc.). Example code for installing application packages in Docker containers and example application workflows written in the high-level language CWL are available on the project website. The examples are commented, which may help a researcher with little IT experience understand how to organize Docker containers and compose CWL workflows for Cryo-EM and XFEL data pipelines.
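The two user-side operations described above can be illustrated with a minimal sketch. It is not taken from the project's examples: the step names, the commands `motion_correct` and `ctf_estimate`, and the image names under `example.org` are hypothetical placeholders. The sketch only assumes that a CWL v1.0 document may be written as JSON and uses the standard CWL fields `DockerRequirement`, `inputs`, `outputs`, and `steps`.

```python
import json


def make_tool(base_command, image, input_type, output_glob):
    """Describe one containerized program as a CWL CommandLineTool (as a dict).

    `base_command` and `image` are hypothetical placeholders; a real pipeline
    would name an actual program and a Docker image that contains it.
    """
    return {
        "class": "CommandLineTool",
        "baseCommand": base_command,
        # The container image in which the command is executed.
        "requirements": [{"class": "DockerRequirement", "dockerPull": image}],
        "inputs": {"data": {"type": input_type, "inputBinding": {"position": 1}}},
        "outputs": {"results": {"type": "File[]", "outputBinding": {"glob": output_glob}}},
    }


# Step 1 (hypothetical): motion correction of raw movies.
motion_tool = make_tool("motion_correct", "example.org/motion:latest", "Directory", "*.mrc")
# Step 2 (hypothetical): CTF estimation on the corrected micrographs.
ctf_tool = make_tool("ctf_estimate", "example.org/ctf:latest", "File[]", "*.txt")

# The workflow only states the processing logic: which step consumes which output.
workflow = {
    "cwlVersion": "v1.0",
    "class": "Workflow",
    "inputs": {"raw_movies": "Directory"},
    "outputs": {"ctf_results": {"type": "File[]", "outputSource": "ctf/results"}},
    "steps": {
        "motion": {"run": motion_tool, "in": {"data": "raw_movies"}, "out": ["results"]},
        "ctf": {"run": ctf_tool, "in": {"data": "motion/results"}, "out": ["results"]},
    },
}

# Serialize to JSON text that could be saved as a workflow file.
workflow_text = json.dumps(workflow, indent=2)
```

The resulting text could be saved to a file and handed to a CWL runner. Where and how the containers actually run is left to the system and middleware levels, which is the separation the approach aims for.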
