Securing the future of research computing in the biosciences

Author summary Improvements in technology often drive scientific discovery. Therefore, research requires sustained investment in the latest equipment and training for the researchers who are going to use it. Prioritising and administering infrastructure investment is challenging because future needs are difficult to predict. In the past, highly computationally demanding research was associated primarily with particle physics and astronomy experiments. However, as biology becomes more quantitative and bioscientists generate more and more data, their computational requirements may ultimately exceed those of physical scientists. Computation has always been central to bioinformatics, but now imaging experiments have rapidly growing data processing and storage requirements. There is also an urgent need for new modelling and simulation tools to provide insight and understanding of these biophysical experiments. Bioscience communities must work together to provide the software and skills training needed in their areas. Research-active institutions need to recognise that computation is now vital in many more areas of discovery and create an environment where it can be embraced. The public must also become aware of both the power and limitations of computing, particularly with respect to their health and personal data.

[1]  E. Lindahl,et al.  Accelerated cryo-EM structure determination with parallelisation using GPUs in RELION-2 , 2016, bioRxiv.

[2]  William J. Abernathy,et al.  Patterns of Industrial Innovation , 1978 .

[3]  Michael Y. Galperin,et al.  The 2012 Nucleic Acids Research Database Issue and the online Molecular Biology Database Collection , 2011, Nucleic Acids Res..

[4]  David J. Fleet,et al.  cryoSPARC: algorithms for rapid unsupervised cryo-EM structure determination , 2017, Nature Methods.

[5]  Bálint Antal,et al.  Image Data Resource: a bioimage data integration and publication platform , 2017, Nature Methods.

[6]  R. Horwitz,et al.  Whole cell maps chart a course for 21st-century cell biology , 2017, Science.

[7]  Adrian J. Mulholland,et al.  In pursuit of an accurate spatial and temporal model of biomolecules at the atomistic level: a perspective on computer simulation , 2015, Acta crystallographica. Section D, Biological crystallography.

[8]  G J Kleywegt,et al.  Where freedom is given, liberties are taken. , 1995, Structure.

[9]  Erick A. Perez Alday,et al.  Recent progress in multi-scale models of the human atria , 2014 .

[10]  L. Loew,et al.  The Virtual Cell: a software environment for computational cell biology. , 2001, Trends in biotechnology.

[11]  Rafael Fernandez-Leiro,et al.  A pipeline approach to single-particle processing in RELION , 2016, bioRxiv.

[12]  E. Lin,et al.  Machine learning and systems genomics approaches for multi-omics data , 2017, Biomarker Research.

[13]  David S. Goodsell,et al.  Instant Construction and Visualization of Crowded Biological Environments , 2018, IEEE Transactions on Visualization and Computer Graphics.

[14]  Min Xu,et al.  A convolutional autoencoder approach for mining features in cellular electron cryo-tomograms and weakly supervised coarse segmentation , 2017, Journal of structural biology.

[15]  J. C. H. Spence,et al.  XFELs for structure and dynamics in biology , 2017, IUCrJ.

[16]  David I Stuart,et al.  Fixed target combined with spectral mapping: approaching 100% hit rates for serial crystallography. , 2016, Acta crystallographica. Section D, Structural biology.

[17]  Ivan Viola,et al.  cellVIEW: a Tool for Illustrative and Multi-Scale Rendering of Large Biomolecular Datasets , 2015, VCBM.

[18]  Wes Sharrock,et al.  The State of Development of CSE , 2012 .

[19]  L. Looger,et al.  Diverse protocols for correlative super-resolution fluorescence imaging and electron microscopy of chemically fixed samples , 2017, Nature Protocols.

[20]  Ruth Nussinov,et al.  Making Biomolecular Simulations Accessible in the Post-Nobel Prize Era , 2014, PLoS Comput. Biol..

[21]  Sébastien Boutet,et al.  Direct observation of ultrafast collective motions in CO myoglobin upon ligand dissociation , 2015, Science.

[22]  Gerry McDermott,et al.  Mesoscale imaging with cryo‐light and X‐rays: Larger than molecular machines, smaller than a cell , 2017, Biology of the cell.

[23]  J. P. Grossman,et al.  Biomolecular simulation: a computational microscope for molecular biology. , 2012, Annual review of biophysics.

[24]  John L. Klepeis,et al.  Millisecond-scale molecular dynamics simulations on Anton , 2009, Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis.

[25]  Rebecca F Thompson,et al.  Collection, pre-processing and on-the-fly analysis of data for high-resolution, single-particle cryo-electron microscopy , 2018, Nature Protocols.

[26]  Junli Liu,et al.  Bayesian uncertainty analysis for complex systems biology models: emulation, global parameter searches and evaluation of gene functions , 2016, BMC Systems Biology.

[27]  Richard H. Henchman,et al.  Biomolecular simulations: From dynamics and mechanisms to computational assays of biological activity , 2018, WIREs Computational Molecular Science.

[28]  Martin Gasthuber,et al.  Online & Offline data storage and data processing at the European XFEL facility , 2017 .

[29]  Ardan Patwardhan,et al.  A call for public archives for biological image data , 2018, Nature Methods.

[30]  Xosé M. Fernández-Suárez,et al.  The 2018 Nucleic Acids Research database issue and the online molecular biology database collection , 2017, Nucleic Acids Res..

[31]  Natalia A Trayanova,et al.  How computer simulations of the human heart can improve anti‐arrhythmia therapy , 2016, The Journal of physiology.

[32]  Isuru D. Jayasinghe,et al.  True Molecular Scale Visualization of Variable Clustering Properties of Ryanodine Receptors , 2018, Cell reports.

[33]  Rolf Apweiler,et al.  The European Bioinformatics Institute in 2017: data coordination and integration , 2017, Nucleic Acids Res..

[34]  K Schulten,et al.  VMD: visual molecular dynamics. , 1996, Journal of molecular graphics.

[35]  Alexis Rohou,et al.  cisTEM: User-friendly software for single-particle image processing , 2017, bioRxiv.

[36]  Gregory A Voth,et al.  A Multiscale Description of Biomolecular Active Matter: The Chemistry Underlying Many Life Processes. , 2017, Accounts of chemical research.

[37]  Simon Hettrick,et al.  Research Software Engineers: State of the Nation Report 2017 , 2017 .

[38]  P. Hunter,et al.  The Virtual Physiological Human: Ten Years After. , 2016, Annual review of biomedical engineering.

[39]  Anton Barty,et al.  Femtosecond structural dynamics drives the trans/cis isomerization in photoactive yellow protein , 2016, Science.

[40]  Gerard J Kleywegt,et al.  Homo crystallographicus--quo vadis? , 2002, Structure.

[41]  Deborah Lupton,et al.  Quantified Self , 2018, Encyclopedia of Social Network Analysis and Mining. 2nd Ed..

[42]  Henning Hermjakob,et al.  The Reactome pathway knowledgebase , 2013, Nucleic Acids Res..

[43]  Joachim Frank,et al.  New Opportunities Created by Single-Particle Cryo-EM: The Mapping of Conformational Space , 2018, Biochemistry.

[44]  Erik Franken,et al.  A 3D cellular context for the macromolecular world , 2014, Nature Structural &Molecular Biology.

[45]  J. Frank,et al.  Automated particle picking for low-contrast macromolecules in cryo-electron microscopy. , 2014, Journal of structural biology.

[46]  J. Michael Cherry,et al.  The Encyclopedia of DNA elements (ENCODE): data portal update , 2017, Nucleic Acids Res..

[47]  Elliot M. Meyerowitz,et al.  Observing the cell in its native state: Imaging subcellular dynamics in multicellular organisms , 2018, Science.

[48]  Ardan Patwardhan,et al.  Trends in the Electron Microscopy Data Bank (EMDB) , 2017, Acta crystallographica. Section D, Structural biology.

[49]  Laxmikant V. Kalé,et al.  Scalable molecular dynamics with NAMD , 2005, J. Comput. Chem..

[50]  Thomas Blaschke,et al.  The rise of deep learning in drug discovery. , 2018, Drug discovery today.

[51]  Allan M Jordan,et al.  Artificial Intelligence in Drug Design-The Storm Before the Calm? , 2018, ACS medicinal chemistry letters.

[52]  W. Kühlbrandt The Resolution Revolution , 2014, Science.

[53]  Bevin Brett,et al.  Massively parallel unsupervised single-particle cryo-EM data clustering via statistical manifold learning , 2016, PloS one.

[54]  Andrew I Su,et al.  Exploring applications of crowdsourcing to cryo-EM , 2018, Journal of structural biology.

[55]  Zaida Luthey-Schulten,et al.  Challenges of Integrating Stochastic Dynamics and Cryo-Electron Tomograms in Whole-Cell Simulations. , 2017, The journal of physical chemistry. B.

[56]  Liesbet Geris,et al.  The future is digital: In silico tissue engineering , 2018, Current Opinion in Biomedical Engineering.

[57]  Ardan Patwardhan,et al.  EMPIAR: a public archive for raw electron microscopy image data , 2016, Nature Methods.

[58]  Sjors H.W. Scheres,et al.  RELION: Implementation of a Bayesian approach to cryo-EM structure determination , 2012, Journal of structural biology.