The Landscape of Exascale Research

The next generation of supercomputers will break the exascale barrier. Soon we will have systems capable of at least one quintillion (billion billion) floating-point operations per second (1018 FLOPS). Tremendous amounts of work have been invested into identifying and overcoming the challenges of the exascale era. In this work, we present an overview of these efforts and provide insight into the important trends, developments, and exciting research opportunities in exascale computing. We use a three-stage approach in which we (1) discuss various exascale landmark studies, (2) use data-driven techniques to analyze the large collection of related literature, and (3) discuss eight research areas in depth based on influential articles. Overall, we observe that great advancements have been made in tackling the two primary exascale challenges: energy efficiency and fault tolerance. However, as we look forward, we still foresee two major concerns: the lack of suitable programming tools and the growing gap between processor performance and data bandwidth (i.e., memory, storage, networks). Although we will certainly reach exascale soon, without additional research, these issues could potentially limit the applicability of exascale computing.

[1]  Sebastian Werner,et al.  A Survey on Optical Network-on-Chip Architectures , 2017, ACM Comput. Surv..

[2]  Torsten Wilde,et al.  Monitoring Power Data: A first step towards a unified energy efficiency evaluation toolset for HPC data centers , 2014, Environ. Model. Softw..

[3]  Robert B. Ross,et al.  Enabling Parallel Simulation of Large-Scale HPC Network Systems , 2017, IEEE Transactions on Parallel and Distributed Systems.

[4]  Indrani Paul,et al.  Achieving Exascale Capabilities through Heterogeneous Computing , 2015, IEEE Micro.

[5]  Franck Cappello,et al.  Toward Exascale Resilience , 2009, Int. J. High Perform. Comput. Appl..

[6]  David Abramson,et al.  A survey on software methods to improve the energy efficiency of parallel computing , 2017, Int. J. High Perform. Comput. Appl..

[7]  Christian H. Bischof,et al.  How Many Threads will be too Many? On the Scalability of OpenMP Implementations , 2015, Euro-Par.

[8]  David Blaauw,et al.  Exploring DRAM organizations for energy-efficient and resilient exascale memories , 2013, 2013 SC - International Conference for High Performance Computing, Networking, Storage and Analysis (SC).

[9]  Xin Liu,et al.  Document clustering based on non-negative matrix factorization , 2003, SIGIR.

[10]  Thomas Hérault,et al.  A Checkpoint-on-Failure Protocol for Algorithm-Based Recovery in Standard MPI , 2012, Euro-Par.

[11]  Samuel Williams,et al.  ExaSAT: An exascale co-design tool for performance modeling , 2015, Int. J. High Perform. Comput. Appl..

[12]  Laxmikant V. Kalé,et al.  "Cool" Load Balancing for High Performance Computing Data Centers , 2012, IEEE Trans. Computers.

[13]  Alistair P. Rendell,et al.  Implementation and Optimization of the OpenMP Accelerator Model for the TI Keystone II Architecture , 2014, IWOMP.

[14]  Robert Sisneros,et al.  Damaris/Viz: A nonintrusive, adaptable and user-friendly in situ visualization framework , 2013, 2013 IEEE Symposium on Large-Scale Data Analysis and Visualization (LDAV).

[15]  Christian Engelmann,et al.  Scaling to a million cores and beyond: Using light-weight simulation to understand the challenges ahead on the road to exascale , 2014, Future Gener. Comput. Syst..

[16]  Lee Gillam,et al.  Energy efficient computing, clusters, grids and clouds: A taxonomy and survey , 2017, Sustain. Comput. Informatics Syst..

[17]  John Shalf,et al.  Exascale Computing Technology Challenges , 2010, VECPAR.

[18]  Lior Rokach,et al.  Clustering Methods , 2005, The Data Mining and Knowledge Discovery Handbook.

[19]  Bob Edwards,et al.  Programming the Adapteva Epiphany 64-core network-on-chip coprocessor , 2014, 2014 IEEE International Parallel & Distributed Processing Symposium Workshops.

[20]  Alexey L. Lastovetsky,et al.  Hierarchical approach to optimization of parallel matrix multiplication on large-scale platforms , 2015, The Journal of Supercomputing.

[21]  Carlos Maltzahn,et al.  DAOS and Friends: A Proposal for an Exascale Storage System , 2016, SC16: International Conference for High Performance Computing, Networking, Storage and Analysis.

[22]  Vladimir V. Stegailov,et al.  Efficiency of the Tegra K1 and X1 systems-on-chip for classical molecular dynamics , 2016, 2016 International Conference on High Performance Computing & Simulation (HPCS).

[23]  Patrick M. Reed,et al.  Evolving many‐objective water management to exploit exascale computing , 2014 .

[24]  William J. Dally,et al.  Scaling the Power Wall: A Path to Exascale , 2014, SC14: International Conference for High Performance Computing, Networking, Storage and Analysis.

[25]  Stefan Lankes,et al.  Application migration in HPC — A driver of the exascale era? , 2016, 2016 International Conference on High Performance Computing & Simulation (HPCS).

[26]  Mark Giampapa,et al.  Experiences with a Lightweight Supercomputer Kernel: Lessons Learned from Blue Gene's CNK , 2010, 2010 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis.

[27]  Luca Benini,et al.  Predictive Modeling for Job Power Consumption in HPC Systems , 2016, ISC.

[28]  Satoshi Matsuoka,et al.  Design and modeling of a non-blocking checkpointing system , 2012, 2012 International Conference for High Performance Computing, Networking, Storage and Analysis.

[29]  William Gropp,et al.  An introductory exascale feasibility study for FFTs and multigrid , 2010, 2010 IEEE International Symposium on Parallel & Distributed Processing (IPDPS).

[30]  Maya Gokhale,et al.  Argo NodeOS: Toward Unified Resource Management for Exascale , 2017, 2017 IEEE International Parallel and Distributed Processing Symposium (IPDPS).

[31]  Yuan Xie,et al.  Leveraging 3D PCRAM technologies to reduce checkpoint overhead for future exascale systems , 2009, Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis.

[32]  Michael W. Berry,et al.  Text Mining Using Non-Negative Matrix Factorizations , 2004, SDM.

[33]  Tao Tang,et al.  OpenMC: Towards Simplifying Programming for TianHe Supercomputers , 2014, Journal of Computer Science and Technology.

[34]  Jeffrey S. Vetter,et al.  Opportunities for Nonvolatile Memory Systems in Extreme-Scale High-Performance Computing , 2015, Computing in Science & Engineering.

[35]  Kenneth Moreland,et al.  Sandia National Laboratories , 2000 .

[36]  William Gropp,et al.  Programming for Exascale Computers , 2013, Computing in Science & Engineering.

[37]  Ronny Henker,et al.  Survey of Photonic and Plasmonic Interconnect Technologies for Intra-Datacenter and High-Performance Computing Communications , 2018, IEEE Communications Surveys & Tutorials.

[38]  Javier Navaridas,et al.  Designing an exascale interconnect using multi-objective optimization , 2017, 2017 IEEE Congress on Evolutionary Computation (CEC).

[39]  Ewa Deelman,et al.  Measuring the impact of burst buffers on data-intensive scientific workflows , 2019, Future Gener. Comput. Syst..

[40]  Andrew A. Chien,et al.  Data decomposition in Monte Carlo neutron transport simulations using global view arrays , 2015, Int. J. High Perform. Comput. Appl..

[41]  Sorin Faibish,et al.  Jitter-free co-processing on a prototype exascale storage stack , 2012, 012 IEEE 28th Symposium on Mass Storage Systems and Technologies (MSST).

[42]  Daniel Sunderland,et al.  Kokkos: Enabling manycore performance portability through polymorphic memory access patterns , 2014, J. Parallel Distributed Comput..

[43]  Eric Borch,et al.  Megafly: A Topology for Exascale Systems , 2018, ISC.

[44]  Kwan-Liu Ma,et al.  In Situ Visualization at Extreme Scale: Challenges and Opportunities , 2009, IEEE Computer Graphics and Applications.

[45]  Eduard Ayguadé,et al.  Programmability and portability for exascale: Top down programming methodology and tools with StarSs , 2013, J. Comput. Sci..

[46]  Torsten Hoefler,et al.  Efficient task placement and routing of nearest neighbor exchanges in dragonfly networks , 2014, HPDC '14.

[47]  Paul Nilsson,et al.  Overview of ATLAS PanDA Workload Management , 2011 .

[48]  Jacob Nelson,et al.  Comparing Runtime Systems with Exascale Ambitions Using the Parallel Research Kernels , 2016, ISC.

[49]  Barbara I. Wohlmuth,et al.  Resilience for Massively Parallel Multigrid Solvers , 2016, SIAM J. Sci. Comput..

[50]  Seyong Lee,et al.  Early evaluation of directive-based GPU programming models for productive exascale computing , 2012, 2012 International Conference for High Performance Computing, Networking, Storage and Analysis.

[51]  André Brinkmann,et al.  Improving Collective I/O Performance Using Non-volatile Memory Devices , 2016, 2016 IEEE International Conference on Cluster Computing (CLUSTER).

[52]  Jeffrey S. Vetter,et al.  A Survey of CPU-GPU Heterogeneous Computing Techniques , 2015, ACM Comput. Surv..

[53]  Simon D. Hammond,et al.  Optical interconnects for extreme scale computing systems , 2017, Parallel Comput..

[54]  Anna Sidorova,et al.  Uncovering the Intellectual Core of the Information Systems Discipline , 2008, MIS Q..

[55]  Thomas Lippert,et al.  The DEEP Project - Pursuing Cluster-Computing in the Many-Core Era , 2013, 2013 42nd International Conference on Parallel Processing.

[56]  Kai Ren,et al.  DeltaFS: exascale file systems scale better without dedicated servers , 2015, PDSW '15.

[57]  Jinsuk Chung,et al.  Containment domains: A scalable, efficient, and flexible resilience scheme for exascale systems , 2012, 2012 International Conference for High Performance Computing, Networking, Storage and Analysis.

[58]  Shaheen Syed,et al.  Using Machine Learning to Uncover Latent Research Topics in Fishery Models , 2018 .

[59]  Marcin Kozak,et al.  Improved Scatterplot Design , 2010, IEEE Computer Graphics and Applications.

[60]  Vivek Sarkar,et al.  Chapel-on-X: Exploring Tasking Runtimes for PGAS Languages , 2017, ESPM2@SC.

[61]  Hartmut Kaiser,et al.  HPX: A Task Based Programming Model in a Global Address Space , 2014, PGAS.

[62]  Rolf Riesen,et al.  Detection and Correction of Silent Data Corruption for Large-Scale High-Performance Computing , 2012, 2011 IEEE International Symposium on Parallel and Distributed Processing Workshops and Phd Forum.

[63]  Henri E. Bal,et al.  ExaScale high performance computing in the square kilometer array , 2012, Astro-HPC '12.

[64]  Laxmikant V. Kalé,et al.  A ‘cool’ way of improving the reliability of HPC machines , 2013, 2013 SC - International Conference for High Performance Computing, Networking, Storage and Analysis (SC).

[65]  Robert F. Lucas,et al.  Rolex: resilience-oriented language extensions for extreme-scale systems , 2016, The Journal of Supercomputing.

[66]  Peter Wiemer-Hastings,et al.  Latent semantic analysis , 2004, Annu. Rev. Inf. Sci. Technol..

[67]  Thomas Steinke,et al.  FFMK: A Fast and Fault-Tolerant Microkernel-Based System for Exascale Computing , 2016, Software for Exascale Computing.

[68]  Deva Bodas,et al.  Simple Power-Aware Scheduler to Limit Power Consumption by HPC System within a Budget , 2014, 2014 Energy Efficient Supercomputing Workshop.

[69]  Mateo Valero,et al.  ALYA: MULTIPHYSICS ENGINEERING SIMULATION TOWARDS EXASCALE , 2014 .

[70]  Richard W. Vuduc,et al.  On the communication complexity of 3D FFTs and its implications for Exascale , 2012, ICS '12.

[71]  Thomas Hérault,et al.  On Scalability for MPI Runtime Systems , 2011, 2011 IEEE International Conference on Cluster Computing.

[72]  Maya Gokhale,et al.  A Container-Based Approach to OS Specialization for Exascale Computing , 2015, 2015 IEEE International Conference on Cloud Engineering.

[73]  Laurent Villard,et al.  A portable platform for accelerated PIC codes and its application to GPUs using OpenACC , 2016, Comput. Phys. Commun..

[74]  Adam Carter,et al.  Preparing Scientific Application Software for Exascale Computing , 2012, PARA.

[75]  J. Dongarra Performance of various computers using standard linear equations software , 1990, CARN.

[76]  Michael W. Berry,et al.  Document clustering using nonnegative matrix factorization , 2006, Inf. Process. Manag..

[77]  Franck Cappello,et al.  Adaptive Impact-Driven Detection of Silent Data Corruption for HPC Applications , 2016, IEEE Transactions on Parallel and Distributed Systems.

[78]  Anthony A. Maciejewski,et al.  Optimizing checkpoint intervals for reduced energy use in exascale systems , 2017, 2017 Eighth International Green and Sustainable Computing Conference (IGSC).

[79]  Gerard Salton,et al.  Term-Weighting Approaches in Automatic Text Retrieval , 1988, Inf. Process. Manag..

[80]  Alfonso Niño,et al.  A Survey of Parallel Programming Models and Tools in the Multi and Many-core Era , 2022 .

[81]  John Shalf,et al.  Software Design Space Exploration for Exascale Combustion Co-design , 2013, ISC.

[82]  Carsten Kutzner,et al.  Tackling Exascale Software Challenges in Molecular Dynamics Simulations with GROMACS , 2015, EASC.

[83]  Dong Li,et al.  Identifying Opportunities for Byte-Addressable Non-Volatile Memory in Extreme-Scale Scientific Applications , 2012, 2012 IEEE 26th International Parallel and Distributed Processing Symposium.

[84]  Y. Robert,et al.  Fault-Tolerance Techniques for High-Performance Computing , 2015, Computer Communications and Networks.

[85]  Franck Cappello,et al.  Optimization of Multi-level Checkpoint Model for Large Scale HPC Applications , 2014, 2014 IEEE 28th International Parallel and Distributed Processing Symposium.

[86]  Kevin W. Boyack,et al.  Clustering More than Two Million Biomedical Publications: Comparing the Accuracies of Nine Text-Based Similarity Approaches , 2011, PloS one.

[87]  Abhinav Vishnu,et al.  On the suitability of MPI as a PGAS runtime , 2014, 2014 21st International Conference on High Performance Computing (HiPC).

[88]  Christian Engelmann,et al.  What Is the Right Balance for Performance and Isolation with Virtualization in HPC? , 2014, Euro-Par Workshops.

[89]  Francisco Almeida,et al.  Measuring energy consumption using EML (energy measurement library) , 2014, Computer Science - Research and Development.

[90]  Paul Messina,et al.  The Exascale Computing Project , 2017, Comput. Sci. Eng..

[91]  Al Geist,et al.  Major Computer Science Challenges At Exascale , 2009, Int. J. High Perform. Comput. Appl..

[92]  Jaegul Choo,et al.  UTOPIAN: User-Driven Topic Modeling Based on Interactive Nonnegative Matrix Factorization , 2013, IEEE Transactions on Visualization and Computer Graphics.

[93]  Benoît Meister,et al.  The Open Community Runtime: A runtime system for extreme scale computing , 2016, 2016 IEEE High Performance Extreme Computing Conference (HPEC).

[94]  Laxmikant V. Kalé,et al.  Architectural Constraints to Attain 1 Exaflop/s for Three Scientific Application Classes , 2011, 2011 IEEE International Parallel & Distributed Processing Symposium.

[95]  R. G. Beausoleil,et al.  Photonic Architectures for High-Performance Data Centers , 2013, IEEE Journal of Selected Topics in Quantum Electronics.

[96]  Franck Cappello,et al.  Distributed Monitoring and Management of Exascale Systems in the Argo Project , 2015, DAIS.

[97]  Jung Ho Ahn,et al.  HyperX: topology, routing, and packaging of efficient large-scale networks , 2009, Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis.

[98]  Kwan-Liu Ma,et al.  Dax Toolkit: A proposed framework for data analysis and visualization at Extreme Scale , 2011, 2011 IEEE Symposium on Large Data Analysis and Visualization.

[99]  Wim Vanroose,et al.  Hiding Global Communication Latency in the GMRES Algorithm on Massively Parallel Machines , 2013, SIAM J. Sci. Comput..

[100]  Dana Petcu,et al.  Exascale Machines Require New Programming Paradigms and Runtimes , 2015, Supercomput. Front. Innov..

[101]  Tejas Karkhanis,et al.  Active Memory Cube: A processing-in-memory architecture for exascale systems , 2015, IBM J. Res. Dev..

[102]  Rupak Biswas,et al.  High performance computing using MPI and OpenMP on multi-core parallel systems , 2011, Parallel Comput..

[103]  Thomas Ludwig,et al.  Survey of Storage Systems for High-Performance Computing , 2018, Supercomput. Front. Innov..

[104]  Geoffrey E. Hinton,et al.  Visualizing Data using t-SNE , 2008 .

[105]  Franck Cappello,et al.  Toward Exascale Resilience: 2014 update , 2014, Supercomput. Front. Innov..

[106]  Laxmikant V. Kalé,et al.  Toward Runtime Power Management of Exascale Networks by on/off Control of Links , 2013, 2013 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum.

[107]  John Shalf,et al.  Exascale Computing Trends: Adjusting to the "New Normal"' for Computer Architecture , 2013, Computing in Science & Engineering.

[108]  Franck Cappello,et al.  Damaris: Addressing Performance Variability in Data Management for Post-Petascale Simulations , 2016, TOPC.

[109]  Eduard Ayguadé,et al.  The Mont-Blanc Prototype: An Alternative Approach for HPC Systems , 2016, SC16: International Conference for High Performance Computing, Networking, Storage and Analysis.

[110]  Lior Rokach,et al.  Data Mining And Knowledge Discovery Handbook , 2005 .

[111]  Alistair A. Young,et al.  Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) , 2017, MICCAI 2017.

[112]  Daniel A. Kane,et al.  A Systematic Review of Perennial Staple Crops Literature Using Topic Modeling and Bibliometric Analysis , 2016, PloS one.

[113]  Yuan Xie,et al.  Hybrid checkpointing using emerging nonvolatile memories for future exascale systems , 2011, TACO.

[114]  Jean-Pierre Panziera,et al.  The BXI Interconnect Architecture , 2015, 2015 IEEE 23rd Annual Symposium on High-Performance Interconnects.

[115]  Ray W. Grout,et al.  Ultrascale Visualization In Situ Visualization for Large-Scale Combustion Simulations , 2010 .

[116]  Wesley Bland,et al.  User Level Failure Mitigation in MPI , 2012, Euro-Par Workshops.

[117]  John Shalf,et al.  The International Exascale Software Project roadmap , 2011, Int. J. High Perform. Comput. Appl..

[118]  Franck Cappello,et al.  Detecting and Correcting Data Corruption in Stencil Applications through Multivariate Interpolation , 2015, 2015 IEEE International Conference on Cluster Computing.

[119]  Christian Engelmann,et al.  Failures in Large Scale Systems: Long-term Measurement, Analysis, and Implications , 2017, SC17: International Conference for High Performance Computing, Networking, Storage and Analysis.

[120]  Martin F. Porter,et al.  An algorithm for suffix stripping , 1997, Program.

[121]  Charu C. Aggarwal,et al.  Mining Text Data , 2012, Springer US.

[122]  Julian M. Kunkel,et al.  Exascale Storage Systems - An Analytical Study of Expenses , 2014, Supercomput. Front. Innov..

[123]  Martin Schulz,et al.  Exploring hardware overprovisioning in power-constrained, high performance computing , 2013, ICS '13.

[124]  Xinyu Niu,et al.  EXTRA: Towards the exploitation of eXascale technology for reconfigurable architectures , 2016, 2016 11th International Symposium on Reconfigurable Communication-centric Systems-on-Chip (ReCoSoC).

[125]  Rolf Riesen,et al.  Exploring the Design Space of Combining Linux with Lightweight Kernels for Extreme Scale Computing , 2015, ROSS@HPDC.

[126]  Michael W. Berry,et al.  Algorithms and applications for approximate nonnegative matrix factorization , 2007, Comput. Stat. Data Anal..

[127]  Jiaqi Liu,et al.  Soft Error Detection for Iterative Applications Using Offline Training , 2016, 2016 IEEE 23rd International Conference on High Performance Computing (HiPC).

[128]  Laxmikant V. Kalé,et al.  Using Migratable Objects to Enhance Fault Tolerance Schemes in Supercomputers , 2015, IEEE Transactions on Parallel and Distributed Systems.

[129]  Vanish Talwar,et al.  Using active NVRAM for I/O staging , 2011, PDAC '11.

[130]  Matthew J. Sottile,et al.  CAFe: Coarray Fortran Extensions for Heterogeneous Computing , 2016, 2016 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW).

[131]  Franck Cappello,et al.  Addressing failures in exascale computing , 2014, Int. J. High Perform. Comput. Appl..

[132]  David M. Eyers,et al.  Quantifying the Energy Efficiency Challenges of Achieving Exascale Computing , 2015, 2015 15th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing.

[133]  S ValacichJoseph,et al.  Uncovering the intellectual core of the information systems discipline , 2008 .

[134]  Wei Hu,et al.  Storage wall for exascale supercomputing , 2016, Frontiers of Information Technology & Electronic Engineering.

[135]  Thomas Hérault,et al.  PTG: An Abstraction for Unhindered Parallelism , 2014, 2014 Fourth International Workshop on Domain-Specific Languages and High-Level Frameworks for High Performance Computing.

[136]  Jesper Larsson Träff,et al.  The EPiGRAM Project: Preparing Parallel Programming Models for Exascale , 2016, ISC Workshops.

[137]  Vivek Sarkar,et al.  Software challenges in extreme scale systems , 2009 .

[138]  Franck Cappello,et al.  Big data and extreme-scale computing , 2018, Int. J. High Perform. Comput. Appl..

[139]  Laxmikant V. Kalé,et al.  A scalable double in-memory checkpoint and restart scheme towards exascale , 2012, IEEE/IFIP International Conference on Dependable Systems and Networks Workshops (DSN 2012).

[140]  David Roberts,et al.  Toward Efficient Programmer-Managed Two-Level Memory Hierarchies in Exascale Computers , 2014, 2014 Hardware-Software Co-Design for High Performance Computing.

[141]  Jerzy Proficz,et al.  Energy-Aware High-Performance Computing: Survey of State-of-the-Art Tools, Techniques, and Environments , 2019, Sci. Program..

[142]  Toshiyuki Shimizu,et al.  Tofu: A 6D Mesh/Torus Interconnect for Exascale Computers , 2009, Computer.

[143]  Laxmikant V. Kalé,et al.  Avoiding hot-spots on two-level direct networks , 2011, 2011 International Conference for High Performance Computing, Networking, Storage and Analysis (SC).

[144]  Richard F. Barrett,et al.  Exascale design space exploration and co-design , 2014, Future Gener. Comput. Syst..

[145]  Sai Narasimhamurthy,et al.  The SAGE project: a storage centric approach for exascale computing: invited paper , 2018, CF.

[146]  Dhabaleswar K. Panda,et al.  Power-Check: An Energy-Efficient Checkpointing Framework for HPC Clusters , 2015, 2015 15th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing.

[147]  Jack J. Dongarra,et al.  Investigating power capping toward energy‐efficient scientific applications , 2019, Concurr. Comput. Pract. Exp..

[148]  Margaret H. Wright,et al.  The opportunities and challenges of exascale computing , 2010 .

[149]  Karl Fürlinger,et al.  Exploiting Hierarchical Exascale Hardware using a PGAS Approach , 2015 .

[150]  Franck Cappello,et al.  Fault Tolerance in Petascale/ Exascale Systems: Current Knowledge, Challenges and Research Opportunities , 2009, Int. J. High Perform. Comput. Appl..

[151]  Bianca Schroeder,et al.  Reading between the lines of failure logs: Understanding how HPC systems fail , 2013, 2013 43rd Annual IEEE/IFIP International Conference on Dependable Systems and Networks (DSN).

[152]  Arie Shoshani,et al.  Hello ADIOS: the challenges and lessons of developing leadership class I/O frameworks , 2014, Concurr. Comput. Pract. Exp..

[153]  Thomas Hérault,et al.  Hierarchical QR factorization algorithms for multi-core clusters , 2013, Parallel Comput..

[154]  Peter Bauer,et al.  Atlas : A library for numerical weather prediction and climate modelling , 2017, Comput. Phys. Commun..

[155]  Robert B. Ross,et al.  Evaluation of Topology-Aware Broadcast Algorithms for Dragonfly Networks , 2016, 2016 IEEE International Conference on Cluster Computing (CLUSTER).

[156]  David A. Padua,et al.  Hierarchically Tiled Array as a High-Level Abstraction for Codelets , 2014, 2014 Fourth Workshop on Data-Flow Execution Models for Extreme Scale Computing.

[157]  Mateo Valero,et al.  Interconnection Networks in Petascale Computer Systems , 2016, ACM Comput. Surv..

[158]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[159]  Stephen L. Olivier,et al.  Power Measurement and Concurrency Throttling for Energy Reduction in OpenMP Programs , 2013, 2013 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum.

[160]  Gianluca Durelli,et al.  Towards exascale computing with heterogeneous architectures , 2017, Design, Automation & Test in Europe Conference & Exhibition (DATE), 2017.

[161]  Hari Sundar,et al.  FFT, FMM, or Multigrid? A comparative Study of State-Of-the-Art Poisson Solvers for Uniform and Nonuniform Grids in the Unit Cube , 2014, SIAM J. Sci. Comput..

[162]  Scott Klasky,et al.  Exploring Automatic, Online Failure Recovery for Scientific Applications at Extreme Scales , 2014, SC14: International Conference for High Performance Computing, Networking, Storage and Analysis.

[163]  Ravishankar K. Iyer,et al.  Measuring and Understanding Extreme-Scale Application Resilience: A Field Study of 5,000,000 HPC Application Runs , 2015, 2015 45th Annual IEEE/IFIP International Conference on Dependable Systems and Networks.

[164]  John Shalf,et al.  Rethinking Hardware-Software Codesign for Exascale Systems , 2011, Computer.

[165]  Sophie Valcke,et al.  Crossing the chasm: how to develop weather and climate models for next generation computers? , 2017 .

[166]  Frank Mueller,et al.  Power tuning HPC jobs on power-constrained systems , 2016, 2016 International Conference on Parallel Architecture and Compilation Techniques (PACT).

[167]  Rolf Riesen,et al.  A Multi-Kernel Survey for High-Performance Computing , 2016, ROSS@HPDC.

[168]  Kevin T. Pedretti,et al.  Achieving Performance Isolation with Lightweight Co-Kernels , 2015, HPDC.

[169]  Ron Brightwell,et al.  On the Viability of Compression for Reducing the Overheads of Checkpoint/Restart-Based Fault Tolerance , 2012, 2012 41st International Conference on Parallel Processing.

[170]  Yu-Jin Zhang,et al.  Nonnegative Matrix Factorization: A Comprehensive Review , 2013, IEEE Transactions on Knowledge and Data Engineering.

[171]  James H. Laros,et al.  Evaluating the viability of process replication reliability for exascale systems , 2011, 2011 International Conference for High Performance Computing, Networking, Storage and Analysis (SC).

[172]  Tim Menzies,et al.  Trends in Topics at SE Conferences (1993-2013) , 2016, 2017 IEEE/ACM 39th International Conference on Software Engineering Companion (ICSE-C).

[173]  Jack J. Dongarra,et al.  Exascale computing and big data , 2015, Commun. ACM.

[174]  Andrew A. Chien,et al.  Versioned Distributed Arrays for Resilience in Scientific Applications: Global View Resilience , 2015, ICCS.

[175]  Scott B. Baden,et al.  The UPC++ PGAS library for Exascale Computing , 2017, PAW@SC.