A Survey of Computational Tools in Solar Physics

The SunPy Project developed a 13-question survey to understand the software and hardware usage of the solar-physics community. Of the solar-physics community, 364 members across 35 countries responded to our survey. We found that 99 ± 0.5 $99\pm 0.5$ % of respondents use software in their research and 66% use the Python scientific-software stack. Students are twice as likely as faculty, staff scientists, and researchers to use Python rather than Interactive Data Language (IDL). In this respect, the astrophysics and solar-physics communities differ widely: 78% of solar-physics faculty, staff scientists, and researchers in our sample uses IDL, compared with 44% of astrophysics faculty and scientists sampled by Momcheva and Tollerud ( 2015 ). 63 ± 4 $63\pm 4$ % of respondents have not taken any computer-science courses at an undergraduate or graduate level. We also found that most respondents use consumer hardware to run software for solar-physics research. Although 82% of respondents work with data from space-based or ground-based missions, some of which ( e.g. the Solar Dynamics Observatory and Daniel K. Inouye Solar Telescope ) produce terabytes of data a day, 14% use a regional or national cluster, 5% use a commercial cloud provider, and 29% use exclusively a laptop or desktop. Finally, we found that 73 ± 4 $73\pm 4$ % of respondents cite scientific software in their research, although only 42 ± 3 $42\pm 3$ % do so routinely.

[1]  E. R. Cohen An Introduction to Error Analysis: The Study of Uncertainties in Physical Measurements , 1998 .

[2]  Alexander G. Gray,et al.  Introduction to astroML: Machine learning for astrophysics , 2012, 2012 Conference on Intelligent Data Understanding.

[3]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[4]  Adam Croom,et al.  Roads and Bridges: The Unseen Labor Behind Our Digital Infrastructure / Ford Foundation , 2016 .

[5]  Erik Tollerud,et al.  Software Use in Astronomy: an Informal Survey , 2015, ArXiv.

[6]  S. Freeland,et al.  Data Analysis with the SolarSoft System , 1998 .

[7]  Kevin Reardon,et al.  The SunPy Project: Open Source Development and Status of the Version 1.0 Core Package , 2020, The Astrophysical Journal.

[8]  Division on Earth,et al.  Reproducibility and Replicability in Science , 2019 .

[9]  Joel Nothman,et al.  SciPy 1.0-Fundamental Algorithms for Scientific Computing in Python , 2019, ArXiv.

[10]  Wes McKinney,et al.  Data Structures for Statistical Computing in Python , 2010, SciPy.

[11]  S. P. Littlefair,et al.  THE ASTROPY PROJECT: BUILDING AN INCLUSIVE, OPEN-SCIENCE PROJECT AND STATUS OF THE V2.0 CORE PACKAGE , 2018 .

[12]  Ting Li,et al.  Sustaining Community-Driven Software for Astronomy in the 2020s , 2019 .

[13]  Gaël Varoquaux,et al.  The NumPy Array: A Structure for Efficient Numerical Computation , 2011, Computing in Science & Engineering.

[14]  Alexander S. Szalay,et al.  Petabytes to science , 2019 .

[15]  Stephan Hoyer,et al.  mwaskom/seaborn: v0.10.0 (January 2020) , 2020 .

[16]  Miguel de Val-Borro,et al.  The Astropy Project: Building an Open-science Project and Status of the v2.0 Core Package , 2018, The Astronomical Journal.

[17]  John D. Hunter,et al.  Matplotlib: A 2D Graphics Environment , 2007, Computing in Science & Engineering.

[18]  Matthew Rocklin,et al.  Dask: Parallel Computation with Blocked algorithms and Task Scheduling , 2015, SciPy.

[19]  Jon F. Claerbout,et al.  Electronic documents give reproducible research a new meaning: 62nd Ann , 1992 .

[20]  David L. Donoho,et al.  WaveLab and Reproducible Research , 1995 .

[21]  J. Tinsley Oden,et al.  Research and Education in Computational Science and Engineering , 2016, ArXiv.