Automated Classification of Plasma Regions Using 3D Particle Energy Distributions

Even though automatic classification and interpretation would be highly desired features for the Magnetospheric Multiscale mission (MMS), the gold rush era in machine learning has yet to reach the science done with observations collected by MMS. We investigate the properties of the ion sky maps produced by the Dual Ion Spectrometers (DIS) from the Fast Plasma Investigation (FPI). Running the Principal Component Analysis (PCA) on a mixed subset of the data suggests that more than 500 components are needed to cover 80% of the variance. Hence, simple machine learning techniques might not deal with classification of plasma regions. Use of a three-dimensional (3D) convolutional autoencoder (3D-CAE) allows to reduce the data dimensionality by 128 times while still maintaining a decent quality energy distribution. However, k-means clustering computed over the compressed data is not capable of separating measurements according to the physical properties of the plasma. A three-dimensional convolutional neural network (3D-CNN), trained on a rather small amount of human labelled training examples is able to predict plasma regions with 99% accuracy. The low probability predictions of the 3D-CNN reveal the mixed state regions, such as the magnetopause or bow shock, which are of key interest to researchers of the MMS mission. The 3D-CNN and data processing software could easily be deployed on ground-based computers and provide classification for the whole MMS database. Data processing through the trained 3D-CNN is fast and efficient, opening up the possibility for deployment in data-centers or in situ operation onboard the spacecraft.

[1]  Enrico Camporeale,et al.  The Challenge of Machine Learning in Space Weather: Nowcasting and Forecasting , 2019, Space Weather.

[2]  Wolfgang Baumjohann,et al.  The Magnetospheric Multiscale Magnetometers , 2016 .

[3]  Geoffrey E. Hinton,et al.  Deep Learning , 2015, Nature.

[4]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[5]  Sebastian Scherer,et al.  VoxNet: A 3D Convolutional Neural Network for real-time object recognition , 2015, 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[6]  Heng Tao Shen,et al.  Principal Component Analysis , 2009, Encyclopedia of Biometrics.

[7]  Dana H. Ballard,et al.  Modular Learning in Neural Networks , 1987, AAAI.

[8]  LIII , 2018, Out of the Shadow.

[9]  Vivek Kothari,et al.  The Final Frontier: Deep Learning in Space , 2020, HotMobile.

[10]  Karl Pearson F.R.S. LIII. On lines and planes of closest fit to systems of points in space , 1901 .

[11]  Thomas E. Moore,et al.  Magnetospheric Multiscale Overview and Science Objectives , 2016 .

[12]  S. Petrinec,et al.  Magnetospheric Multiscale Science Mission Profile and Operations , 2016 .

[13]  C. Norgren,et al.  Large‐scale characteristics of reconnection diffusion regions and associated magnetopause crossings observed by MMS , 2017 .

[14]  U. Gliese,et al.  Fast Plasma Investigation for Magnetospheric Multiscale , 2016 .

[15]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[16]  C. K. Pankratz,et al.  Magnetospheric Multiscale Instrument Suite Operations and Data System , 2016 .

[17]  H. Hotelling Analysis of a complex of statistical variables into principal components. , 1933 .

[18]  Marek Petrik,et al.  MMS SITL Ground Loop: Automating the Burst Data Selection Process , 2020, Frontiers in Astronomy and Space Sciences.

[19]  Samantha Piatt,et al.  Large-Scale Statistical Survey of Magnetopause Reconnection , 2019, ArXiv.

[20]  Jorge Amaya,et al.  Automatic Classification of Plasma Regions in Near-Earth Space With Supervised Machine Learning: Application to Magnetospheric Multi Scale 2016–2019 Observations , 2020, Frontiers in Astronomy and Space Sciences.