Predicting disruptive instabilities in controlled fusion plasmas through deep learning

Nuclear fusion power delivered by magnetic-confinement tokamak reactors holds the promise of sustainable and clean energy1. The avoidance of large-scale plasma instabilities called disruptions within these reactors2,3 is one of the most pressing challenges4,5, because disruptions can halt power production and damage key components. Disruptions are particularly harmful for large burning-plasma systems such as the multibillion-dollar International Thermonuclear Experimental Reactor (ITER) project6 currently under construction, which aims to be the first reactor that produces more power from fusion than is injected to heat the plasma. Here we present a method based on deep learning for forecasting disruptions. Our method extends considerably the capabilities of previous strategies such as first-principles-based5 and classical machine-learning7–11 approaches. In particular, it delivers reliable predictions for machines other than the one on which it was trained—a crucial requirement for future large reactors that cannot afford training disruptions. Our approach takes advantage of high-dimensional training data to boost predictive performance while also engaging supercomputing resources at the largest scale to improve accuracy and speed. Trained on experimental data from the largest tokamaks in the United States (DIII-D12) and the world (Joint European Torus, JET13), our method can also be applied to specific tasks such as prediction with long warning times: this opens up the possibility of moving from passive disruption prediction to active reactor control and optimization. These initial results illustrate the potential for deep learning to accelerate progress in fusion-energy science and, more generally, in the understanding and prediction of complex physical systems.Using data from plasma-based tokamak nuclear reactors in the US and Europe, a machine-learning approach based on deep neural networks is taught to forecast disruptions, even those in machines on which the algorithm was not trained.

[1]  Carlos Silva,et al.  Fast visible camera installation and operation in JET , 2008 .

[2]  R. Aymar,et al.  The ITER design , 2002 .

[3]  Matthias Troyer,et al.  Solving the quantum many-body problem with artificial neural networks , 2016, Science.

[4]  Alex Graves,et al.  Generating Sequences With Recurrent Neural Networks , 2013, ArXiv.

[5]  Marc'Aurelio Ranzato,et al.  Large Scale Distributed Deep Networks , 2012, NIPS.

[6]  J. A. Leuer,et al.  Tokamak disruption alarm based on a neural network model of the high- beta limit , 1997 .

[7]  J. Contributors,et al.  Statistical analysis of disruptions in JET , 2009 .

[8]  Geoffrey E. Hinton,et al.  Deep Learning , 2015, Nature.

[9]  F C Schuller,et al.  Disruptions in tokamaks , 1995 .

[10]  D. A. Humphreys,et al.  Disruption mitigation studies in DIII-D , 1999 .

[11]  M. N. A. Beurskens,et al.  JET ITER-like wall—overview and experimental programme , 2011 .

[12]  J. Contributors,et al.  Survey of disruption causes at JET , 2011 .

[13]  Jet Efda Contributors,et al.  A cross-tokamak neural network disruption predictor for the JET and ASDEX Upgrade tokamaks , 2005 .

[14]  Christopher Potts,et al.  Learning Word Vectors for Sentiment Analysis , 2011, ACL.

[15]  Olga Radyvonenko,et al.  Accelerating recurrent neural network training using sequence bucketing and multi-GPU data parallelization , 2016, 2016 IEEE First International Conference on Data Stream Mining & Processing (DSMP).

[16]  J. Vega,et al.  Results of the JET real-time disruption predictor in the ITER-like wall campaigns , 2012 .

[17]  Ji Zhou,et al.  The Power of an Idea: The International Impacts of the Grand Challenges for Engineering , 2016 .

[18]  Beatrice Santorini,et al.  Building a Large Annotated Corpus of English: The Penn Treebank , 1993, CL.

[19]  Andrew P. Bradley,et al.  The use of the area under the ROC curve in the evaluation of machine learning algorithms , 1997, Pattern Recognit..

[20]  C. Lechte,et al.  Synthetic Diagnostics in the European Union Integrated Tokamak Modelling Simulation Platform , 2013 .

[21]  Shane Legg,et al.  Human-level control through deep reinforcement learning , 2015, Nature.

[22]  Olivier Sauter,et al.  Simultaneous closed-loop control of the current profile and the electron temperature profile in the TCV tokamak , 2015, 2015 American Control Conference (ACC).

[23]  Tianqi Chen,et al.  XGBoost: A Scalable Tree Boosting System , 2016, KDD.

[24]  N. W. Eidietis,et al.  Disruptions in ITER and strategies for their control and mitigation , 2015 .

[25]  J. Vega,et al.  Disruption Prediction on JET during the ILW Experimental Campaigns , 2016 .

[26]  I G J Classen,et al.  Commissioning of electron cyclotron emission imaging instrument on the DIII-D tokamak and first data. , 2010, The Review of scientific instruments.

[27]  Li Fei-Fei,et al.  ImageNet: A large-scale hierarchical image database , 2009, CVPR.

[28]  L. L. Lao,et al.  Real time equilibrium reconstruction for tokamak discharge control , 1998 .

[29]  Andy Liaw,et al.  Classification and Regression by randomForest , 2007 .

[30]  Vincent Chan,et al.  Advances and challenges in computational plasma science , 2005 .

[31]  Jet Efda Contributors,et al.  Improved feature selection based on genetic algorithms for real time disruption prediction on JET , 2012 .

[32]  Alessandra Fanni,et al.  Disruption forecasting at JET using neural networks , 2002 .

[33]  Claudio Gentile,et al.  Linear Hinge Loss and Average Margin , 1998, NIPS.

[34]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[35]  Roger G. Melko,et al.  Machine learning phases of matter , 2016, Nature Physics.

[36]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[37]  J. L. Luxon,et al.  A design retrospective of the DIII-D tokamak , 2002 .

[38]  S. Lanthaler,et al.  Overview of the JET results in support to ITER , 2017, Nuclear Fusion.

[39]  G. Pautasso,et al.  Requirements for Triggering the ITER Disruption Mitigation System , 2016 .

[40]  J Vega,et al.  Feature extraction for improved disruption prediction analysis at JET. , 2008, The Review of scientific instruments.

[41]  François Chollet,et al.  Deep Learning with Python , 2017 .

[42]  John Langford,et al.  Cost-sensitive learning by cost-proportionate example weighting , 2003, Third IEEE International Conference on Data Mining.

[43]  Julian Kates-Harbeck,et al.  Training distributed deep recurrent neural networks with mixed precision on GPU clusters , 2017, MLHPC@SC.

[44]  Luigi Fortuna,et al.  Prototype of an adaptive disruption predictor for JET based on fuzzy logic and regression trees , 2008 .

[45]  Leslie N. Smith,et al.  Cyclical Learning Rates for Training Neural Networks , 2015, 2017 IEEE Winter Conference on Applications of Computer Vision (WACV).