Exploratory Data Analysis Techniques to Determine the Dimensionality of Complex Nonlinear Phenomena: The L-to-H Transition at JET as a Case Study

A strategy to identify and select the most relevant variables to study problems in the exact sciences, when large databases of data have to be explored, is formulated. It consists of a first exploratory stage, performed mainly with the classification and regression tree method, to determine the list of most relevant signals to be used in the analysis of the phenomenon of interest. A linear correlation technique, followed by a nonlinear correlation technique (principal component analysis and autoassociative neural networks (NNs), respectively), is then applied to reduce the number of signals to the ones containing nonredundant information. The potential of the approach is illustrated by an application to the problem of identifying the confinement regime in the Joint European Torus. The minimum set of signals has been used to train an NN, and its performance is compared with that of various theoretical models. The success rate of the NN is very high, and it generally further outperforms the available theoretical models.

[1]  A. L. Rogister Ionization instability driven magnetic island belt, particle confinement and edge profiles , 1994 .

[2]  Wei-Yin Loh,et al.  Classification and regression trees , 2011, WIREs Data Mining Knowl. Discov..

[3]  Heekuck Oh,et al.  Neural Networks for Pattern Recognition , 1993, Adv. Comput..

[4]  H. R. Wilson,et al.  REVIEW ARTICLE: A review of theories of the L-H transition , 2000 .

[5]  G. Janeschitz,et al.  The Scaling of the Edge Temperature in Tokamaks Based on the Alfven Drift - Wave Turbulence , 1998 .

[6]  M. Kramer Nonlinear principal component analysis using autoassociative neural networks , 1991 .

[7]  P. Green,et al.  Analyzing multivariate data , 1978 .

[8]  Y. R. Martin,et al.  H-mode access on JET and implications for ITER , 2008 .

[9]  Danilo Rastovic,et al.  Targeting and synchronization at tokamak with recurrent artificial neural networks , 2012, Neural Computing and Applications.

[10]  David R. Anderson,et al.  Model Selection and Multimodel Inference , 2003 .

[11]  P. Arena,et al.  Fuzzy logic and support vector machine approaches to regime identification in JET , 2006, IEEE Transactions on Plasma Science.

[12]  A. Chankin,et al.  Critical parameters for turbulent transport in the SOL: mechanism for the L - H transition and its impact on the H-mode power threshold and density limits , 1997 .

[13]  G. Saibene,et al.  Interpretation of density limits and the H-mode operational diagram through similarity parameters for edge transport mechanisms , 1999 .

[14]  Danilo Rastović Applications of artificial intelligence and multi-variable control of chaos on tokamak equilibriums , 2010 .

[15]  F. Wagner,et al.  Regime of Improved Confinement and High Beta in Neutral-Beam-Heated Divertor Discharges of the ASDEX Tokamak , 1982 .