Finding Relevant Parameters for the Thin‐film Photovoltaic Cells Production Process with the Application of Data Mining Methods

A data mining approach is proposed as a useful tool for the control parameters analysis of the 3‐stage CIGSe photovoltaic cell production process, in order to find variables that are the most relevant for cell electric parameters and efficiency. The analysed data set consists of stage duration times, heater power values as well as temperatures for the element sources and the substrate – there are 14 variables per sample in total. The most relevant variables of the process have been found based on the so‐called random forest analysis with the application of the Boruta algorithm. 118 CIGSe samples, prepared at Institut des Matériaux Jean Rouxel, were analysed. The results are close to experimental knowledge on the CIGSe cells production process. They bring new evidence to production parameters of new cells and further research.