Outdoor PV System Monitoring—Input Data Quality, Data Imputation and Filtering Approaches

Photovoltaic monitoring data are the primary source for studying photovoltaic plant behavior. In particular, performance loss and remaining-useful-lifetime calculations rely on trustful input data. Furthermore, a regular stream of high quality is the basis for pro-active operation and management activities which ensure a smooth operation of PV plants. The raw data under investigation are electrical measurements and usually meteorological data such as in-plane irradiance and temperature. Usually, performance analyses follow a strict pattern of checking input data quality followed by the application of appropriate filter, choosing a key performance indicator and the application of certain methodologies to receive a final result. In this context, this paper focuses on four main objectives. We present common photovoltaics monitoring data quality issues, provide visual guidelines on how to detect and evaluate these, provide new data imputation approaches, and discuss common filtering approaches. Data imputation techniques for module temperature and irradiance data are discussed and compared to classical approaches. This work is intended to be a soft introduction into PV monitoring data analysis discussing best practices and issues an analyst might face. It was seen that if a sufficient amount of training data is available, multivariate adaptive regression splines yields good results for module temperature imputation while histogram-based gradient boosting regression outperforms classical approaches for in-plane irradiance transposition. Based on tested filtering procedures, it is believed that standards should be developed including relatively low irradiance thresholds together with strict power-irradiance pair filters.

[1]  C. Long,et al.  From Dimming to Brightening: Decadal Changes in Solar Radiation at Earth's Surface , 2005, Science.

[2]  William E. Boyson,et al.  Photovoltaic array performance model. , 2004 .

[3]  Oktoviano Gandhi,et al.  Can we justify producing univariate machine-learning forecasts with satellite-derived solar irradiance? , 2020 .

[4]  Atse Louwen,et al.  PV system monitoring and characterization : Chapter 11.4 , 2017 .

[5]  José Luís Calvo-Rolle,et al.  Missing Data Imputation of Solar Radiation Data under Different Atmospheric Conditions , 2014, Sensors.

[6]  P. Ineichen,et al.  A new simplified version of the perez diffuse irradiance model for tilted surfaces , 1987 .

[7]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[8]  Clifford W. Hansen,et al.  Pvlib Python: a Python Package for Modeling Solar Energy Systems , 2018, J. Open Source Softw..

[9]  W. Beckman,et al.  Evaluation of hourly tilted surface radiation models , 1990 .

[10]  Pierre Geurts,et al.  Extremely randomized trees , 2006, Machine Learning.

[11]  W. Beckman,et al.  Diffuse fraction correlations , 1990 .

[12]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[13]  Gilles Notton,et al.  Calculation of the polycrystalline PV module temperature using a simple method of energy balance , 2006 .

[14]  Clifford W. Hansen,et al.  Identification of periods of clear sky irradiance in time series of GHI measurements , 2016 .

[15]  Marko Topič,et al.  Advanced PV Performance Modelling Based on Different Levels of Irradiance Data Accuracy , 2020 .

[16]  George Makrides,et al.  Recent advances in failure diagnosis techniques based on performance data analysis for grid-connected photovoltaic systems , 2019, Renewable Energy.

[17]  Haydar Demirhan,et al.  Missing value imputation for short to mid-term horizontal solar irradiance data , 2018, Applied Energy.

[18]  J. Friedman Greedy function approximation: A gradient boosting machine. , 2001 .

[19]  A. Urbina,et al.  Evaluation of Solar Radiation Transposition Models for Passive Energy Management and Building Integrated Photovoltaics , 2020 .

[20]  Sarah Kurtz,et al.  Analysis of Photovoltaic System Energy Performance Evaluation Method , 2013 .

[21]  T. M. Klucher Evaluation of models to predict insolation on tilted surfaces , 1978 .

[22]  E. Dunlop,et al.  A power-rating model for crystalline silicon PV modules , 2011 .

[23]  Nicholas A. Engerer,et al.  QCPV: A quality control algorithm for distributed photovoltaic array power output , 2017 .

[24]  E. Skoplaki,et al.  A simple correlation for the operating temperature of photovoltaic modules of arbitrary mounting , 2008 .

[25]  B. Rudolf,et al.  World Map of the Köppen-Geiger climate classification updated , 2006 .

[26]  Katherine A. Klise,et al.  Automated performance monitoring for PV systems using pecos , 2016, 2016 IEEE 43rd Photovoltaic Specialists Conference (PVSC).