Data processing and quality verification for improved photovoltaic performance and reliability analytics

Data integrity is crucial for the performance and reliability analysis of photovoltaic (PV) systems, since actual in‐field measurements commonly exhibit invalid data caused by outages and component failures. The scope of this paper is to present a complete methodology for PV data processing and quality verification in order to ensure improved PV performance and reliability analyses. Data quality routines (DQRs) were developed to ensure data fidelity by detecting and reconstructing invalid data through a sequence of filtering stages and inference techniques. The obtained results verified that PV performance and reliability analyses are sensitive to the fidelity of data and, therefore, time series reconstruction should be handled appropriately. To mitigate the bias effects of 10% or less invalid data, the listwise deletion technique provided accurate results for performance analytics (exhibited a maximum absolute percentage error of 0.92%). When missing data rates exceed 10%, data inference techniques yield more accurate results. The evaluation of missing power measurements demonstrated that time series reconstruction by applying the Sandia PV Array Performance Model yielded the lowest error among the investigated data inference techniques for PV performance analysis, with an absolute percentage error less than 0.71%, even at 40% missing data rate levels. The verification of the routines was performed on historical datasets from two different locations (desert and steppe climates). The proposed methodology provides a set of standardized analytical procedures to ensure the validity of performance and reliability evaluations that are performed over the lifetime of PV systems.

[1]  C. B. Jones,et al.  Modeling nonlinear photovoltaic degradation rates , 2020, 2020 47th IEEE Photovoltaic Specialists Conference (PVSC).

[2]  Joshua S. Stein,et al.  Transient Weighted Moving-Average Model of Photovoltaic Module Back-Surface Temperature , 2020, IEEE Journal of Photovoltaics.

[3]  George Makrides,et al.  Nonlinear Photovoltaic Degradation Rates: Modeling and Comparison Against Conventional Methods , 2020, IEEE Journal of Photovoltaics.

[4]  M. Topič,et al.  Methodology of Köppen-Geiger-Photovoltaic climate classification and implications to worldwide mapping of PV system performance , 2019, Solar Energy.

[5]  George Makrides,et al.  Recent advances in failure diagnosis techniques based on performance data analysis for grid-connected photovoltaic systems , 2019, Renewable Energy.

[6]  Clifford W. Hansen,et al.  Pvlib Python: a Python Package for Modeling Solar Energy Systems , 2018, J. Open Source Softw..

[7]  William F. Holmgren,et al.  pvlib python: a python package for modeling solar energy systems , 2018, Journal of Open Source Software.

[8]  Haydar Demirhan,et al.  Missing value imputation for short to mid-term horizontal solar irradiance data , 2018, Applied Energy.

[9]  George Makrides,et al.  Five-year performance and reliability analysis of monocrystalline photovoltaic modules with different backsheet materials , 2018, Solar Energy.

[10]  Ioannis P. Panapakidis,et al.  A missing data treatment method for photovoltaic installations , 2018, 2018 IEEE International Energy Conference (ENERGYCON).

[11]  Thomas R. Betts,et al.  Satellite or ground-based measurements for production of site specific hourly irradiance data: Which is most accurate and where? , 2018 .

[12]  S. Kurtz,et al.  Robust PV Degradation Methodology and Application , 2018, IEEE Journal of Photovoltaics.

[13]  Dirk C. Jordan,et al.  PV degradation curves: non‐linearities and failure modes , 2017 .

[14]  G. Makrides,et al.  Impact of Missing Data on the Estimation of Photovoltaic System Degradation Rate , 2017 .

[15]  Nicholas A. Engerer,et al.  QCPV: A quality control algorithm for distributed photovoltaic array power output , 2017 .

[16]  Katherine A. Klise,et al.  Automated performance monitoring for PV systems using pecos , 2016, 2016 IEEE 43rd Photovoltaic Specialists Conference (PVSC).

[17]  Eleni Koubli,et al.  Inference of missing data in photovoltaic monitoring datasets , 2016 .

[18]  Xiaochen Zhang,et al.  Handling bad or missing smart meter data through advanced data imputation , 2016, 2016 IEEE Power & Energy Society Innovative Smart Grid Technologies Conference (ISGT).

[19]  Radu Platon,et al.  Online Fault Detection in PV Systems , 2015, IEEE Transactions on Sustainable Energy.

[20]  George Makrides,et al.  Review of photovoltaic degradation rate methodologies , 2014 .

[21]  José Luís Calvo-Rolle,et al.  Missing Data Imputation of Solar Radiation Data under Different Atmospheric Conditions , 2014, Sensors.

[22]  Ye Zhao,et al.  Fault experiments in a commercial-scale PV laboratory and fault detection using local outlier factor , 2014, 2014 IEEE 40th Photovoltaic Specialist Conference (PVSC).

[23]  Clifford W. Hansen,et al.  Weather-Corrected Performance Ratio , 2013 .

[24]  Francisco Herrera,et al.  On the choice of the best imputation methods for missing values considering three groups of classification methods , 2012, Knowledge and Information Systems.

[25]  Stef van Buuren,et al.  MICE: Multivariate Imputation by Chained Equations in R , 2011 .

[26]  E. Dunlop,et al.  A power-rating model for crystalline silicon PV modules , 2011 .

[27]  F. Fabero,et al.  Results of the 3rd Modelling Round Robin within the European Project „PERFORMANCE”– Comparison of Module Energy Rating Methods , 2010 .

[28]  H. Beyer,et al.  Quality of performance assessment of PV plants based on irradiation maps , 2008 .

[29]  J. A. del Cueto,et al.  Comparison of Degradation Rates of Individual Modules Held at Maximum Power , 2006, 2006 IEEE 4th World Conference on Photovoltaic Energy Conference.

[30]  Jerzy W. Grzymala-Busse,et al.  Handling Missing Attribute Values in Preterm Birth Data Sets , 2005, RSFDGrC.

[31]  Craig K. Enders,et al.  Missing Data in Educational Research: A Review of Reporting Practices and Suggestions for Improvement , 2004 .

[32]  William E. Boyson,et al.  Photovoltaic array performance model. , 2004 .

[33]  Jitender S. Deogun,et al.  Towards Missing Data Imputation: A Study of Fuzzy K-means Clustering Method , 2004, Rough Sets and Current Trends in Computing.

[34]  Nicole A. Lazar,et al.  Statistical Analysis With Missing Data , 2003, Technometrics.

[35]  Gustavo E. A. P. A. Batista,et al.  An analysis of four missing data treatment methods for supervised learning , 2003, Appl. Artif. Intell..

[36]  R. Little Missing-Data Adjustments in Large Surveys , 1988 .

[37]  Marios Theristis,et al.  Chapter II-1-B – Energy Yield in Photovoltaic Systems , 2018 .

[38]  Eleni Koubli,et al.  Impact of data quality on photovoltaic (PV) performance assessment , 2017 .

[39]  George Makrides,et al.  ESTIMATION OF THE DEGRADATION RATE OF FIELDED PHOTOVOLTAIC ARRAYS IN THE PRESENCE OF MEASUREMENT OUTAGES , 2017 .

[40]  Iea Pvps,et al.  Analytical Monitoring of Grid-connected Photovoltaic Systems , 2014 .

[41]  Iea Pvps,et al.  Analytical Monitoring of Grid-connected Photovoltaic Systems Good Practices for Monitoring and Performance Analysis , 2013 .

[42]  Steven C. Wheelwright,et al.  Forecasting: Methods and Applications, 3rd Ed , 1997 .

[43]  R. G. Ross,et al.  Interface design considerations for terrestrial solar cell modules , 1976 .