Research on Predicting Line Loss Rate in Low Voltage Distribution Network Based on Gradient Boosting Decision Tree

Line loss rate plays an essential role in evaluating the economic operation of power systems. However, in a low voltage (LV) distribution network, calculating line loss rate has become more cumbersome due to poor configuration of the measuring and detecting device, the difficulty in collecting operational data, and the excessive number of components and nodes. Most previous studies mainly focused on the approaches to calculate or predict line loss rate, but rarely involve the evaluation of the prediction results. In this paper, we propose an approach based on a gradient boosting decision tree (GBDT), to predict line loss rate. GBDT inherits the advantages of both statistical models and AI approaches, and can identify the complex and nonlinear relationship while computing the relative importance among variables. An empirical study on a data set in a city demonstrates that our proposed approach performs well in predicting line loss rate, given a large number of unlabeled examples. Experiments and analysis also confirmed the effectiveness of our proposed approach in anomaly detection and practical project management.

[1]  Fardin Ahmadizar,et al.  Artificial neural network development by means of a novel combination of grammatical evolution and genetic algorithm , 2015, Eng. Appl. Artif. Intell..

[2]  Ouyang Se Comprehensive Evaluation Method of Transformer Area State Oriented to Transformer Area Management and Its Application , 2015 .

[3]  Liu Kewen,et al.  A Distribution Network Theoretical Line Loss Calculation Method Based on Improved Core Vector Machine , 2011 .

[4]  Antonio Cezar de Castro Lima,et al.  A Hybrid GA–MLPNN Model for One-Hour-Ahead Forecasting of the Global Horizontal Irradiance in Elizabeth City, North Carolina , 2018, Energies.

[5]  Wang Yufei Theoretical line loss calculation based on SVR and PSO for distribution system , 2012 .

[6]  Zhijian Jin,et al.  Probabilistic Wind Power Forecasting Approach via Instance-Based Transfer Learning Embedded Gradient Boosting Decision Trees , 2019, Energies.

[7]  L. Breiman Arcing classifier (with discussion and a rejoinder by the author) , 1998 .

[8]  J. Friedman Stochastic gradient boosting , 2002 .

[9]  Hoyong Kim,et al.  Artificial neural-network based feeder reconfiguration for loss reduction in distribution systems , 1993 .

[10]  Ouyang Sen,et al.  Line-loss rate calculation model considering feeder clustering features for medium-voltage distribution network , 2016 .

[11]  J. C. Hwang,et al.  Development of simplified loss models for distribution system analysis , 1994 .

[12]  Hans-Peter Kriegel,et al.  A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise , 1996, KDD.

[13]  Chuan Ding,et al.  Prioritizing Influential Factors for Freeway Incident Clearance Time Prediction Using the Gradient Boosting Decision Trees Method , 2017, IEEE Transactions on Intelligent Transportation Systems.

[14]  Nadeem Javaid,et al.  Short-Term Electric Load and Price Forecasting Using Enhanced Extreme Learning Machine Optimization in Smart Grids , 2019, Energies.

[15]  Yanbo Che,et al.  A short-term photovoltaic power prediction model based on the gradient boost decision tree , 2018 .

[16]  Shu Gong,et al.  Prediction of Wind Power by Chaos and BP Artificial Neural Networks Approach Based on Genetic Algorithm , 2015 .

[17]  Long Chen,et al.  Short-Term Load Forecasting Using EMD-LSTM Neural Networks with a Xgboost Algorithm for Feature Importance Evaluation , 2017 .

[18]  Alexandre Lucas,et al.  EV Idle Time Estimation on Charging Infrastructure, Comparing Supervised Machine Learning Regressions , 2019, Energies.

[19]  Mahdi Farnaghi,et al.  A Varied Density-based Clustering Approach for Event Detection from Heterogeneous Twitter Data , 2019, ISPRS Int. J. Geo Inf..

[20]  Wei Zhang,et al.  Application of Relative Entropy and Gradient Boosting Decision Tree to Fault Prognosis in Electronic Circuits , 2018, Symmetry.

[21]  J. Friedman Greedy function approximation: A gradient boosting machine. , 2001 .

[22]  Umi Kalthum Ngah,et al.  Adaptive fuzzy moving K-means clustering algorithm for image segmentation , 2009, IEEE Transactions on Consumer Electronics.

[23]  D. Sun,et al.  Calculation of Energy Losses in a Distribution System , 1980, IEEE Transactions on Power Apparatus and Systems.

[24]  Xin Kai AN ADVANCED ALGORITHM BASED ON COMBINATION OF GA WITH BP TO ENERGY LOSS OF DISTRIBUTION SYSTEM , 2002 .

[25]  Seon Hyeog Kim,et al.  Multiscale PMU Data Compression via Density-Based WAMS Clustering Analysis , 2019 .

[26]  Zou Yun-fen Prediction model research of reasonable line loss for transformer district based on data mining technology , 2015 .

[27]  Zhang Jian-hai THE CALCULATION OF ENERGY LOSSES IN DISTRIBUTION SYSTEMS BASED ON RBF NETWORK WITH DYNAMIC CLUSTERING ALGORITHM , 2005 .

[28]  L. Breiman Arcing the edge , 1997 .

[29]  Servane Gey,et al.  Model selection for CART regression trees , 2005, IEEE Transactions on Information Theory.

[30]  Guo Zhizhong,et al.  DISTRIBUTION SYSTEM THEORETICAL LINE LOSS CALCULATION BASED ON LOAD OBTAINING AND MATCHING POWER FLOW , 2005 .

[31]  Liu Da-wei Calculation of line losses in distribution systems using artificial neural network aided by immune genetic algorithm , 2009 .

[32]  Jianbing Shen,et al.  Real-Time Superpixel Segmentation by DBSCAN Clustering Algorithm. , 2016, IEEE transactions on image processing : a publication of the IEEE Signal Processing Society.

[33]  Jizhen Liu,et al.  Multi-Polytope-Based Adaptive Robust Damping Control in Power Systems Using CART , 2015, IEEE Transactions on Power Systems.

[34]  Wei-Yin Loh,et al.  Classification and regression trees , 2011, WIREs Data Mining Knowl. Discov..

[35]  Hsiao-Dong Chiang,et al.  Hierarchical K-means Method for Clustering Large-Scale Advanced Metering Infrastructure Data , 2017, IEEE Transactions on Power Delivery.

[36]  Chen Yong-jin Loss study of HVDC system , 2007 .

[37]  Trevor Hastie,et al.  An Introduction to Statistical Learning , 2013, Springer Texts in Statistics.

[38]  Zhou Zhi-hui Line loss comprehensive analytical method based on real-time system data and its application , 2005 .

[39]  Victor Pankratius,et al.  Optimizing Parallel Clustering Throughput in Shared Memory , 2017, IEEE Transactions on Parallel and Distributed Systems.

[40]  Xiao Zhang,et al.  A Taxi Gap Prediction Method via Double Ensemble Gradient Boosting Decision Tree , 2017, 2017 IEEE 3rd International Conference on Big Data Security on Cloud (BigDataSecurity), IEEE International Conference on High Performance and Smart Computing, (HPSC) and IEEE International Conference on Intelligent Data and Security (IDS).

[41]  Zhang Kaika Theoretical Analysis on Distribution Network Loss Based on Load Measurement and Countermeasures to Reduce the Loss , 2013 .

[42]  Chee Peng Lim,et al.  Online Motor Fault Detection and Diagnosis Using a Hybrid FMM-CART Model , 2014, IEEE Transactions on Neural Networks and Learning Systems.