Design of Hadoop-based Framework for Analytics of Large Synchrophasor Datasets

Abstract The power sector is increasingly utilizing GPS-stamped real-time measurements from Phasor Measurement Units (PMU) to improve the reliability and efficiency of power grids. PMUs directly measure phase angles in real-time, which allows operators to perform grid optimization that was not possible in the past. In early 2010, almost 250 PMU's were deployed across North America and it continues to increase remarkably. However, one of the major challenges is the complexity of analyzing such a large amount of real-time datasets. The phasor data from PMU's will be accumulated in peta-bytes in coming years, which exceeds the capability of conventional relational database technologies. A new software and architecture framework is in desperate need to process such a large amount of data in real-time reliably and cost- effectively. The paper presents a new framework based on Hadoop, an open-source system widely used in the industry, to perform distributed and parallel analytics on large synchrophasor datasets. The paper demonstrates various applications of MapReduce to analyze patterns of load distribution using parallel node calculations, which can later be scaled up to match the requirements for power utility sector. The paper serves as a pilot study on data analytics on big data of smart grids.

[1]  Luigi Vanfretti,et al.  Implementation of an experimental wide-area monitoring platform for development of synchronized phasor measurement applications , 2011, 2011 IEEE Power and Energy Society General Meeting.

[2]  James S. Thorp,et al.  IEEE Standard for Synchrophasors for Power Systems , 1998 .

[3]  Hairong Kuang,et al.  The Hadoop Distributed File System , 2010, 2010 IEEE 26th Symposium on Mass Storage Systems and Technologies (MSST).

[4]  Paul Trachian Machine learning and windowed subsecond event detection on PMU data via Hadoop and the openPDC , 2010, IEEE PES General Meeting.

[5]  Garhan Attebury,et al.  Hadoop distributed file system for the Grid , 2009, 2009 IEEE Nuclear Science Symposium Conference Record (NSS/MIC).