A big data analytical framework for analyzing solar energy receptors using evolutionary computing approach

Data science has been empowered with the emerging concept of big data enabling data scalability in many ways. Effective prediction systems for complex analytical problems dealing with big data can be created using evolutionary computing, associate feature selection and reduction techniques. In the current work, we put forward a big data analytical scheme to analyze solar energy receptors based on a set of features. Correct estimation of pressure loss coefficients (PLC) greatly improves the design of a solar collector. Evaluation of PLC is a time and resource consuming process as the flow rate and Reynolds number changes at every junction. Moreover, a suitable and appropriate algebraic expression is not yet defined in the laminar region of flow for approximation of the complex relationship among different geometrical features and flow variables. The overall heat gain of the solar receptor is dependent upon flow rates and flow distribution in risers. Also, the local disturbances during the flow division and combining process from manifold to risers affects the performance of the solar collector. Owing to these reasons, mostly they are calculated using experiments, primarily due to the complexity involved. The proposed big data framework involves acquiring huge feature sets at each point along the flow of thermal fluid. The data is experimentally acquired in a set of around forty features for large number of Reynolds number and discharge ratio variations. Reynolds number varies from 200 to 15,000 while discharge ratio variation is in the range of 0–1. Feature reduction in the big data set is done by calculating the relevancy score using ReliefF algorithm that extracts the most relevant features. Later, the framework employs a suitably selected optimal ANN architecture of layers, neurons and activation functions. The selected topology is trained using reduced features sets using Levenberg–Marquardt backpropagation algorithm. Test and validation results bespeaks the efficacy of the proposed strategy and indicate that future PLC values can be forecasted close to experimental data. The relative percent error is around 10% of the experimental data set and is found better than computational fluid dynamics based approaches in terms of memory and processing time.

[1]  Mykola Pechenizkiy,et al.  Feature selection is the ReliefF for multiple instance learning , 2010, 2010 10th International Conference on Intelligent Systems Design and Applications.

[2]  Abdul Waheed Badar,et al.  CFD based analysis of flow distribution in a coaxial vacuum tube solar collector with laminar flow conditions , 2012, International Journal of Energy and Environmental Engineering.

[3]  Tarlochan Kaur,et al.  Development of ANN Based Model for Solar Potential Assessment Using Various Meteorological Parameters , 2016 .

[4]  Shahzad Yousaf,et al.  Calculation of Pressure Loss Coefficients in Combining Flows of a Solar Collector using Artificial Neural Networks , 2018 .

[5]  Syed Ismail Shah,et al.  Computing Deblurred Time-Frequency Distributions Using Artificial Neural Networks , 2008 .

[6]  Le Gruenwald,et al.  Data Stream Management: Aggregation, Classification, Modeling, and Operator Placement , 2008, IEEE Internet Computing.

[7]  Hans-Peter Kriegel,et al.  Can Shared-Neighbor Distances Defeat the Curse of Dimensionality? , 2010, SSDBM.

[8]  Syed Ismail Shah,et al.  Evolutionary time-frequency distributions using Bayesian regularised neural network model , 2007 .

[9]  J. Manyika Big data: The next frontier for innovation, competition, and productivity , 2011 .

[10]  Emmanuel Müller,et al.  Statistical selection of relevant subspace projections for outlier ranking , 2011, 2011 IEEE 27th International Conference on Data Engineering.

[11]  T.A.A. Victoire,et al.  Particle swarm approach for retiming in VLSI , 2003, 2003 46th Midwest Symposium on Circuits and Systems.

[12]  Yılmaz Kaya,et al.  Comparison of ANN and MLR models for estimating solar radiation in Turkey using NOAA/AVHRR data , 2013 .

[13]  S. Furbo,et al.  A numerical model for pressure drop and flow distribution in a solar collector with U-connected absorber pipes , 2016 .

[14]  Mahmoud Al-Ayyoub,et al.  Feature extraction and selection for Arabic tweets authorship authentication , 2017, J. Ambient Intell. Humaniz. Comput..

[15]  Hans-Peter Kriegel,et al.  Local outlier detection reconsidered: a generalized view on locality with applications to spatial, video, and network outlier detection , 2012, Data Mining and Knowledge Discovery.

[16]  Samir Moujaes,et al.  Three-dimensional CFD predications and experimental comparison of pressure drop of some common pipe fittings in turbulent flow , 2006 .

[17]  Zhen Zhu,et al.  Optimized Approximation Algorithm in Neural Networks Without Overfitting , 2008, IEEE Transactions on Neural Networks.

[18]  Mohammad Karamouz,et al.  Input data selection for solar radiation estimation , 2009 .

[19]  Urszula Stanczyk RELIEF-based Selection of Decision Rules , 2014, KES.

[20]  G P Blair,et al.  An Improved Branched Pipe Model for Multi-Cylinder Automotive Engine Calculations , 1985 .

[21]  Martin Hilbert,et al.  The World’s Technological Capacity to Store, Communicate, and Compute Information , 2011, Science.

[22]  Toly Chen,et al.  Fuzzy neural network approach to optimizing process performance by using multiple responses , 2016, J. Ambient Intell. Humaniz. Comput..

[23]  Niranjan Kumar Injeti,et al.  Numerical prediction of pressure loss of fluid in a T-junction , 2013 .

[24]  I. E. Idelchik,et al.  Flow Resistance : A Design Guide for Engineers , 1989 .

[25]  Xindong Wu,et al.  Data mining with big data , 2014, IEEE Transactions on Knowledge and Data Engineering.

[26]  C. L. Philip Chen,et al.  Data-intensive applications, challenges, techniques and technologies: A survey on Big Data , 2014, Inf. Sci..

[27]  M. Bassett,et al.  Calculation of steady flow pressure loss coefficients for pipe junctions , 2001 .

[28]  Ivor W. Tsang,et al.  The Emerging "Big Dimensionality" , 2014, IEEE Computational Intelligence Magazine.

[29]  Anand Paul,et al.  Internet of Things: A primer , 2019, Human Behavior and Emerging Technologies.

[30]  Soteris A. Kalogirou,et al.  Artificial neural networks in renewable energy systems applications: a review , 2001 .

[31]  Hans-Peter Kriegel,et al.  A survey on unsupervised outlier detection in high‐dimensional numerical data , 2012, Stat. Anal. Data Min..

[32]  Engin Gedik,et al.  Investigation on thermal performance calculation of two type solar air collectors using artificial neural network , 2011, Expert Syst. Appl..

[33]  Marko Robnik-Sikonja,et al.  Theoretical and Empirical Analysis of ReliefF and RReliefF , 2003, Machine Learning.

[34]  Howard Gobioff,et al.  The Google file system , 2003, SOSP '03.

[35]  Plamen P. Angelov,et al.  An evolving machine learning method for human activity recognition systems , 2013, J. Ambient Intell. Humaniz. Comput..

[36]  Martin Hilbert,et al.  Info Capacity| How to Measure the World’s Technological Capacity to Communicate, Store and Compute Information? Part I: Results and Scope , 2012 .

[37]  Jason Venner Getting Started with Hadoop Core , 2009 .

[38]  Kevin P. Murphy,et al.  Machine learning - a probabilistic perspective , 2012, Adaptive computation and machine learning series.

[39]  Andreas Richter,et al.  Flow distribution in solar collectors with laminar flow conditions , 2002 .

[40]  Pierluigi Siano,et al.  Optimal allocation of wind turbines in microgrids by using genetic algorithm , 2013, J. Ambient Intell. Humaniz. Comput..

[41]  Cagdas Hakan Aladag,et al.  A new architecture selection method based on tabu search for artificial neural networks , 2011, Expert Syst. Appl..

[42]  S T Roweis,et al.  Nonlinear dimensionality reduction by locally linear embedding. , 2000, Science.

[43]  W H Hager,et al.  An Approximate Treatment of Flow in Branches and Bends , 1984 .

[44]  Soteris A. Kalogirou,et al.  Machine learning methods for solar radiation forecasting: A review , 2017 .

[45]  Noam Lior,et al.  Flow distribution in manifolded solar collectors with negligible buoyancy effects , 1994 .

[46]  Gunter Rockendorf,et al.  Internal thermal coupling in direct-flow coaxial vacuum tube collectors , 2010 .

[47]  Douglas C. Montgomery Big Data and the Quality Profession , 2014, Qual. Reliab. Eng. Int..

[48]  Jianyu Yang,et al.  Object-oriented feature selection of high spatial resolution images using an improved Relief algorithm , 2013, Math. Comput. Model..

[49]  Farzin Salmasi,et al.  Predicting discharge coefficient of compound broad-crested weir by using genetic programming (GP) and artificial neural network (ANN) techniques , 2013, Arabian Journal of Geosciences.