Techniques Tanimoto correlated feature selection system and hybridization of clustering and boosting ensemble classification of remote sensed big data for weather forecasting

Abstract Weather forecasting has been done using various techniques but still not efficient for handling the big remote sensed data since the data comprises the more features. Hence the techniques degrade the forecasting accuracy and take more prediction time. To enhance the prediction accuracy (PA) with minimal time, Tanimoto Correlation based Combinatorial MAP Expected Clustering and Linear Program Boosting Classification (TC-CMECLPBC) Technique is proposed. At first, the data and features are gathered from big weather database. After that, relevant features are selected through finding the similarity between the features. Tanimoto Correlation Coefficient is used to find the similarity between the features for selecting the relevant features with higher feature selection accuracy. After selecting the relevant features, MAP expected clustering process is carried out to group the weather data for cluster formation. In this process, a number of cluster and cluster centroids are initialized. In this clustering process, it includes two steps namely expectation (E) and maximization (M) to discover maximum probability for grouping data into the cluster. After that, the clustering result is given to Linear Program boosting classifier to improve the prediction performance. In this classification, the weak classifier results are boosted to create strong classifier. The results evident that the TC-CMECLPBC technique enhance the PA with lesser time and false positive rate (FPR) than the conventional methods.

[1]  Michael B. Richman,et al.  Adaptive Machine Learning Approaches to Seasonal Prediction of Tropical Cyclones , 2012, Complex Adaptive Systems.

[2]  Andrew Kusiak,et al.  Modeling and Prediction of Rainfall Using Radar Reflectivity Data: A Data-Mining Approach , 2013, IEEE Transactions on Geoscience and Remote Sensing.

[3]  Liang Zou,et al.  Fast prediction of typhoon tracks based on a similarity method and GIS , 2013 .

[4]  Andrew E. Mercer,et al.  Multidimensional Kernel Principal Component Analysis of False Alarms of Rapidly Intensifying Atlantic Tropical Cyclones , 2018 .

[5]  M. Yamaguchi,et al.  Selective Ensemble Mean Technique for Tropical Cyclone Track Forecasts Using Multi-Model Ensembles , 2015 .

[6]  Trong Hai Duong,et al.  ENSO-based tropical cyclone forecasting using CF-ANFIS , 2016, Vietnam Journal of Computer Science.

[7]  Jiang Cheng,et al.  Intelligent Combined Prediction of Wind Power Based on Numerical Weather Prediction and Fuzzy Clustering , 2015 .

[8]  S. S. Bedi,et al.  Weather Forecasting Using Sliding Window Algorithm , 2013 .

[9]  Federico Divina,et al.  A multivariate approach to the symmetrical uncertainty measure: Application to feature selection problem , 2019, Inf. Sci..

[10]  Yintong Wang,et al.  Unsupervised Representative Feature Selection Algorithm Based on Information Entropy and Relevance Analysis , 2018, IEEE Access.

[11]  Li Xia,et al.  Analysis on Abnormal Tropical Cyclone Track Forecast Error of Ecmwf-Ifs in the Western North Pacific , 2016 .

[12]  Nan Li,et al.  A Quality Control Method of Ground-Based Weather Radar Data Based on Statistics , 2018, IEEE Transactions on Geoscience and Remote Sensing.

[13]  Kuo-Ming Chao,et al.  Dynamic fine-tuning stacked auto-encoder neural network for weather forecast , 2018, Future Gener. Comput. Syst..

[14]  Pabitra Mitra,et al.  Fuzzy Clustering-Based Ensemble Approach to Predicting Indian Monsoon , 2015 .

[15]  Rüdiger Westermann,et al.  Time-Hierarchical Clustering and Visualization of Weather Forecast Ensembles , 2017, IEEE Transactions on Visualization and Computer Graphics.

[16]  Wensheng Gao,et al.  Prediction of Tropical Cyclones’ Characteristic Factors on Hainan Island Using Data Mining Technology , 2014 .

[17]  Stephan Stephany,et al.  A new classification approach for detecting severe weather patterns , 2013, Comput. Geosci..

[18]  Chih-Chiang Wei,et al.  Wavelet kernel support vector machines forecasting techniques: Case study on water-level predictions during typhoons , 2012, Expert Syst. Appl..

[19]  Jun Wang,et al.  A prediction scheme of tropical cyclone frequency based on lasso and random forest , 2018, Theoretical and Applied Climatology.

[20]  Gwo-Fong Lin,et al.  Development of an effective data-driven model for hourly typhoon rainfall forecasting , 2013 .

[21]  Huantong Geng,et al.  A prediction scheme for the frequency of summer tropical cyclone landfalling over China based on data mining methods , 2016 .

[22]  Lei Han,et al.  A tropical cyclone similarity search algorithm based on deep learning method , 2018, Atmospheric Research.