A new Ensemble based multi-agent system for prediction problems: Case study of modeling coal free swelling index

Abstract In this article, a new ensemble based multi-agent system called “EMAS” is introduced for prediction of problems in data mining. The EMAS is constructed using a four-layer multi-agent system architecture to generate a data mining process based on the coordination of intelligent agents. The EMAS performance is based on data preprocessing and prediction. The first layer is dedicated to clean and normalize data. The second layer is designed for data preprocessing by using intelligent variable ranking to select the most effective agents (select the most important input variables to model an output variable). In the third layer, a negative correlation learning (NCL) algorithm is used to train a neural network ensemble (NNE). Fourth layer is dedicated to do three different subtasks including; knowledge discovery, prediction and data presentation. The ability of the EMAS is evaluated by using a robust coal database (3238 records) for prediction of Free Swelling Index (FSI) as an important problem in coke making industry, and comparing the outcomes with the results of other conventional modeling methods Coal particles have complex structures and EMAS can explore complicated relationships between their structural parameters and select the most important ones for FSI modeling. The results show that the EMAS outperforms all presented modeling methods; therefore, it can be considered as a suitable tool for prediction of problems. Moreover, the results indicated that the EMAS can be further employed as a reliable tool to select important variables, predict complicated problems, model, control, and optimize fuel consumption in iron making plants and other energy facilities.

[1]  Lars Kai Hansen,et al.  Neural Network Ensembles , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[2]  Matthias Klusch,et al.  Distributed data mining and agents , 2005, Eng. Appl. Artif. Intell..

[3]  Gerhard Weiss,et al.  Multiagent systems: a modern approach to distributed artificial intelligence , 1999 .

[4]  S. S. Matin,et al.  Explaining relationships between coke quality index and coal properties by Random Forest method , 2016 .

[5]  S. S. Matin,et al.  Modeling of free swelling index based on variable importance measurements of parent coal properties by random forest method , 2016 .

[6]  Nasser Ghasem-Aghaee,et al.  Modeling and implementing an agent-based system for prediction of protein relative solvent accessibility , 2011, Expert Syst. Appl..

[7]  Sotiris B. Kotsiantis,et al.  Combining bagging, boosting, rotation forest and random subspace methods , 2011, Artificial Intelligence Review.

[8]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[9]  Esmaeil Hadavandi,et al.  A bat-neural network multi-agent system (BNNMAS) for stock price prediction: Case study of DAX stock price , 2015, Appl. Soft Comput..

[10]  Frans Coenen,et al.  EMADS: An extendible multi-agent data miner , 2009, Knowl. Based Syst..

[11]  Arash Ghanbari,et al.  Developing a hybrid artificial intelligence model for outpatient visits forecasting in hospitals , 2012, Appl. Soft Comput..

[12]  Bijay K. Mishra,et al.  Tribo-electrostatic separation of high ash coking coal washery rejects: Effect of moisture on separation efficiency , 2016 .

[13]  Ling Tang,et al.  A non-iterative decomposition-ensemble learning paradigm using RVFL network for crude oil price forecasting , 2017, Appl. Soft Comput..

[14]  Reza Ebrahimpour,et al.  Mixture of experts: a literature survey , 2014, Artificial Intelligence Review.

[15]  S. Chehreh Chelgani,et al.  Studies of relationships between Free Swelling Index (FSI) and coal quality by regression and Adaptive Neuro Fuzzy Inference System , 2011 .

[16]  J. L. G. Cimadevilla,et al.  Influence of coal forced oxidation on technological properties of cokes produced at laboratory scale , 2005 .

[17]  Jason Weston,et al.  Gene Selection for Cancer Classification using Support Vector Machines , 2002, Machine Learning.

[18]  M. Mastalerz,et al.  Functional group and individual maceral chemistry of high volatile bituminous coals from southern Indiana: controls on coking , 2004 .

[19]  S. Khoshjavan,et al.  Evaluation of effect of coal chemical properties on coal swelling index using artificial neural networks , 2011, Expert Syst. Appl..

[20]  Sander Scholtus,et al.  Handbook of Statistical Data Editing and Imputation , 2011 .

[21]  Xin Yao,et al.  Ensemble learning via negative correlation , 1999, Neural Networks.

[22]  Xin Yao,et al.  Diversity creation methods: a survey and categorisation , 2004, Inf. Fusion.

[23]  Mohammad Hossein Fazel Zarandi,et al.  A hybrid fuzzy intelligent agent‐based system for stock price prediction , 2012, Int. J. Intell. Syst..

[24]  Shahaboddin Shamshirband,et al.  A novel evolutionary-negative correlated mixture of experts model in tourism demand estimation , 2016, Comput. Hum. Behav..

[25]  J. Hower,et al.  Estimation of free-swelling index based on coal analysis using multivariable regression and artifici , 2011 .

[26]  Nicolás García-Pedrajas,et al.  Supervised subspace projections for constructing ensembles of classifiers , 2012, Inf. Sci..

[27]  Rui Ye,et al.  Considering diversity and accuracy simultaneously for ensemble pruning , 2017, Appl. Soft Comput..

[28]  Reza Ebrahimpour,et al.  Combining features of negative correlation learning with mixture of experts in proposed ensemble methods , 2012, Appl. Soft Comput..

[29]  Xin Yao,et al.  Evolutionary ensembles with negative correlation learning , 2000, IEEE Trans. Evol. Comput..

[30]  Yvan Saeys,et al.  Java-ML: A Machine Learning Library , 2009, J. Mach. Learn. Res..

[31]  Yoichi Hayashi,et al.  SPMoE: a novel subspace-projected mixture of experts model for multi-target regression problems , 2015, Soft Computing.

[32]  R. Shiffler Maximum Z Scores and Outliers , 1988 .

[33]  L. Dascalescu,et al.  Effect of particle size on the tribo-aero-electrostatic separation of plastics , 2017 .

[34]  Marko Robnik-Sikonja,et al.  An adaptation of Relief for attribute estimation in regression , 1997, ICML.

[35]  Antonio Fernández-Caballero,et al.  Modeling and implementing an agent-based environmental health impact decision support system , 2009, Expert Syst. Appl..

[36]  A. Karegowda,et al.  COMPARATIVE STUDY OF ATTRIBUTE SELECTION USING GAIN RATIO AND CORRELATION BASED FEATURE SELECTION , 2010 .

[37]  Cornelius T. Leondes,et al.  Fuzzy logic and expert systems applications , 1997, Neural network systems techniques and applications.

[38]  Larry A. Rendell,et al.  The Feature Selection Problem: Traditional Methods and a New Algorithm , 1992, AAAI.

[39]  S. S. Matin,et al.  Variable selection and prediction of uniaxial compressive strength and modulus of elasticity by random forest , 2017, Appl. Soft Comput..

[40]  Yves A. Lussier,et al.  Partitioning knowledge bases between advanced notification and clinical decision support systems , 2007, Decis. Support Syst..

[41]  James C. Hower,et al.  Modeling of gross calorific value based on coal properties by support vector regression method , 2017, Modeling Earth Systems and Environment.

[42]  Yurong Liu,et al.  A niching evolutionary algorithm with adaptive negative correlation learning for neural network ensemble , 2017, Neurocomputing.

[43]  Shahaboddin Shamshirband,et al.  A novel Boosted-neural network ensemble for modeling multi-target regression problems , 2015, Eng. Appl. Artif. Intell..

[44]  Agostino Poggi,et al.  JADE: A software framework for developing multi-agent applications. Lessons learned , 2008, Inf. Softw. Technol..