Comparison between different methods for developing neural network topology applied to a complex polymerization process

In this paper, three methods for developing the optimal topology of feed-forward artificial neural networks are described and applied for modeling a complex polymerization process. In the free radical polymerization of styrene, accompanied by gel and glass effects, the monomer conversion and molecular masses are modeled depending on reaction conditions. The first proposed methodology is an algorithm which systemizes a series of criteria and heuristics on neural networks modeling. The method is laborious, but the practical considerations, structured in a 6-steps algorithm, and the criterion and formula used for calculating the performance indices give the method a real chance of obtaining a neural network of minimum size and maximum performance. The next two methods belong to evolutionary techniques and they are based on a classical genetic algorithm and differential evolution algorithm. They automatically develop the neural network topology by determining the optimal values for the number of hidden layers, the number of neurons in these layers, the weights between layers, the biases of the neurons and the activation functions. For all the three methods, a combination between training and testing errors was considered in order to evaluate the performance of the developed neural networks and to choose the best one among them. The relative percentage errors calculated in the validation phase registered good values, under 7%. A comparison between these methods pointed out both advantages and disadvantages, but, even if they lead to different network architectures, accurate results were obtained and, consequently, near optimal neural network topologies were developed.

[1]  Juan Julián Merelo Guervós,et al.  G-Prop: Global optimization of multilayer perceptrons using GAs , 2000, Neurocomputing.

[2]  Khashayar Khorasani,et al.  New training strategies for constructive neural networks with application to regression problems , 2004, Neural Networks.

[3]  Michael Sylvester Packianather,et al.  Optimizing the parameters of multilayered feedforward neural networks through Taguchi design of experiments , 2000 .

[5]  Bidyadhar Subudhi,et al.  Differential Evolution and Levenberg Marquardt Trained Neural Network Scheme for Nonlinear System Identification , 2008, Neural Processing Letters.

[6]  Joni-Kristian Kämäräinen,et al.  Differential Evolution Training Algorithm for Feed-Forward Neural Networks , 2003, Neural Processing Letters.

[7]  Sandip Kumar Lahiri,et al.  Modeling of Commercial Ethylene Oxide Reactor: A Hybrid Approach by Artificial Neural Network & Differential Evolution , 2010 .

[8]  Deoki N. Saraf,et al.  Design of neural networks using genetic algorithm for on-line property estimation of crude fractionator products , 2006, Comput. Chem. Eng..

[9]  J. Tvrdík,et al.  Adaptive Differential Evolution : Application to Nonlinear Regression , 2007 .

[10]  José Neves,et al.  Simultaneous Evolution of Neural Network Topologies and Weights for Classification and Regression , 2005, IWANN.

[11]  James T. Kwok,et al.  Constructive algorithms for structure learning in feedforward neural networks for regression problems , 1997, IEEE Trans. Neural Networks.

[12]  James Tannock,et al.  The optimisation of neural network parameters using Taguchi’s design of experiments approach: an application in manufacturing process modelling , 2005, Neural Computing & Applications.

[13]  D. Zaharie A Comparative Analysis of Crossover Variants in Differential Evolution , 2007 .

[14]  George-Christopher Vosniakos,et al.  Optimizing feedforward artificial neural network architecture , 2007, Eng. Appl. Artif. Intell..

[15]  Silvia Curteanu,et al.  The neural networks based modeling of a polybenzimidazole-based polymer electrolyte membrane fuel cell: Effect of temperature , 2009 .

[16]  Bidyadhar Subudhi,et al.  An improved differential evolution trained neural network scheme for nonlinear system identification , 2009, Int. J. Autom. Comput..

[17]  Kazuyuki Murase,et al.  A new algorithm to design compact two-hidden-layer artificial neural networks , 2001, Neural Networks.

[18]  George-Christopher Vosniakos,et al.  Prediction of surface roughness in CNC face milling using neural networks and Taguchi's design of experiments , 2002 .

[19]  Christian Lebiere,et al.  The Cascade-Correlation Learning Architecture , 1989, NIPS.

[20]  Nidapan Sureerattanan,et al.  New Training Method and Optimal Structure of Backpropagation Networks , 2005, ICNC.

[21]  Silvia Curteanu,et al.  Optimization Strategy Based on Genetic Algorithms and Neural Networks Applied to a Polymerization Process , 2007 .

[22]  C. Jutten,et al.  Gal: Networks That Grow When They Learn and Shrink When They Forget , 1991 .

[23]  Robert M. Pap,et al.  Handbook of neural computing applications , 1990 .

[24]  Kazuyuki Murase,et al.  A Pruning Algorithm for Training Cooperative Neural Network Ensembles , 2006, IEICE Trans. Inf. Syst..

[25]  Robert J. Butera,et al.  Genetic Algorithm for Optimization and Specification of a Neuron Model , 2006, 2005 IEEE Engineering in Medicine and Biology 27th Annual Conference.

[26]  George D. Magoulas,et al.  Training multilayer networks with discrete activation functions , 2001, IJCNN'01. International Joint Conference on Neural Networks. Proceedings (Cat. No.01CH37222).

[27]  Pedro Paulo Balestrassi,et al.  Design of experiments on neural network's training for nonlinear time series forecasting , 2009, Neurocomputing.

[28]  Silvia Curteanu,et al.  Optimization methodology applied to feed‐forward artificial neural network parameters , 2011 .

[29]  Silvia Curteanu,et al.  Ten steps modeling of electrolysis processes by using neural networks , 2010, Environ. Model. Softw..

[30]  Éric D. Taillard,et al.  Analysis and test of efficient methods for building recursive deterministic perceptron neural networks , 2007, Neural Networks.

[31]  Bong-Jin Yum,et al.  Robust design of multilayer feedforward neural networks: an experimental approach , 2004, Eng. Appl. Artif. Intell..

[32]  Silvia Curteanu,et al.  Neural network-based modeling for semi-batch and nonisothermal free radical polymerization , 2006 .

[33]  Fabiano A.N. Fernandes,et al.  Neural network applications in polymerization processes , 2005 .

[34]  Bao-Gang Hu,et al.  Two-Phase Construction of Multilayer Perceptrons Using Information Theory , 2009, IEEE Transactions on Neural Networks.

[35]  Eleni I. Vlahogianni,et al.  Optimized and meta-optimized neural networks for short-term traffic flow prediction: A genetic approach , 2005 .

[36]  Frédéric Alexandre,et al.  Neural Network Topology Optimization , 2005, ICANN.

[37]  R. Storn,et al.  Differential Evolution: A Practical Approach to Global Optimization (Natural Computing Series) , 2005 .

[38]  Silvia Curteanu,et al.  Modeling and simulation of free radical polymerization of styrene under semibatch reactor conditions , 2003 .

[39]  Qing Wang,et al.  Process cost modelling using neural networks , 2000 .