Classification of solid fuels with machine learning

Abstract In energy applications, fuels are processed in various ways according to their types. Experiments conducted with non-optimal processing procedures cause waste of the materials and may lead to inaccurate conclusions. Moreover, classifying the fuel material becomes harder when the material is already processed or collected from an environment that makes it difficult to discern. Thus, with the constantly developing and diversifying energy applications, the need for a framework that can classify solid fuels is increased. In this study, we used machine learning based approach to classify fuels with the use of proximate analysis results, i.e., fixed carbon, volatile matter and ash contents. We collected a data set from the literature and group them into four classes, i.e., coals, woods, agricultural residue and manufactured biomass. Then, K-nearest neighbor, support vector machine and random forest machine learning classifiers are employed to develop prediction models. Furthermore, hierarchical classification approach is taken to combine each classifier’s advantages with integration of expert opinion to create a complete and highly accurate classifier framework which can classify an unknown fuel into one of the four categories. K-fold cross validation is used to evaluate classifiers’ performances in unbiased manner. With hierarchical classifier, % 96 and %92 classification accuracy is achieved for training and testing phases, respectively. Source code of the proposed hierarchical classifier framework is also provided.

[1]  A. Rossi,et al.  Fuel Characteristics of Wood and Nonwood Biomass Fuels , 1984 .

[2]  Claudio Gentile,et al.  Hierarchical classification: combining Bayes with SVM , 2006, ICML.

[3]  C. Zhang,et al.  Estimation of higher heating value of coal based on proximate analysis using support vector regression , 2015 .

[4]  I. Maqsood,et al.  Random Forests and Decision Trees , 2012 .

[5]  B. M. Gibbs,et al.  Thermal analysis and devolatilization kinetics of cotton stalk, sugar cane bagasse and shea meal under nitrogen and air atmospheres. , 2009, Bioresource technology.

[6]  Carlos Rodríguez-Monroy,et al.  An Efficient Waste-To-Energy Model in Isolated Environments. Case Study: La Gomera (Canary Islands) , 2019, Sustainability.

[7]  Sotiris B. Kotsiantis,et al.  Supervised Machine Learning: A Review of Classification Techniques , 2007, Informatica.

[8]  A. D. Gordon A Review of Hierarchical Classification , 1987 .

[9]  Jillian L. Goldfarb,et al.  Improved prediction of higher heating value of biomass using an artificial neural network model based on proximate analysis. , 2017, Bioresource technology.

[10]  P. Abdul Salam,et al.  Estimation of higher heating value of biomass from proximate analysis: A new approach , 2012 .

[11]  Jo‐Shu Chang,et al.  Torrefaction performance and energy usage of biomass wastes and their correlations with torrefaction severity index , 2018, Applied Energy.

[12]  Nittaya Kerdprasop,et al.  An Empirical Study of Distance Metrics for k-Nearest Neighbor Algorithm , 2015 .

[13]  Tzu-Tsung Wong,et al.  Performance evaluation of classification algorithms by k-fold and leave-one-out cross validation , 2015, Pattern Recognit..

[14]  Yoav Freund,et al.  The Alternating Decision Tree Learning Algorithm , 1999, ICML.

[15]  Wen‐ying Li,et al.  Prediction of elemental composition of coal using proximate analysis , 2017 .

[16]  Mariana Belgiu,et al.  Random forest in remote sensing: A review of applications and future directions , 2016 .

[17]  José Ignacio Arranz,et al.  Energetic characterization of densified residues from Pyrenean oak forest , 2009 .

[18]  Jason Weston,et al.  Multi-Class Support Vector Machines , 1998 .

[19]  RajenderKumar Gupta,et al.  Characterising ash of biomass and waste , 2007 .

[20]  Yoshua Bengio,et al.  No Unbiased Estimator of the Variance of K-Fold Cross-Validation , 2003, J. Mach. Learn. Res..

[21]  Andrew McCallum,et al.  Collective multi-label classification , 2005, CIKM '05.

[22]  C. R. Monroy,et al.  An efficient waste to energy model for isolated environments. Case study: La Gomera , 2019 .

[23]  L. Jiménez,et al.  Study of the physical and chemical properties of lignocellulosic residues with a view to the production of fuels , 1991 .

[24]  Jiyuan Zhang,et al.  Proximate analysis based prediction of gross calorific value of coals: A comparison of support vector machine, alternating conditional expectation and artificial neural network , 2015 .

[25]  Thomas Hofmann,et al.  Support vector machine learning for interdependent and structured output spaces , 2004, ICML.

[26]  Qinghai Li,et al.  Classification of municipal solid waste components for thermal conversion in waste-to-energy research , 2015 .

[27]  E. Gehan A GENERALIZED WILCOXON TEST FOR COMPARING ARBITRARILY SINGLY-CENSORED SAMPLES. , 1965, Biometrika.

[28]  Timothy A. Warner,et al.  Implementation of machine-learning classification in remote sensing: an applied review , 2018 .

[29]  D. Duca,et al.  Soft Independent Modelling of Class Analogy applied to infrared spectroscopy for rapid discrimination between hardwood and softwood , 2016 .

[30]  S. Channiwala,et al.  A UNIFIED CORRELATION FOR ESTIMATING HHV OF SOLID, LIQUID AND GASEOUS FUELS , 2002 .

[31]  Alex A. Freitas,et al.  A survey of hierarchical classification across different application domains , 2010, Data Mining and Knowledge Discovery.

[32]  Jörg Maier,et al.  Thermal Utilization of Solid Recovered Fuels in Pulverized Coal Power Plants and Industrial Furnaces as Part of an Integrated Waste Management Concept , 2009 .

[33]  Carlos Alberto Luengo,et al.  Thermochemical Properties of Cuban Biomass , 2000 .

[34]  Leif E. Peterson K-nearest neighbor , 2009, Scholarpedia.

[35]  Oliver Kramer,et al.  K-Nearest Neighbors , 2013 .

[36]  J. Bilbao,et al.  Fast characterization of biomass fuels by thermogravimetric analysis (TGA) , 2015 .

[37]  Yoram Singer,et al.  Large margin hierarchical classification , 2004, ICML.

[38]  S. Lemon,et al.  Classification and regression tree analysis in public health: Methodological review and comparison with logistic regression , 2003, Annals of behavioral medicine : a publication of the Society of Behavioral Medicine.

[39]  N. Ilten,et al.  Fuel Analyses and Thermochemical Processing of Olive Residues , 2004 .

[40]  J. Bordado,et al.  Potential biofuels from liquefied industrial wastes – Preliminary evaluation of heats of combustion and van Krevelen correlations , 2016 .

[41]  C. I. Sainz-Díaz,et al.  Activated carbon from solid wastes using a pilot-scale batch flaming pyrolyser , 2000 .

[42]  Arthur Zimek,et al.  A Study of Hierarchical and Flat Classification of Proteins , 2010, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[43]  A. Demirbas,et al.  Calculation of higher heating values of biomass fuels , 1997 .

[44]  Achim Zeileis,et al.  Bias in random forest variable importance measures: Illustrations, sources and a solution , 2007, BMC Bioinformatics.

[45]  Andy Liaw,et al.  Classification and Regression by randomForest , 2007 .

[46]  P. Fernàndez,et al.  Correlations of properties of Spanish coals with their natural radionuclides contents , 1997 .

[47]  P. Bonelli Slow Pyrolysis of Nutshells: Characterization of Derived Chars and of Process Kinetics , 2003 .

[48]  José Antonio Lozano,et al.  Sensitivity Analysis of k-Fold Cross Validation in Prediction Error Estimation , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[49]  T. Bridgeman,et al.  Classification of macroalgae as fuel and its thermochemical behaviour. , 2008, Bioresource technology.

[50]  H. Haykiri-Acma,et al.  Effect of biomass on burnouts of Turkish lignites during co-firing. , 2009 .

[51]  Thomas G. Dietterich Overfitting and undercomputing in machine learning , 1995, CSUR.

[52]  Bryan M. Jenkins,et al.  CORRELATION OF PHYSICAL AND CHEMICAL PROPERTIES OF TERRESTRIAL BIOMASS WITH CONVERSION. , 1985 .

[53]  B. Jenkins,et al.  Combustion properties of biomass , 1998 .

[54]  Y. Niu,et al.  Short Review on the Origin and Countermeasure of Biomass Slagging in Grate Furnace , 2014, Front. Energy Res..

[55]  Patrick E. McKnight,et al.  Mann‐Whitney U Test , 2010 .

[56]  S. Channiwala,et al.  A correlation for calculating HHV from proximate analysis of solid fuels , 2005 .

[57]  Tomás Cordero,et al.  Predicting heating values of lignocellulosics and carbonaceous materials from proximate analysis , 2001 .

[58]  Paul G. Risser,et al.  Agricultural and Forestry Residues , 1981 .

[59]  T. A. Milne,et al.  Thermodynamic data for biomass conversion and waste incineration , 1986 .

[60]  Paresh Chandra Deka,et al.  Support vector machine applications in the field of hydrology: A review , 2014, Appl. Soft Comput..

[61]  Peter B Harrington,et al.  Fuzzy rule-building expert system classification of fuel using solid-phase microextraction two-way gas chromatography differential mobility spectrometric data. , 2007, Analytical chemistry.

[62]  James M. Keller,et al.  A fuzzy K-nearest neighbor algorithm , 1985, IEEE Transactions on Systems, Man, and Cybernetics.

[63]  David A. Tillman,et al.  Wood as an energy resource , 1978 .

[64]  A. Ghaly,et al.  Study of agglomeration characteristics of silica sand-straw ash mixtures using scanning electronic microscopy and energy dispersion X-ray techniques , 1994 .

[65]  Jianren Fan,et al.  A comprehensive study on estimating higher heating value of biomass from proximate and ultimate analysis with machine learning approaches , 2019 .

[66]  M. Mehrpooya,et al.  Estimation of biomass higher heating value (HHV) based on the proximate analysis by using iterative neural network-adapted partial least squares (INNPLS) , 2017 .

[67]  Marliati Ahmad,et al.  New Van Krevelen diagram and its correlation with the heating value of biomass , 2013 .

[68]  Ee-Peng Lim,et al.  Hierarchical text classification and evaluation , 2001, Proceedings 2001 IEEE International Conference on Data Mining.

[69]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[70]  Min-fang Qi,et al.  Estimation of low calorific value of blended coals based on support vector regression and sensitivity analysis in coal-fired power plants , 2019, Fuel.

[71]  Stan Szpakowicz,et al.  Hierarchical versus Flat Classification of Emotions in Text , 2010, HLT-NAACL 2010.

[72]  A. J. Toft,et al.  A techno-economic comparison of power production by biomass fast pyrolysis with gasification and combustion , 2002 .

[73]  Erik Dahlquist,et al.  Technologies for Converting Biomass to Useful Energy : combustion, gasification, pyrolysis, torrefaction and fermentation , 2013 .

[74]  Taghi M. Khoshgoftaar,et al.  An Empirical Study of Learning from Imbalanced Data Using Random Forest , 2007 .

[75]  Two-dimensional correlation coefficient mapping in gas chromatography: Jet fuel classification for environmental analysis , 2006 .

[76]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[77]  Francisco Herrera,et al.  Study on the Impact of Partition-Induced Dataset Shift on $k$-Fold Cross-Validation , 2012, IEEE Transactions on Neural Networks and Learning Systems.

[78]  Özgün Yücel,et al.  An artificial intelligence based approach to predicting syngas composition for downdraft biomass gasification , 2018, Energy.