A novel distribution-free hybrid regression model for manufacturing process efficiency improvement

This work is motivated by a particular problem of a modern paper manufacturing industry, in which maximum efficiency of the fiber-filler recovery process is desired. A lot of unwanted materials along with valuable fibers and fillers come out as a by-product of the paper manufacturing process and mostly goes as waste. The job of an efficient Krofta supracell is to separate the unwanted materials from the valuable ones so that fibers and fillers can be collected from the waste materials and reused in the manufacturing process. The efficiency of Krofta depends on several crucial process parameters and monitoring them is a difficult proposition. To solve this problem, we propose a novel hybridization of regression trees (RT) and artificial neural networks (ANN), hybrid RT-ANN model, to solve the problem of low recovery percentage of the supracell. This model is used to achieve the goal of improving supracell efficiency, viz., gain in percentage recovery. In addition, theoretical results for the universal consistency of the proposed model are given with the optimal value of a vital model parameter. Experimental findings show that the proposed hybrid RT-ANN model achieves higher accuracy in predicting Krofta recovery percentage than other conventional regression models for solving the Krofta efficiency problem. This work will help the paper manufacturing company to become environmentally friendly with minimal ecological damage and improved waste recovery.

[1]  J. Friedman Multivariate adaptive regression splines , 1990 .

[2]  Jean-Pierre Nadal,et al.  Neural trees: a new tool for classification , 1990 .

[3]  Kuriakose Athappilly,et al.  A comparative predictive analysis of neural networks (NNs), nonlinear regression and classification and regression tree (CART) models , 2005, Expert Syst. Appl..

[4]  Jing Shi,et al.  On comparing three artificial neural networks for wind speed forecasting , 2010 .

[5]  Theodoros Varzakas,et al.  Application of failure mode and effect analysis (FMEA) and cause and effect analysis for industrial processing of common octopus (Octopus vulgaris) – Part II , 2009 .

[6]  F. Windmeijer,et al.  R-Squared Measures for Count Data Regression Models With Applications to Health-Care Utilization , 1996 .

[7]  Gábor Lugosi,et al.  Nonparametric estimation via empirical risk minimization , 1995, IEEE Trans. Inf. Theory.

[8]  Thomas M. Cover,et al.  Geometrical and Statistical Properties of Systems of Linear Inequalities with Applications in Pattern Recognition , 1965, IEEE Trans. Electron. Comput..

[9]  I. Alsyouf The role of maintenance in improving companies' productivity and profitability , 2002 .

[10]  T. Kohonen,et al.  Statistical pattern recognition with neural networks: benchmarking studies , 1988, IEEE 1988 International Conference on Neural Networks.

[11]  A. Nobel Histogram regression estimation using data-dependent partitions , 1996 .

[12]  András Faragó,et al.  Strong universal consistency of neural network classifiers , 1993, IEEE Trans. Inf. Theory.

[13]  D. F. Andrews,et al.  A Robust Method for Multiple Linear Regression , 1974 .

[14]  Wayne Ieee,et al.  Entropy Nets: From Decision Trees to Neural Networks , 1990 .

[15]  Huan Liu,et al.  Feature Selection for Classification , 1997, Intell. Data Anal..

[16]  Tian-Shyug Lee,et al.  A two-stage hybrid credit scoring model using artificial neural networks and multivariate adaptive regression splines , 2005, Expert Syst. Appl..

[17]  Geoffrey E. Hinton,et al.  Deep Learning , 2015, Nature.

[18]  Guoqiang Peter Zhang,et al.  Neural network forecasting for seasonal and trend time series , 2005, Eur. J. Oper. Res..

[19]  V. Barnett,et al.  Applied Linear Statistical Models , 1975 .

[20]  Keinosuke Fukunaga,et al.  A Branch and Bound Algorithm for Feature Subset Selection , 1977, IEEE Transactions on Computers.

[21]  Jorge Alberto Soares Tenório,et al.  Use of froth flotation to separate PVC/PET mixtures , 2000 .

[22]  Joaquín Izquierdo,et al.  Hybrid regression model for near real-time urban water demand forecasting , 2017, J. Comput. Appl. Math..

[23]  N. Horan,et al.  The Treatment of a High Strength Pulp and Paper Mill Effluent for Wastewater Re-Use , 1998 .

[24]  Youngjae Chun,et al.  Sustainable manufacturing and the role of the International Journal of Production Research , 2013 .

[25]  C. A. Murthy,et al.  A nonparametric ensemble binary classifier and its statistical properties , 2018, Statistics & Probability Letters.

[26]  Misha Denil,et al.  Consistency of Online Random Forests , 2013, ICML.

[27]  Chris Aldrich,et al.  Online monitoring and control of froth flotation systems with machine vision: A review , 2010 .

[28]  László Györfi,et al.  A Probabilistic Theory of Pattern Recognition , 1996, Stochastic Modelling and Applied Probability.

[29]  Wei-Yin Loh,et al.  Classification and regression trees , 2011, WIREs Data Mining Knowl. Discov..

[30]  K. I. Ramachandran,et al.  Feature selection using Decision Tree and classification through Proximal Support Vector Machine for fault diagnostics of roller bearing , 2007 .

[31]  A. Afifi,et al.  Comparison of Stopping Rules in Forward “Stepwise” Regression , 1977 .

[32]  Jagadeesh Anmala,et al.  Rainfall-Runoff Modeling Using Artificial Neural Networks , 2010 .

[33]  Dimitri P. Solomatine,et al.  Neural networks and M5 model trees in modelling water level-discharge relationship , 2005, Neurocomputing.

[34]  R. Russell Rhinehart,et al.  pH Control using a statistical technique for continuous on-line model adaptation , 1993 .

[35]  S. Wold,et al.  The Collinearity Problem in Linear Regression. The Partial Least Squares (PLS) Approach to Generalized Inverses , 1984 .

[36]  H. Abdi Partial least squares regression and projection on latent structure regression (PLS Regression) , 2010 .

[37]  Luc Devroye,et al.  Consistency of Random Forests and Other Averaging Classifiers , 2008, J. Mach. Learn. Res..

[38]  Tanujit Chakraborty,et al.  A novel hybridization of classification trees and artificial neural networks for selection of students in a business school , 2018 .

[39]  Byun-Woo Lee,et al.  Assessment of rice leaf growth and nitrogen status by hyperspectral canopy reflectance and partial least square regression , 2006 .

[40]  Adam Krzyzak,et al.  A Distribution-Free Theory of Nonparametric Regression , 2002, Springer series in statistics.

[41]  Maamar Sebri,et al.  Forecasting urban water demand: A meta-regression analysis. , 2016, Journal of environmental management.

[42]  James A. Finch,et al.  On the role of cavitation in particle collection during flotation. A critical review , 1994 .

[43]  M. Dresselhaus,et al.  Vapor-grown carbon fibers (VGCFs): Basic properties and their battery applications , 2001 .

[44]  R. Jennrich,et al.  Application of Stepwise Regression to Non-Linear Estimation , 1968 .

[45]  Jean-Philippe Vert,et al.  Consistency of Random Forests , 2014, 1405.2881.

[46]  Jiwen Dong,et al.  Time-series forecasting using flexible neural tree model , 2005, Inf. Sci..

[47]  Ali Boubakri,et al.  Electrodialytic desalination of brackish water: determination of optimal experimental parameters using full factorial design , 2017, Applied Water Science.

[48]  Randall Balestriero,et al.  Neural Decision Trees , 2017, ArXiv.

[49]  A. Malik,et al.  Artificial neural network modeling of the river water quality—A case study , 2009 .

[50]  Bernhard Schölkopf,et al.  A tutorial on support vector regression , 2004, Stat. Comput..

[51]  Luc Devroye,et al.  On the layered nearest neighbour estimate, the bagged nearest neighbour estimate and the random forest method in regression and classification , 2010, J. Multivar. Anal..

[52]  Anthony T. C. Goh,et al.  Multivariate Adaptive Regression Splines Application for Multivariate Geotechnical Problems with Big Data , 2016, Geotechnical and Geological Engineering.