Multi-Branch Multi Layer Perceptron: A Solution for Precise Regression using Machine Learning

The problem of simple regression using Multi Layer Perceptron (MLP) has its limitations. The main problem with MLP is that it is difficult to find a perfect architecture to fit all the data. The more complex and multi-dimensional the data we use, the deeper the network has to be, which increases training time as well as optimization and tuning of the network. The solution to fit the data more precisely could be to split the data into groups based on the input variable and use a different model to train and predict data for each of these groups. In most cases, datasets contain a large number of non-linear input features, which makes the method for finding thresholds for each group very difficult. The proposed technique tackles this problem using multiple branches consisting of shallow MLP, one of which acts as a selector (classifier) for the output. The selector’s main goal is to generate a weight for the other prediction branches which in other words means, results in one branch being more proficient in predicting certain parts of the feature space while the other branches will be better at predicting completely different parts of the feature space.

[1]  Nikhil Sharma,et al.  Stock Market Prediction Using Historical Stock Prices And Dependence On Other Companies In Automotive Sector , 2022, 2022 IEEE 10th Region 10 Humanitarian Technology Conference (R10-HTC).

[2]  Xiaojian Weng,et al.  Stock Price Prediction Based On Lstm And Bert , 2022, 2022 International Conference on Machine Learning and Cybernetics (ICMLC).

[3]  B. Purkayastha,et al.  Health Insurance Cost Prediction Using Regression Models , 2022, 2022 International Conference on Machine Learning, Big Data, Cloud and Parallel Computing (COM-IT-CON).

[4]  Ritik. S. Jain,et al.  Regression Analysis of COVID-19 using Machine Learning Algorithms , 2020, 2020 International Conference on Smart Electronics and Communication (ICOSEC).

[5]  Herbert F. Jelinek,et al.  Estimating Left Ventricle Ejection Fraction Levels Using Circadian Heart Rate Variability Features and Support Vector Regression Models , 2020, IEEE Journal of Biomedical and Health Informatics.

[6]  Srinath Ravikumar,et al.  Prediction of Stock Prices using Machine Learning (Regression, Classification) Algorithms , 2020, 2020 International Conference for Emerging Technology (INCET).

[7]  Hong Hui Tan,et al.  Vanishing Gradient Mitigation with Deep Learning Neural Network Optimization , 2019, 2019 7th International Conference on Smart Computing & Communications (ICSCC).

[8]  Jaswinder Singh,et al.  A Study on Single and Multi-layer Perceptron Neural Network , 2019, 2019 3rd International Conference on Computing Methodologies and Communication (ICCMC).

[9]  Sashank J. Reddi,et al.  On the Convergence of Adam and Beyond , 2018, ICLR.

[10]  Sebastian Ruder,et al.  An overview of gradient descent optimization algorithms , 2016, Vestnik komp'iuternykh i informatsionnykh tekhnologii.

[11]  Sergey Ioffe,et al.  Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.

[12]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[13]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..