Evaluating forecasting algorithm of realistic datasets based on machine learning

Developing a good forecasting model is one of the most important methods to boost profits of enterprises. However, it is also a difficult task to design a well-performed model, especially when using traditional linear statistic tools. There are many useful algorithms in Machine Learning and each has its advantage. This paper begins with two realistic forecasting issues: store sales forecasting and customer satisfaction analyzing, gives a preprocessing scheme to improve the performance of machine leaning algorithms, including Artificial Neural Network (ANN), Support Vector Machine (SVM) and eXtreme Gradient Boosting (XGBoost). We develop schemes respectively for each of the two datasets and it is verified to solve each of the problems. The proposed scheme shows better performance and achieve top 10% rank in the international Machine Learning competition (Kaggle).