A study on the standardization strategy for building of learning data set for machine learning applications