Maximizing information from chemical engineering data sets: Applications to machine learning