Comparative Study of Machine Learning Algorithms Over Big Data Sets