Popular platforms for big data analytics: A survey

This paper aims to make a comparative study of the different platforms used in the field of Big data analytics. It lists the different hardware platforms available, and widely used in Big data, to perform data analysis. A detailed description of the frameworks and processing paradigms used by each platform is also provided. Comparison criteria such as scalability, I/O rate, fault tolerance, data size, iterative processing and real-time processing are used to assess the advantages and drawbacks of each of these platforms. A summary table will present the strengths and weaknesses of each platform, then a qualitative and rigorous comparison is discussed according to the criteria mentioned above. Also this document presents the new trends aiming at the improvement of the performances of these platforms by the use of hardware accelerators and also by the convergence of big data analytics frameworks with modern High Performance Computing (HPC) systems. The results of this survey on widely used platforms in big data analytics can help users make an informed decision about choosing the right platform for their data analytics needs.