Big Data representation for grade analysis through Hadoop framework

Big Data is a large dataset displaying the features of volume, velocity and variety in an OR relationship. Big Data as a large dataset is of no significance if it cannot be exposed to strategic analysis and utilization. There are many software and hardware solutions available in the technological landscape that enable capturing, storing and subsequently analysis of Big Data. Hadoop and its associated technological solution is one of them. Hadoop is the software framework for computing large amount of data. It is made up of four main modules. These modules are Hadoop Common, Hadoop Distributed File System (HDFS), Hadoop YARN, and Hadoop MapReduce. Hadoop MapReduce divides large problem into smaller sub problems under the control of JobTracker. This paper suggests a Big Data representation for grade analytics in an educational context. The study and the experiments can be implemented on R or AWS the cloud infrastructure provided by Amazon.

[1]  Surajit Chaudhuri How Different is Big Data? , 2012, 2012 IEEE 28th International Conference on Data Engineering.

[2]  Gilles Fedak,et al.  MapReduce and Hadoop , 2012 .

[3]  Nathan Marz,et al.  Big Data: Principles and best practices of scalable realtime data systems , 2015 .

[4]  K. V. V. N. Raju,et al.  Data Mining with Big Data , 2016 .

[5]  Sateesh K. Peddoju,et al.  Classification and comparison of NoSQL big data models , 2015, Int. J. Big Data Intell..

[6]  Martin Grund,et al.  Impala: A Modern, Open-Source SQL Engine for Hadoop , 2015, CIDR.

[7]  A. Kala Karun,et al.  A review on hadoop — HDFS infrastructure extensions , 2013, 2013 IEEE CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGIES.

[8]  Xindong Wu,et al.  Data mining with big data , 2014, IEEE Transactions on Knowledge and Data Engineering.

[9]  Yunhao Liu,et al.  Big Data: A Survey , 2014, Mob. Networks Appl..

[10]  Johnny Wong,et al.  Proliferating Cloud Density through Big Data Ecosystem, Novel XCLOUDX Classification and Emergence of as-a-Service Era , 2016 .

[11]  Seref Sagiroglu,et al.  Big data: A review , 2013, 2013 International Conference on Collaboration Technologies and Systems (CTS).

[12]  Johnny S. Wong,et al.  A Brief Review on Leading Big Data Models , 2014, Data Sci. J..

[13]  K. Bakshi,et al.  Considerations for big data: Architecture and approach , 2012, 2012 IEEE Aerospace Conference.

[14]  Sugam Sharma,et al.  Evolution of as-a-Service Era in Cloud , 2015, ArXiv.

[15]  Rajkumar Buyya,et al.  Big Data computing and clouds: Trends and future directions , 2013, J. Parallel Distributed Comput..

[16]  M. Anusha,et al.  Big Data-Survey , 2016 .

[17]  Robert D. Schneider,et al.  Hadoop For Dummies , 2012 .

[18]  Srikanta Patnaik,et al.  Leading NoSQL models for handling Big Data: a brief review , 2016, Int. J. Bus. Inf. Syst..

[19]  Luis M. Vaquero,et al.  Open Source Cloud Computing Systems: Practices and Paradigms , 2012 .