Big Data Technologies for Batch and Real-Time Data Processing: A Review

With the rampant growth in the generation of data from various sources, there is a necessity to analyze and process this huge amount of data effectively. Data, available in batch and real-time requires being processed actively. Big Data technologies provide effective frameworks to handle voluminous data efficiently. This paper reports review on data processing using Big Data frameworks: Apache MapReduce, Apache Spark and Apache Storm, for data available in batch and real-time. The conclusion is summarized based on the comparison between the frameworks on several variables.