Chapter 8 – Developing Big Data Applications

This chapter goes into greater detail about the MapReduce programming model. The chapter debates the expectations that migration to a big data platform will automatically provide scalable performance, and provides an overview of parallel application development using MapReduce. We walk through the execution model, and consider aspects of performance in relation to data distribution and task parallelism. The chapter also discusses another open source big data application development framework. Lastly, we review the execution model for parallel applications.