Using Phoenix++ MapReduce to introduce undergraduate students to parallel computing

With the release of CS2013, departments around the country are injecting more parallelism into their core computer science courses. MapReduce is an attractive option for introducing undergraduate students to parallelism. However, most approaches to teaching MapReduce require students to access a Hadoop cluster. This is often too costly to be a viable option at many institutions, especially if MapReduce represents a minor topic in a course. In this paper, we discuss the use of Phoenix++ MapReduce for introducing undergraduate students to parallel computing and the MapReduce paradigm. We publish an open-source module on Phoenix++, enabling instructors at other institutions to rapidly adopt Phoenix++ MapReduce for use in their own curricula.

[1]  Richard A. Brown,et al.  WebMapReduce: an accessible and adaptable tool for teaching map-reduce computing , 2011, SIGCSE '11.

[2]  Edward B. Duffy,et al.  Teaching HDFS/MapReduce Systems Concepts to Undergraduates , 2014, 2014 IEEE International Parallel & Distributed Processing Symposium Workshops.

[3]  Christophe Bisciglia,et al.  Cluster computing for web-scale data processing , 2008, SIGCSE '08.

[4]  Richard A. Brown,et al.  CSinParallel and Synergy for Rapid Incremental Addition of PDC Into CS Curricula , 2012, 2012 IEEE 26th International Parallel and Distributed Processing Symposium Workshops & PhD Forum.

[5]  Sushil K. Prasad NSF/TCPP Introduction , 2011, 2011 IEEE International Symposium on Parallel and Distributed Processing Workshops and Phd Forum.

[6]  Justin Talbot,et al.  Phoenix++: modular MapReduce for shared-memory systems , 2011, MapReduce '11.

[7]  Jimmy J. Lin,et al.  Exploring Large-Data Issues in the Curriculum: A Case Study with MapReduce , 2008 .

[8]  Jacob D. Furst,et al.  Enhancing undergraduate education: a REU model for interdisciplinary research , 2009, SIGCSE '09.

[9]  Daniel J. Ernst,et al.  Concurrent CS: preparing students for a multicore world , 2008, SIGCSE 2008.

[10]  Richard A. Brown,et al.  Hadoop at home: large-scale computing at a small college , 2009, SIGCSE '09.

[11]  Tiffany Barnes,et al.  Improving retention and graduate recruitment through immersive research experiences for undergraduates , 2008, SIGCSE '08.

[12]  Benjamin C. M. Fung,et al.  A novel approach of mining write-prints for authorship attribution in e-mail forensics , 2008, Digit. Investig..

[13]  Randy H. Katz,et al.  Experiences teaching MapReduce in the cloud , 2012, SIGCSE '12.

[14]  Herb Sutter,et al.  The Free Lunch Is Over A Fundamental Turn Toward Concurrency in Software , 2013 .

[15]  Sanjay Ghemawat,et al.  MapReduce: Simplified Data Processing on Large Clusters , 2004, OSDI.