This paper details our experiences with design and implementation of data science curriculum at University at Buffalo (UB). We discuss (i) briefly the history of project, (ii) a certificate program that we created, (iii) a data-intensive computing course that forms the core of the curriculum and (iv) some of the challenges we faced and how we addressed them. Major goal of the project was to improve the preparedness of our workforce for the emerging data-intensive computing area. We measured this through assessment of student learning on various concepts and topics related to data-intensive computing. We also discuss the best practices in building a data science program. We highlight the importance of external funding support and multi-disciplinary collaborations in the success of the project. The pedagogical resources created for the project are freely available to help educators and other learners navigate the path to learning data science. We expect this paper about our experience will provide a road map for educators who desire to introduce data science in their curriculum.
[1]
Rachel Schutt,et al.
Doing Data Science
,
2013
.
[2]
Sanjay Ghemawat,et al.
MapReduce: Simplified Data Processing on Large Clusters
,
2004,
OSDI.
[3]
Jimmy J. Lin,et al.
Book Reviews: Data-Intensive Text Processing with MapReduce by Jimmy Lin and Chris Dyer
,
2010,
CL.
[4]
C. Pipper,et al.
[''R"--project for statistical computing].
,
2008,
Ugeskrift for laeger.
[5]
Jim Gray,et al.
Distributed Computing Economics
,
2004,
ACM Queue.
[6]
Steve Pierson.
Master’s Programs in Data Science and Analytics (Continued …)
,
2017
.