Expanding Undergraduate Exposure to Computer Science Subfields: Resources and Lessons from a Hands-on Computational Biology Workshop

Computational biology is an exciting and ever-widening interdisciplinary field. Expanding the participation of undergraduate students in this field will help to inspire and train the next generation of scientists necessary to support this growing area. However, students at smaller institutions, such as those focused on undergraduate education, may not have access to courses related to or even faculty interested in computational biology. Providing more opportunities for such undergraduate students to be exposed to computational biology, or other subfields within computer science, will be important for ensuring these students are included in the pipeline of scientists contributing to these diverse fields. To this end, we hosted a computational biology workshop that brought together undergraduate students from three different liberal arts colleges. The goal of the workshop was to provide an introduction to how computer science can be used to help answer important problems in biology. A diverse set of six faculty members from different institutions each created and taught a hands-on module as an introduction to a different area of computational biology at the workshop. We describe how we went about organizing this undergraduate workshop, summarize the workshop materials that are freely available, and discuss the outcomes and lessons learned from the workshop. We further propose that the workshop structure used is adaptable to other subfields of computer science. Workshop materials available at the workshop website: https://sites.google.com/carleton.edu/compbioworkshop2018/home.

[1]  Gonçalo R. Abecasis,et al.  The Sequence Alignment/Map format and SAMtools , 2009, Bioinform..

[2]  T. M. Murali,et al.  GRAPHSPACE: Stimulating interdisciplinary collaborations in network biology , 2017 .

[3]  Oscar Garcia,et al.  A proposed undergraduate bioinformatics curriculum for computer scientists , 2002, SIGCSE '02.

[4]  Travis E. Oliphant,et al.  Guide to NumPy , 2015 .

[5]  Maria M. Klawe,et al.  Increasing Female Participation in Computing: The Harvey Mudd College Story , 2013, Computer.

[6]  Eystein Oveland,et al.  PeptideShaker enables reanalysis of MS-derived proteomics data sets , 2015, Nature Biotechnology.

[7]  M. Bedau Artificial life: organization, adaptation and complexity from the bottom up , 2003, Trends in Cognitive Sciences.

[8]  Jason A. Papin,et al.  Ten simple rules for biologists learning to program , 2018, PLoS Comput. Biol..

[9]  Russell Schwartz,et al.  The development and application of bioinformatics core competencies to improve bioinformatics training and education , 2018, PLoS Comput. Biol..

[10]  Russell Schwartz,et al.  Bioinformatics Curriculum Guidelines: Toward a Definition of Core Competencies , 2014, PLoS Comput. Biol..

[11]  Tanya Y. Berger-Wolf,et al.  A Biology-themed Introductory CS Course at a Large, Diverse Public University , 2018, SIGCSE.

[12]  Andreas Madlung,et al.  Assessing an effective undergraduate module teaching applied bioinformatics to biology students , 2018, PLoS Comput. Biol..

[13]  Reinhard Laubenbacher,et al.  Fostering bioinformatics education through skill development of professors: Big Genomic Data Skills Training for Professors , 2019, PLoS Comput. Biol..

[14]  Christopher A. Miller,et al.  VarScan 2: somatic mutation and copy number alteration discovery in cancer by exome sequencing. , 2012, Genome research.

[15]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[16]  James E. Johnson,et al.  Using Galaxy-P to leverage RNA-Seq for the discovery of novel protein variations , 2014, BMC Genomics.

[17]  William R. Pearson,et al.  Bioinformatics core competencies for undergraduate life sciences education , 2017, bioRxiv.

[18]  David R Smith,et al.  Bringing bioinformatics to the scientific masses , 2018, EMBO reports.

[19]  Cath Brooksbank,et al.  From trainees to trainers to instructors: Sustainably building a national capacity in bioinformatics training , 2019, PLoS Comput. Biol..

[20]  V. Marx Biology: The big challenges of big data , 2013, Nature.

[21]  Tom O. Delmont,et al.  Anvi’o: an advanced analysis and visualization platform for ‘omics data , 2015, PeerJ.

[22]  Itai Yanai,et al.  Computational biologists: moving to the driver's seat , 2017, Genome Biology.

[23]  Dirk Merkel,et al.  Docker: lightweight Linux containers for consistent development and deployment , 2014 .

[24]  F. Markowetz All biology is computational biology , 2017, PLoS biology.

[25]  Aric Hagberg,et al.  Exploring Network Structure, Dynamics, and Function using NetworkX , 2008, Proceedings of the Python in Science Conference.