A cyber-linked undergraduate research experience in computational biomolecular structure prediction and design

Computational biology is an interdisciplinary field, and many computational biology research projects involve distributed teams of scientists. To accomplish their work, these teams must overcome both disciplinary and geographic barriers. Introducing new training paradigms is one way to facilitate research progress in computational biology. Here, we describe a new undergraduate program in biomolecular structure prediction and design in which students conduct research at labs located at geographically-distributed institutions while remaining connected through an online community. This 10-week summer program begins with one week of training on computational biology methods development, transitions to eight weeks of research, and culminates in one week at the Rosetta annual conference. To date, two cohorts of students have participated, tackling research topics including vaccine design, enzyme design, protein-based materials, glycoprotein modeling, crowd-sourced science, RNA processing, hydrogen bond networks, and amyloid formation. Students in the program report outcomes comparable to students who participate in similar in-person programs. These outcomes include the development of a sense of community and increases in their scientific self-efficacy, scientific identity, and science values, all predictors of continuing in a science research career. Furthermore, the program attracted students from diverse backgrounds, which demonstrates the potential of this approach to broaden the participation of young scientists from backgrounds traditionally underrepresented in computational biology.

[1]  Adrian A Canutescu,et al.  Cyclic coordinate descent: A robotics algorithm for protein loop closure , 2003, Protein science : a publication of the Protein Society.

[2]  Timothy A. Whitehead,et al.  Computational Design of Proteins Targeting the Conserved Stem Region of Influenza Hemagglutinin , 2011, Science.

[3]  W. Heath The Difference: How the Power of Diversity Creates Better Groups, Firms, Schools, and Societies , 2008 .

[4]  David Baker,et al.  Algorithm discovery by protein folding game players , 2011, Proceedings of the National Academy of Sciences.

[5]  Linda A. Reinen,et al.  BENEFITS AND CHALLENGES OF COURSE-BASED UNDERGRADUATE RESEARCH EXPERIENCES (CURES) FOR STEM STUDENTS: A REPORT FROM THE NATIONAL ACADEMIES OF SCIENCES, ENGINEERING, AND MEDICINE , 2017 .

[6]  Jeffrey J. Gray,et al.  Structure-based non-canonical amino acid design to covalently crosslink an antibody-antigen complex. , 2014, Journal of structural biology.

[7]  Russ Harris,et al.  The Confidence Gap , 2011 .

[8]  D. Baker,et al.  Improved chemical shift based fragment selection for CS-Rosetta using Rosetta3 fragment picker , 2013, Journal of Biomolecular NMR.

[9]  Elisabeth L. Humphris,et al.  Prediction of protein-protein interface sequence diversity using flexible backbone computational protein design. , 2008, Structure.

[10]  Jeffrey Perkel,et al.  Democratic databases: science on GitHub , 2016, Nature.

[11]  David P. Cartrette,et al.  Describing Changes in Undergraduate Students’ Preconceptions of Research Activities , 2012 .

[12]  Why Reu,et al.  Research Experiences for Undergraduates (REU) , 2016 .

[13]  S. Ceci,et al.  Understanding current causes of women's underrepresentation in science , 2011, Proceedings of the National Academy of Sciences.

[14]  Jens Meiler,et al.  ROSETTA3: an object-oriented software suite for the simulation and design of macromolecules. , 2011, Methods in enzymology.

[15]  Russell Schwartz,et al.  Bioinformatics Curriculum Guidelines: Toward a Definition of Core Competencies , 2014, PLoS Comput. Biol..

[16]  D. Baker,et al.  Design of a Novel Globular Protein Fold with Atomic-Level Accuracy , 2003, Science.

[17]  Paul R. Hernandez,et al.  Toward a Model of Social Influence that Explains Minority Student Integration into the Scientific Community. , 2011, Journal of educational psychology.

[18]  M. J. Chang,et al.  Making a Difference in Science Education , 2013, American educational research journal.

[19]  A. Porter,et al.  Preconditions for Interdisciplinary Research , 2011 .

[20]  Barbara K. Goza,et al.  The role of efficacy and identity in science career commitment among underrepresented minority students , 2011 .

[21]  朱真一,et al.  每月一書:Lean In: Women, Work, and the Will to Lead , 2013 .

[22]  Alfred P. Rovai,et al.  The Classroom and School Community Inventory: Development, refinement, and validation of a self-report measure for educational research , 2004, Internet High. Educ..

[23]  D. Baker,et al.  Automated de novo prediction of native-like RNA tertiary structures , 2007, Proceedings of the National Academy of Sciences.

[24]  Holly J. Falk-Krzesinski,et al.  Opinion: Gender diversity leads to better science , 2017, Proceedings of the National Academy of Sciences.

[25]  D. Baker,et al.  Computational Design of Self-Assembling Protein Nanomaterials with Atomic Level Accuracy , 2012, Science.

[26]  David P. Anderson,et al.  BOINC: a system for public-resource computing and storage , 2004, Fifth IEEE/ACM International Workshop on Grid Computing.

[27]  David Baker,et al.  Accurate design of co-assembling multi-component protein nanomaterials , 2014, Nature.

[28]  Seth Cooper,et al.  To Three or not to Three: Improving Human Computation Game Onboarding with a Three-Star System , 2017, CHI.

[29]  D. Baker,et al.  Principles for designing ideal protein structures , 2012, Nature.

[30]  Christine Pfund,et al.  Culturally Diverse Undergraduate Researchers’ Academic Outcomes and Perceptions of Their Research Mentoring Relationships , 2015, International journal of science education.

[31]  Laurel Smith‐Doerr,et al.  Gender diversity leads to better science , 2017 .

[32]  George D. Kuh High-Impact Educational Practices: What They Are, Who Has Access to Them, and Why They Matter , 2008 .

[33]  Brian D. Weitzner,et al.  Blind prediction performance of RosettaAntibody 3.0: Grafting, relaxation, kinematic loop modeling, and full CDR optimization , 2014, Proteins.

[34]  Sandra L. Laursen,et al.  Undergraduate research in the sciences : engaging students in real science , 2010 .

[35]  Jeffrey M. Perkel How scientists use Slack , 2016, Nature.

[36]  M. Linn,et al.  Undergraduate research experiences: Impacts and opportunities , 2015, Science.

[37]  Brian Kuhlman,et al.  Engineering a protein–protein interface using a computationally designed library , 2010, Proceedings of the National Academy of Sciences.

[38]  Kevin Eagan,et al.  Defining Attributes and Metrics of Effective Research Mentoring Relationships , 2016, AIDS and Behavior.

[39]  Jeffrey J. Gray,et al.  De novo design of peptide-calcite biomineralization systems. , 2010, Journal of the American Chemical Society.

[40]  Mark S. Granovetter The Strength of Weak Ties , 1973, American Journal of Sociology.

[41]  W. Bialek,et al.  Introductory Science and Mathematics Education for 21st-Century Biologists , 2004, Science.

[42]  Jared Adolf-Bryfogle,et al.  Residue‐centric modeling and design of saccharide and glycoconjugate structures , 2017, J. Comput. Chem..