G-OnRamp: Generating genome browsers to facilitate undergraduate-driven collaborative genome annotation

Scientists are sequencing new genomes at an increasing rate with the goal of associating genome contents with phenotypic traits. After a new genome is sequenced and assembled, structural gene annotation is often the first step in analysis. Despite advances in computational gene prediction algorithms, most eukaryotic genomes still benefit from manual gene annotation. Manual annotation requires genome browsers that let annotators visualize and evaluate multiple lines of evidence (e.g., sequence similarity, RNA sequencing [RNA-Seq] results, gene predictions, repeats), and it requires many volunteers to carry out the work. To address the technical barriers to creating genome browsers, the Genomics Education Partnership (GEP; https://gep.wustl.edu/) has partnered with the Galaxy Project (https://galaxyproject.org) to develop G-OnRamp (http://g-onramp.org), a web-based platform for creating UCSC Genome Browser Assembly Hubs and JBrowse genome browsers. G-OnRamp also converts a JBrowse instance into an Apollo instance for collaborative genome annotation in research and educational settings. The genome browsers produced can be transferred to the CyVerse Data Store for long-term access. G-OnRamp enables researchers to easily visualize their experimental results, educators to create Course-based Undergraduate Research Experiences (CUREs) centered on genome annotation, and students to participate in genomics research. In the process, students learn about genes and genomes and about how to work with large datasets. Development of G-OnRamp was guided by extensive user feedback. Sixty-five researchers and educators from >40 institutions participated through in-person workshops, which produced >20 genome browsers now available for research and education. Genome browsers generated for four parasitoid wasp species have been used in a CURE engaging students at 15 colleges and universities. Our classroom assessment results demonstrate that the genome browsers produced by G-OnRamp are effective tools for engaging undergraduates in research and for enabling their contributions to the scientific literature in genomics. Expansion of such genomics research/education partnerships will be beneficial to researchers, faculty, and students alike.
