Ten Simple Rules for Taking Advantage of git and GitHub

A ‘Ten Simple Rules’ guide to git and GitHub. We describe, with examples, how to use these tools to track projects as individual users, teams, and organizations. We document collaborative development using branching and forking; interaction between collaborators using issues; and continuous integration and automation using, for example, Travis CI and Codecov. We also describe the dissemination and social aspects of GitHub, such as GitHub Pages and following and watching repositories, and give advice on how to make code citable.
