Biopython: freely available Python tools for computational molecular biology and bioinformatics

Summary: The Biopython project is a mature open source international collaboration of volunteer developers, providing Python libraries for a wide range of bioinformatics problems. Biopython includes modules for reading and writing different sequence file formats and multiple sequence alignments, dealing with 3D macro molecular structures, interacting with common tools such as BLAST, ClustalW and EMBOSS, accessing key online databases, as well as providing numerical methods for statistical learning. Availability: Biopython is freely available, with documentation and source code at www.biopython.org under the Biopython license. Contact: All queries should be directed to the Biopython mailing lists, see www.biopython.org/wiki/_Mailing_listspeter.cock@scri.ac.uk.

[1]  Konrad Hinsen,et al.  The molecular modeling toolkit: A new approach to molecular simulations , 2000, J. Comput. Chem..

[2]  Travis E. Oliphant,et al.  Python for Scientific Computing , 2007, Computing in Science & Engineering.

[3]  Cymon J Cox,et al.  WASABI: an automated sequence processing system for multigene phylogenies. , 2007, Systematic biology.

[4]  Leighton Pritchard,et al.  GenomeDiagram: a python package for the visualization of large-scale genomic data , 2006, Bioinform..

[5]  Jeffrey Chang,et al.  Biopython: Python tools for computational biology , 2000, SIGB.

[6]  Konrad Hinsen The molecular modeling toolkit: A new approach to molecular simulations , 2000 .

[7]  S Miyano,et al.  Open source clustering software. , 2004, Bioinformatics.

[8]  D. Maddison,et al.  NEXUS: an extensible file format for systematic information. , 1997, Systematic biology.

[9]  Bernard Manderick,et al.  PDB file parser and structure class implemented in Python , 2003, Bioinform..

[10]  Dan Wu,et al.  EMBL Nucleotide Sequence Database in 2006 , 2006, Nucleic Acids Res..

[11]  Satoru Miyano,et al.  Open source clustering software , 2004 .

[12]  David L. Wheeler,et al.  GenBank , 2015, Nucleic Acids Res..

[13]  Mark J. Pallen,et al.  xBASE, a collection of online databases for bacterial comparative genomics , 2005, Nucleic Acids Res..

[14]  Andreas Prlic,et al.  Sequence analysis , 2003 .

[15]  F. Rousset genepop’007: a complete re‐implementation of the genepop software for Windows and Linux , 2008, Molecular ecology resources.

[16]  Laurent Excoffier,et al.  SIMCOAL 2.0: a program to simulate genomic diversity over large recombining regions in a subdivided population with a complex history , 2004, Bioinform..

[17]  M. Beaumont,et al.  Evaluating loci for use in the genetic analysis of population structure , 1996, Proceedings of the Royal Society of London. Series B: Biological Sciences.

[18]  Evelyn Camon,et al.  The EMBL Nucleotide Sequence Database , 2000, Nucleic Acids Res..

[19]  Matthew R. Pocock,et al.  The Bioperl toolkit: Perl modules for the life sciences. , 2002, Genome research.

[20]  J. Thompson,et al.  CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. , 1994, Nucleic acids research.

[21]  I. Longden,et al.  EMBOSS: the European Molecular Biology Open Software Suite. , 2000, Trends in genetics : TIG.

[22]  M. Prentice,et al.  Bacterial comparative genomics , 2004, Genome Biology.

[23]  D. Lipman,et al.  Improved tools for biological sequence comparison. , 1988, Proceedings of the National Academy of Sciences of the United States of America.

[24]  M. P. Cummings PHYLIP (Phylogeny Inference Package) , 2004 .