Tailor-made multiple sequence alignments using the PRALINE 2 alignment toolkit

Abstract Summary PRALINE 2 is a toolkit for custom multiple sequence alignment workflows. It can be used to incorporate sequence annotations, such as secondary structure or (DNA) motifs, into the alignment scoring, as well as to customize many other aspects of a progressive multiple alignment workflow. Availability and implementation PRALINE 2 is implemented in Python and available as open source software on GitHub: https://github.com/ibivu/PRALINE/.

[1]  Kenji Mizuguchi,et al.  HOMSTRAD: recent developments of the Homologous Protein Structure Alignment Database , 2004, Nucleic Acids Res..

[2]  Wan Fokkink,et al.  Motif-Aware PRALINE: Improving the alignment of motif regions , 2018, PLoS Comput. Biol..

[3]  S. Henikoff,et al.  Amino acid substitution matrices from protein blocks. , 1992, Proceedings of the National Academy of Sciences of the United States of America.

[4]  Amos Bairoch,et al.  The PROSITE database , 2005, Nucleic Acids Res..

[5]  Jaap Heringa,et al.  Two Strategies for Sequence Comparison: Profile-preprocessed and Secondary Structure-induced Multiple Alignment , 1999, Comput. Chem..

[6]  Johannes Söding,et al.  The MPI bioinformatics Toolkit as an integrative platform for advanced protein sequence and structure analysis , 2016, Nucleic Acids Res..

[7]  Kazutaka Katoh,et al.  Parallelization of MAFFT for large-scale multiple sequence alignments , 2018, Bioinform..

[8]  Fabian Sievers,et al.  Clustal Omega for making accurate alignments of many protein sequences , 2018, Protein science : a publication of the Protein Society.

[9]  Jaap Heringa,et al.  ConBind: motif-aware cross-species alignment for the identification of functional transcription factor binding sites , 2015, Nucleic acids research.

[10]  P. Hogeweg,et al.  The alignment of sets of sequences and the construction of phyletic trees: An integrated method , 2005, Journal of Molecular Evolution.

[11]  D. Higgins,et al.  Fast, scalable generation of high-quality protein multiple sequence alignments using Clustal Omega , 2011, Molecular systems biology.

[12]  Jaap Heringa,et al.  PRALINE: a multiple sequence alignment toolbox that integrates homology-extended and secondary structure information , 2005, Nucleic Acids Res..

[13]  Olivier Poch,et al.  BAliBASE 3.0: Latest developments of the multiple sequence alignment benchmark , 2005, Proteins.