Algorithms for the Calculation and Visualisation of Phylogenetic Networks

Evolution describes the development of species as a steady adaption to the environment. In the classical model, a species develops by mutation and speciation events, which can be modelled using phylogenetic trees. However, if the evolutionary model is generalized by integrating events such as recombination, a tree can no longer describe the process. Phylogenetic networks are a class of graphs that have been developed to describe these more complex processes. These networks can be divided into two groups: those networks that model evolutionary events explicitly and those networks that do so implicitly. In this thesis, we focus on the development of new algorithms for the reconstruction and visualization of explicit phylogenetic networks. A reconstruction is called optimal if it minimizes the evolutionary costs. The large number of possible graphs from which one can choose an optimal solution presents one of the hardest problems in the reconstruction, since no possibility exists to choose the right one efficiently. One possible way to reduce the number of graphs one has to choose from, is to break down the problem into smaller independent sub-problems. By carefully reducing the class of phylogenetic networks under consideration, we were able to show that indeed the main problem can be broken up into smaller parts. Furthermore, we developed an efficient algorithm for calculating all optimal solutions for each independent sub-problem. In addition, we developed an algorithm that is capable of drawing explicit phylogenetic networks. The algorithm was designed in such a way that the algorithms available for drawing phylogenetic trees can be generalized to draw explicit phylogenetic networks. To do so, we modified the explicit phylogenetic network and extended the tree drawing algorithm by adding an optimization step, which minimizes the number of crossing edges. Furthermore, we present two software projects which aim at extending the availability of new phylogenetic methods within SplitsTree and increasing the useablility of large phylogenetic graphs. In the first project, we implemented a management system for plugins, which allows the application to dynamically integrate new methods that are stored on a database in the Internet. In the second project, we developed software that allows for the annotation of phylogenetic graphs within SplitsTree.

[1]  Bernard M. E. Moret,et al.  NetGen: generating phylogenetic networks with diploid hybrids , 2006, Bioinform..

[2]  Charles Semple,et al.  Supertree algorithms for ancestral divergence dates and nested taxa , 2004, Bioinform..

[3]  W. Maddison Gene Trees in Species Trees , 1997 .

[4]  Margherita Pagani,et al.  Second Edition , 2004 .

[5]  Christopher A. Meacham,et al.  Theoretical and Computational Considerations of the Compatibility of Qualitative Taxonomic Characters , 1983 .

[6]  Tandy J. Warnow,et al.  Reconstructing reticulate evolution in species: theory and practice , 2004, RECOMB.

[7]  R. Griffiths,et al.  Bounds on the minimum number of recombination events in a sample history. , 2003, Genetics.

[8]  M. Nivat Theoretical Computer Science Volume 213-214 , 1999 .

[9]  Error introduced by the infinite-site model. , 1992, Molecular biology and evolution.

[10]  D. Maddison,et al.  NEXUS: an extensible file format for systematic information. , 1997, Systematic biology.

[11]  H. B. Kettlewell Further selection experiments on industrial melanism in the Lepidoptera , 1956, Heredity.

[12]  Charles Semple,et al.  Tree Reconstruction via a Closure Operation on Partial Splits , 2000, JOBIM.

[13]  W. Martin,et al.  ANNOTATED ENGLISH TRANSLATION OF MERESCHKOWSKY'S 1905 PAPER 'UBER NATUR UND URSPRUNG DER CHROMATOPHOREN IM PFLANZENREICHE' , 1999 .

[14]  B. Pryor,et al.  Molecular characterization of Embellisia and Nimbya species and their relationship to Alternaria, Ulocladium and Stemphylium. , 2003, Mycologia.

[15]  Loren H Rieseberg,et al.  Reconstructing patterns of reticulate evolution in plants. , 2004, American journal of botany.

[16]  T. Sang,et al.  Testing hybridization hypotheses based on incongruent gene trees. , 2000, Systematic biology.

[17]  Patricia J. Johnson,et al.  Ancient Invasions: From Endosymbionts to Organelles , 2004, Science.

[18]  Aravaipa Canyon Basin,et al.  Volume 3 , 2012, Journal of Diabetes Investigation.

[19]  Kaizhong Zhang,et al.  Perfect Phylogenetic Networks with Recombination , 2001, J. Comput. Biol..

[20]  Yun S. Song,et al.  On the minimum number of recombination events in the evolutionary history of DNA sequences , 2004, Journal of mathematical biology.

[21]  Yun S. Song,et al.  Constructing Minimal Ancestral Recombination Graphs , 2005, J. Comput. Biol..

[22]  M. Kimura The number of heterozygous nucleotide sites maintained in a finite population due to steady flux of mutations. , 1969, Genetics.

[23]  Ronald L. Rivest,et al.  Introduction to Algorithms, Second Edition , 2001 .