Rule-Based Recommendation System for Phylogenetic Inference

Phylogenetic Inference is the reconstruction of a phylogenetic tree that depicts the evolutionary relationship among a group of taxa. This evolutionary relationship between different groups of species is useful for fields such as medicine, forensics, and drug discovery. With the availability of multiple phylogenetic inference algorithms, it is challenging to select the optimal algorithm for novice users. Moreover, it is essential to compare the accuracy and efficiency of different algorithms for a given dataset. Thus, there is a need for a recommendation system to identify the most appropriate phylogenetic inference algorithm for each dataset. This paper presents a Rule-based recommendation component that suggests users with suitable algorithms for inferencing. The evaluation results show the recommendations suggested by the system are 86.7% relevant and accurate.

[1]  Dong Xie,et al.  BEAST 2: A Software Platform for Bayesian Evolutionary Analysis , 2014, PLoS Comput. Biol..

[2]  Arndt von Haeseler,et al.  W-IQ-TREE: a fast online phylogenetic tool for maximum likelihood analysis , 2016, Nucleic Acids Res..

[3]  Anne-Mieke Vandamme,et al.  The Phylogenetic Handbook: A Practical Approach to Phylogenetic Analysis and Hypothesis Testing , 2009 .

[4]  B. Hall,et al.  Simple and accurate estimation of ancestral protein sequences. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[5]  B. Hall Comparison of the accuracies of several phylogenetic methods using protein and DNA sequences. , 2005, Molecular biology and evolution.

[6]  Gang Wang,et al.  MrBayes on a Graphics Processing Unit , 2011, Bioinform..

[7]  Tandy J. Warnow,et al.  DACTAL: divide-and-conquer trees (almost) without alignments , 2012, Bioinform..

[8]  Miguel A. Vega-Rodríguez,et al.  A comparative study on distance methods applied to a multiobjective firefly algorithm for phylogenetic inference , 2013, GECCO.

[9]  Niema Moshiri,et al.  A Two‐State Model of Tree Evolution and Its Applications to Alu Retrotransposition , 2018, Systematic biology.

[10]  O. Gascuel,et al.  New algorithms and methods to estimate maximum-likelihood phylogenies: assessing the performance of PhyML 3.0. , 2010, Systematic biology.

[11]  Robert C. Edgar,et al.  MUSCLE: multiple sequence alignment with high accuracy and high throughput. , 2004, Nucleic acids research.

[12]  Kai Ye,et al.  PVTree: A Sequential Pattern Mining Method for Alignment Independent Phylogeny Reconstruction , 2019, Genes.

[13]  Sudhir Kumar,et al.  MEGA X: Molecular Evolutionary Genetics Analysis across Computing Platforms. , 2018, Molecular biology and evolution.

[14]  Xiaoguang Liu,et al.  Efficient Implementation of MrBayes on Multi-GPU , 2013, Molecular biology and evolution.

[15]  Antonis Rokas,et al.  Phylogenetic Analysis of Protein Sequence Data Using the Randomized Axelerated Maximum Likelihood (RAXML) Program , 2011, Current protocols in molecular biology.

[16]  Olivier Gascuel,et al.  FastME 2.0: A Comprehensive, Accurate, and Fast Distance-Based Phylogeny Inference Program , 2015, Molecular biology and evolution.

[17]  M. Nei,et al.  Efficiencies of fast algorithms of phylogenetic inference under the criteria of maximum parsimony, minimum evolution, and maximum likelihood when a large number of sequences are used. , 2000, Molecular biology and evolution.

[18]  Katharina T. Huber,et al.  ape 3.0: New tools for distance-based phylogenetics and evolutionary analysis in R , 2012, Bioinform..

[19]  Keith A. Crandall,et al.  Phylogenetic Inference Using Molecular Data , 2009 .

[20]  D. A. Meedeniya,et al.  An Interactive Gameplay to Crowdsource Multiple Sequence Alignment of Genome Sequences: Genenigma , 2019, ICBBB '19.

[21]  Pablo A. Goloboff,et al.  TNT, a free program for phylogenetic analysis , 2008 .

[22]  Ziheng Yang,et al.  Molecular Evolution: A Statistical Approach , 2014 .