Two-Level ACO for Haplotype Inference Under Pure Parsimony

Haplotype Inference is a challenging problem in bioinformatics that consists in inferring the basic genetic constitution of diploid organisms on the basis of their genotype. This information enables researchers to perform association studies for the genetic variants involved in diseases and the individual responses to therapeutic agents. A notable approach to the problem is to encode it as a combinatorial problem under certain hypotheses (such as the pure parsimonycriterion) and to solve it using off-the-shelf combinatorial optimization techniques. At present, the main methods applied to Haplotype Inference are either simple greedy heuristic or exact methods, which are adequate only for moderate size instances. In this paper, we present an iterative constructive approach to Haplotype Inference based on ACO and we compare it against a state-of-the-art exact method.

[1]  Andrew G. Clark,et al.  Computational Methods for SNPs and Haplotype Inference , 2002, Lecture Notes in Computer Science.

[2]  Inês Lynce,et al.  Efficient Haplotype Inference with Boolean Satisfiability , 2006, AAAI.

[3]  Dan Gusfield,et al.  Haplotype Inference by Pure Parsimony , 2003, CPM.

[4]  A. Clark,et al.  Inference of haplotypes from PCR-amplified samples of diploid populations. , 1990, Molecular biology and evolution.

[5]  Armin Biere,et al.  Theory and Applications of Satisfiability Testing - SAT 2006, 9th International Conference, Seattle, WA, USA, August 12-15, 2006, Proceedings , 2006, SAT.

[6]  Toshihiro Tanaka The International HapMap Project , 2003, Nature.

[7]  Daniel G. Brown,et al.  Integer programming approaches to haplotype inference by pure parsimony , 2006, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[8]  M. Olivier A haplotype map of the human genome , 2003, Nature.

[9]  Inês Lynce,et al.  Efficient Haplotype Inference with Pseudo-boolean Optimization , 2007, AB.

[10]  Marco Dorigo,et al.  The hyper-cube framework for ant colony optimization , 2004, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[11]  M. Olivier A haplotype map of the human genome. , 2003, Nature.

[12]  Manuel López-Ibáñez,et al.  Ant colony optimization , 2010, GECCO '10.

[13]  Ting Chen,et al.  An approximation algorithm for haplotype inference by maximum parsimony , 2005, SAC '05.

[14]  Konstantinos Kalpakis,et al.  Haplotype phasing using semidefinite programming , 2005, Fifth IEEE Symposium on Bioinformatics and Bioengineering (BIBE'05).

[15]  Xiang-Sun Zhang,et al.  Haplotype Inference by Pure Parsimony via Genetic Algorithm , 1997 .

[16]  Inês Lynce,et al.  SAT in Bioinformatics: Making the Case with Haplotype Inference , 2006, SAT.

[17]  Zeng Changqing The International Human Genome Haplotype Map (HapMap) Project and Its China Chapter , 2003 .

[18]  P. Tam The International HapMap Consortium. The International HapMap Project (Co-PI of Hong Kong Centre which responsible for 2.5% of genome) , 2003 .

[19]  Andrea Roli,et al.  Stochastic local search for large-scale instances of the haplotype inference problem by pure parsimony , 2008, J. Algorithms.

[20]  Shibu Yooseph,et al.  A Survey of Computational Methods for Determining Haplotypes , 2002, Computational Methods for SNPs and Haplotype Inference.

[21]  Giuseppe Lancia,et al.  Haplotyping Populations by Pure Parsimony: Complexity of Exact and Approximation Algorithms , 2004, INFORMS J. Comput..