Linkage disequilibrium study with a parallel adaptive GA

In this article, we model a linkage disequilibrium study (genomic study) as an optimization problem where a given objective function has to be optimized. The objective of the study is to discover haplotypes (associations of genetic markers) candidate to explain multi-factorial diseases such as diabetes or obesity. To determine what kind of algorithm will be able to solve this problem, we first study the specificities and the structure of the problem. Results of this study show that exact algorithms are not adapted to this specific problem and lead us to the development of a parallel dedicated adaptive multipopulation genetic algorithm that is able to find several haplotypes of different sizes. After describing the genomic problem, we present the dedicated genetic algorithm, its specificities, such as the use of several populations and its advanced mechanisms such as the adaptive choice of operators, random immigrants, and its parallel implementation. Results on a real dataset are given.

[1]  David E. Goldberg,et al.  Genetic Algorithms in Search Optimization and Machine Learning , 1988 .

[2]  D. Curtis,et al.  Monte Carlo tests for associations between disease and alleles at highly polymorphic loci , 1995, Annals of human genetics.

[3]  Manabu Kotani,et al.  Feature Extraction Using Genetic Algorithms , 1999 .

[4]  Tzung-Pei Hong,et al.  Simultaneously Applying Multiple Mutation Operators in Genetic Algorithms , 2000, J. Heuristics.

[5]  Bruce Rannala,et al.  DMLE+: Bayesian linkage disequilibrium gene mapping , 2002, Bioinform..

[6]  C. Sabatti,et al.  Bayesian analysis of haplotypes for linkage disequilibrium mapping. , 2001, Genome research.

[7]  D Curtis,et al.  Use of an artificial neural network to detect association between a disease and multiple marker genotypes , 2001, Annals of human genetics.

[8]  El-Ghazali Talbi,et al.  Chapter 14 – Discovery of Genetic and Environmental Interactions in Disease Data Using Evolutionary Computation , 2003 .

[9]  L. Lazzeroni,et al.  Linkage disequilibrium and gene mapping: an empirical least-squares approach. , 1998, American journal of human genetics.

[10]  W. Punch,et al.  Feature Extraction Using Genetic Algorithms , 1997 .

[11]  J. Kere,et al.  Data mining applied to linkage disequilibrium mapping. , 2000, American journal of human genetics.

[12]  Jack Dongarra,et al.  PVM: Parallel virtual machine: a users' guide and tutorial for networked parallel computing , 1995 .

[13]  K. Roeder,et al.  Disequilibrium mapping: composite likelihood for pairwise disequilibrium. , 1996, Genomics.

[14]  Jurg Ott,et al.  Handbook of Human Genetic Linkage , 1994 .

[15]  D. Boichard,et al.  Utilisation des marqueurs moléculaires en génétique animale , 1998 .

[16]  Hiroshi Motoda,et al.  Feature Extraction, Construction and Selection: A Data Mining Perspective , 1998 .

[17]  J. Terwilliger A powerful likelihood method for the analysis of linkage disequilibrium between trait loci and one or more polymorphic marker loci. , 1995, American journal of human genetics.

[18]  Clare Bates Congdon,et al.  A comparison of genetic algorithms and other machine learning systems on a complex classification task from common disease research , 1995 .