Improving CUDA DNA Analysis Software with Genetic Programming

We genetically improve BarraCUDA using a BNF grammar incorporating C scoping rules with GP. Barracuda maps next generation DNA sequences to the human genome using the Burrows-Wheeler algorithm (BWA) on nVidia Tesla parallel graphics hardware (GPUs). GI using phenotypic tabu search with manually grown code can graft new features giving more than 100 fold speed up on a performance critical kernel without loss of accuracy.

[1]  George R. Price,et al.  Selection and Covariance , 1970, Nature.

[2]  Gilbert Syswerda,et al.  Uniform Crossover in Genetic Algorithms , 1989, ICGA.

[3]  John R. Koza,et al.  Genetic programming (videotape): the movie , 1992 .

[4]  Riccardo Poli,et al.  Fitness Causes Bloat: Mutation , 1997, EuroGP.

[5]  Christopher Harris,et al.  An investigation into the application of genetic programming techniques to signal analysis and feature detection , 1998 .

[6]  William B. Langdon,et al.  Seeding Genetic Programming Populations , 2000, EuroGP.

[7]  International Human Genome Sequencing Consortium Initial sequencing and analysis of the human genome , 2001, Nature.

[8]  J. V. Moran,et al.  Initial sequencing and analysis of the human genome. , 2001, Nature.

[9]  Riccardo Poli,et al.  A Field Guide to Genetic Programming , 2008 .

[10]  Richard Durbin,et al.  Sequence analysis Fast and accurate short read alignment with Burrows – Wheeler transform , 2009 .

[11]  D. Altshuler,et al.  A map of human genome variation from population-scale sequencing , 2010, Nature.

[12]  Richard Durbin,et al.  Fast and accurate long-read alignment with Burrows–Wheeler transform , 2010, Bioinform..

[13]  Mark Harman,et al.  Evolving a CUDA kernel from an nVidia template , 2010, IEEE Congress on Evolutionary Computation.

[14]  Graham Pullan,et al.  BarraCUDA - a fast short read sequence aligner using graphics processing units , 2011, BMC Research Notes.

[15]  Claire Le Goues,et al.  GenProg: A Generic Method for Automatic Software Repair , 2012, IEEE Transactions on Software Engineering.

[16]  William B. Langdon,et al.  Mycoplasma contamination in the 1000 Genomes Project , 2014, BioData Mining.

[17]  Mark Harman,et al.  Babel Pidgin: SBSE Can Grow and Graft Entirely New Functionality into a Real World System , 2014, SSBSE.

[18]  Mark Harman,et al.  Using Genetic Improvement and Code Transplants to Specialise a C++ Program to a Problem Class , 2014, EuroGP.

[19]  Mark Harman,et al.  Genetically Improved CUDA C++ Software , 2014, EuroGP.

[20]  Mark Harman,et al.  Improving 3D medical image registration CUDA software with genetic programming , 2014, GECCO.

[21]  William B. Langdon,et al.  Genetically Improved Software , 2015, Handbook of Genetic Programming Applications.

[22]  Mark Harman,et al.  Ieee Transactions on Evolutionary Computation 1 , 2022 .

[23]  Yves Le Traon,et al.  Trivial Compiler Equivalence: A Large Scale Empirical Study of a Simple, Fast and Effective Equivalent Mutant Detection Technique , 2015, 2015 IEEE/ACM 37th IEEE International Conference on Software Engineering.