An efficient discrete PSO coupled with a fast local search heuristic for the DNA fragment assembly problem

Abstract This paper focuses on Particle Swarm Optimization (PSO) applied to the DNA fragment assembly problem. Existing PSO algorithms for this permutation-based combinatorial problem use the Smaller Position Value (SPV) rule to transform continuous vectors into permutations of integers. However, this approach has limitations and is not suitable for this NP-hard problem. Here we propose a new discrete PSO that works directly in the search space of permutations and effectively addresses the fragment assembly problem. In our proposal, the fact that relative ordering of DNA fragments is most indicative of assembly accuracy is exploited in the particle update mechanism. This is implemented through a new operator called Probabilistic Edge Recombination (PER). This operator builds a new position through the probabilistic recombination of edges (adjacency relations) between fragments from the current position, the personal best, and the group best. Additionally, we design variants of the proposed PSO algorithm by applying heuristic information and/or local search. With this aim, we develop a new fast variant of the best state-of-the-art local search algorithm for the assembly problem. Extensive experiments have been conducted to demonstrate the efficiency and effectiveness of the algorithms used. In comparison with the state-of-the-art assembly techniques, our algorithms achieve a better performance.

[1]  Riccardo Poli,et al.  Particle swarm optimization , 1995, Swarm Intelligence.

[2]  Chu-Sing Yang,et al.  A memetic particle swarm optimization algorithm for solving the DNA fragment assembly problem , 2014, Neural Computing and Applications.

[3]  E. Birney,et al.  Velvet: algorithms for de novo short read assembly using de Bruijn graphs. , 2008, Genome research.

[4]  Mehmet Fatih Tasgetiren,et al.  A particle swarm optimization algorithm for makespan and total flowtime minimization in the permutation flowshop sequencing problem , 2007, Eur. J. Oper. Res..

[5]  Mohamed El Yafrani,et al.  A hybrid crow search algorithm for solving the DNA fragment assembly problem , 2018, Expert Syst. Appl..

[6]  Eugene W. Myers,et al.  A whole-genome assembly of Drosophila. , 2000, Science.

[7]  Abdelmadjid Boukra,et al.  CS-ABC: a cooperative system based on artificial bee colony to resolve the DNA fragment assembly problem , 2018 .

[8]  Jun Zhang,et al.  A Novel Set-Based Particle Swarm Optimization Method for Discrete Optimization Problems , 2010, IEEE Transactions on Evolutionary Computation.

[9]  R INDUMATHY,et al.  Nature-inspired novel Cuckoo Search Algorithm for genome sequence assembly , 2015 .

[10]  Eugene W. Myers,et al.  Combinatorial algorithms for DNA sequence assembly , 1995, Algorithmica.

[11]  Enrique Alba,et al.  A Hybrid Genetic Algorithm for the DNA Fragment Assembly Problem , 2008, Recent Advances in Evolutionary Computation for Combinatorial Optimization.

[12]  Konstantinos P. Anagnostopoulos,et al.  A particle swarm optimization based hyper-heuristic algorithm for the classic resource constrained project scheduling problem , 2014, Inf. Sci..

[13]  P. Pevzner,et al.  An Eulerian path approach to DNA fragment assembly , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[14]  Enrique Alba,et al.  DNA fragment assembly using a grid-based genetic algorithm , 2008, Comput. Oper. Res..

[15]  Sheridan K. Houghten,et al.  Benchmark datasets for the DNA fragment assembly problem , 2013, Int. J. Bio Inspired Comput..

[16]  Maurice Clerc,et al.  Discrete Particle Swarm Optimization, illustrated by the Traveling Salesman Problem , 2004 .

[17]  Smaine Mazouzi,et al.  Pe-DFA: Penguins Search Optimisation Algorithm for DNA Fragment Assembly , 2016, Int. J. Appl. Metaheuristic Comput..

[18]  S. Turner,et al.  Real-Time DNA Sequencing from Single Polymerase Molecules , 2009, Science.

[19]  Enrique Alba,et al.  Iterated Local Search for de Novo Genomic Sequencing , 2010, ICAISC.

[20]  Darrell Whitley,et al.  The Travelling Salesman and Sequence Scheduling: Quality Solutions using Genetic Edge Recombination , 1990 .

[21]  Alfredo Milani,et al.  Algebraic Particle Swarm Optimization for the permutations search space , 2017, 2017 IEEE Congress on Evolutionary Computation (CEC).

[22]  Enrique Alba,et al.  A New Local Search Algorithm for the DNA Fragment Assembly Problem , 2007, EvoCOP.

[23]  Adam M Phillippy,et al.  New advances in sequence assembly , 2017, Genome research.

[24]  Enrique Alba,et al.  A self-adaptive cellular memetic algorithm for the DNA fragment assembly problem , 2008, 2008 IEEE Congress on Evolutionary Computation (IEEE World Congress on Computational Intelligence).

[25]  Eugene W. Myers A history of DNA sequence assembly , 2016, it Inf. Technol..

[26]  Aaron A. Klammer,et al.  Nonhybrid, finished microbial genome assemblies from long-read SMRT sequencing data , 2013, Nature Methods.

[27]  Owen White,et al.  TIGR Assembler: A New Tool for Assembling Large Shotgun Sequencing Projects , 1995 .

[28]  Dantong Ouyang,et al.  A hybrid alternate two phases particle swarm optimization algorithm for flow shop scheduling problem , 2010, Comput. Ind. Eng..

[29]  Sheridan K. Houghten,et al.  Restarting and recentering genetic algorithm variations for DNA fragment assembly: The necessity of a multi-strategy approach , 2016, Biosyst..

[30]  Doina Bucur,et al.  A stochastic de novo assembly algorithm for viral-sized genomes obtains correct genomes and builds consensus , 2017, Inf. Sci..

[31]  Hao Gao,et al.  A Hybrid Particle-Swarm Tabu Search Algorithm for Solving Job Shop Scheduling Problems , 2014, IEEE Transactions on Industrial Informatics.

[32]  Timothy P. L. Smith,et al.  Reducing assembly complexity of microbial genomes with single-molecule sequencing , 2013, Genome Biology.

[33]  Steven J. M. Jones,et al.  Abyss: a Parallel Assembler for Short Read Sequence Data Material Supplemental Open Access , 2022 .

[34]  Yeu-Ruey Tzeng,et al.  A revised discrete particle swarm optimization algorithm for permutation flow-shop scheduling problem , 2014, Soft Comput..

[35]  Chu-Sing Yang,et al.  A memetic gravitation search algorithm for solving DNA fragment assembly problems , 2016, J. Intell. Fuzzy Syst..

[36]  Lei Liu,et al.  Particle swarm optimization algorithm: an overview , 2017, Soft Computing.

[37]  Mohammed Azmi Al-Betar,et al.  Hybridizing β-hill climbing with wavelet transform for denoising ECG signals , 2018, Inf. Sci..

[38]  Nachol Chaiyaratana,et al.  DNA fragment assembly using an ant colony system algorithm , 2003, The 2003 Congress on Evolutionary Computation, 2003. CEC '03..

[39]  Stephanie Forrest,et al.  Genetic algorithms, operators, and DNA fragment assembly , 1995, Machine Learning.

[40]  Enrique Alba,et al.  The Problem Aware Local Search algorithm: an efficient technique for permutation-based problems , 2017, Soft Comput..

[41]  Indumathy Rajagopal and Uma Maheswari Sankareswaran An Adaptive Particle Swarm Optimization Algorithm for Solving DNA Fragment Assembly Problem , 2015 .

[42]  Enrique Alba,et al.  An improved trajectory-based hybrid metaheuristic applied to the noisy DNA Fragment Assembly Problem , 2014, Inf. Sci..

[43]  F. Sanger,et al.  Nucleotide sequence of bacteriophage lambda DNA. , 1982, Journal of molecular biology.

[44]  Guillermo Fernández-Anaya,et al.  DNA fragment assembly using optimization , 2013, 2013 IEEE Congress on Evolutionary Computation.

[45]  Vincent A. Cicirello,et al.  The Permutation in a Haystack Problem and the Calculus of Search Landscapes , 2016, IEEE Transactions on Evolutionary Computation.

[46]  Enrique Alba,et al.  An improved problem aware local search algorithm for the DNA fragment assembly problem , 2017, Soft Comput..

[47]  Eugene W. Myers,et al.  Toward Simplifying and Accurately Formulating Fragment Assembly , 1995, J. Comput. Biol..

[48]  Ana Carolina Olivera,et al.  Solving the DNA fragment assembly problem with a parallel discrete firefly algorithm implemented on GPU , 2018, Comput. Sci. Inf. Syst..

[49]  Kumar Sanjay,et al.  DNA Sequence Assembly using Particle Swarm Optimization , 2011 .

[50]  Andries Petrus Engelbrecht,et al.  A study of particle swarm optimization particle trajectories , 2006, Inf. Sci..