Parallel Implementation of Motif-Based Clustering for HT-SELEX Dataset

A clustering method for high-throughput sequencing with SELEX pools (HT-SELEX) is crucial for selecting different types of aptamer candidates. The fast and accurate clustering method is indispensable for an enormous sequence data produced by HT-SELSEX. We have already developed a fast motif-based clustering (FMBC) method for HT-SELEX data implemented by R language. FMBC exhibited high accuracy of sequence clustering compared with conventional methods, while the processing time of FMBC is longer than AptaCluster. This paper proposes the parallel implementation of FMBC using Python with multi-threading to improve the performance of FMBC. Experimental evaluation using the NCBI SRA data of SRR3279661 from BioProject PRJNA315881 demonstrated that parallel FMBC exhibited higher accuracy of clustering and shorter processing time than conventional methods.

[1]  Teresa M. Przytycka,et al.  Identifying high-affinity aptamer ligands with defined cross-reactivity using high-throughput guided systematic evolution of ligands by exponential enrichment , 2015, Nucleic acids research.

[2]  Silvio Bicciato,et al.  APTANI: a computational tool to select aptamers through sequence-structure motif analysis of HT-SELEX data , 2015, Bioinform..

[3]  Robert C. Edgar,et al.  BIOINFORMATICS APPLICATIONS NOTE , 2001 .

[4]  D. Bunka,et al.  Development of aptamer therapeutics. , 2010, Current opinion in pharmacology.

[5]  L. Gold,et al.  Systematic evolution of ligands by exponential enrichment: RNA ligands to bacteriophage T4 DNA polymerase. , 1990, Science.

[6]  Peng Jiang,et al.  MPBind: a Meta-motif-based statistical framework and pipeline to Predict Binding potential of SELEX-derived aptamers , 2014, Bioinform..

[7]  William H. Thiel,et al.  Aptamers as Diagnostic Tools in Cancer , 2018, Pharmaceuticals.

[8]  E. Vermaas,et al.  Selection of single-stranded DNA molecules that bind and inhibit human thrombin , 1992, Nature.

[9]  Teresa M. Przytycka,et al.  AptaCluster - A Method to Cluster HT-SELEX Aptamer Pools and Lessons from Its Application , 2014, RECOMB.

[10]  Theodore R Allnutt,et al.  Shortlisting aptamer Candidates from HT-SELEX data , 2018 .

[11]  Khalid K. Alam,et al.  FASTAptamer: A Bioinformatic Toolkit for High-throughput Sequence Analysis of Combinatorial Selections , 2015, Molecular therapy. Nucleic acids.

[12]  A. Pardi,et al.  Molecular interactions and metal binding in the theophylline-binding core of an RNA aptamer. , 2000, RNA.

[13]  Phuong Dao,et al.  Large scale analysis of the mutational landscape in HT-SELEX improves aptamer discovery , 2015, Nucleic acids research.

[14]  Seung Soo Oh,et al.  Rapid and Label-Free Strategy to Isolate Aptamers for Metal Ions. , 2016, ACS nano.

[15]  J. Szostak,et al.  In vitro selection of RNA molecules that bind specific ligands , 1990, Nature.

[16]  Marco Aurélio Krieger,et al.  Isolation of an Aptamer that Binds Specifically to E. coli , 2016, PloS one.

[17]  Chunhai Fan,et al.  Aptamer-based biosensors , 2008 .