AptaCluster - A Method to Cluster HT-SELEX Aptamer Pools and Lessons from Its Application

Systematic Evolution of Ligands by EXponential Enrichment (SELEX) is a well established experimental procedure to identify aptamers - synthetic single-stranded (ribo)nucleic molecules that bind to a given molecular target. Recently, new sequencing technologies have revolutionized the SELEX protocol by allowing for deep sequencing of the selection pools after each cycle. The emergence of High Throughput SELEX (HT-SELEX) has opened the field to new computational opportunities and challenges that are yet to be addressed. To aid the analysis of the results of HT-SELEX and to advance the understanding of the selection process itself, we developed AptaCluster. This algorithm allows for an efficient clustering of whole HT-SELEX aptamer pools; a task that could not be accomplished with traditional clustering algorithms due to the enormous size of such datasets. We performed HT-SELEX with Interleukin 10 receptor alpha chain (IL-10RA) as the target molecule and used AptaCluster to analyze the resulting sequences. AptaCluster allowed for the first survey of the relationships between sequences in different selection rounds and revealed previously not appreciated properties of the SELEX protocol. As the first tool of this kind, AptaCluster enables novel ways to analyze and to optimize the HT-SELEX procedure. Our AptaCluster algorithm is available as a very fast multiprocessor implementation upon request.

[1]  Bertrand Tavitian,et al.  Nucleic acid aptamers in cancer medicine , 2002, FEBS letters.

[2]  Gaya K Amarasinghe,et al.  Development of RNA aptamers targeting Ebola virus VP35. , 2013, Biochemistry.

[3]  Juan M. Vaquerizas,et al.  Multiplexed massively parallel SELEX for characterization of human transcription factor binding specificities. , 2010, Genome research.

[4]  Teresa M. Przytycka,et al.  Identification of sequence-structure RNA binding motifs for SELEX-derived aptamers , 2012, Bioinform..

[5]  J. Szostak,et al.  In vitro selection of RNA molecules that bind specific ligands , 1990, Nature.

[6]  William H. Thiel,et al.  Isolation and Optimization of Murine IL-10 Receptor Blocking Oligonucleotide Aptamers Using High-throughput Sequencing. , 2012, Molecular therapy : the journal of the American Society of Gene Therapy.

[7]  M. Biggin,et al.  High-throughput SELEX determination of DNA sequences bound by transcription factors in vitro. , 2012, Methods in molecular biology.

[8]  M. Gu,et al.  Advances in aptamer screening and small molecule aptasensors. , 2014, Advances in biochemical engineering/biotechnology.

[9]  Piotr Indyk,et al.  Similarity Search in High Dimensions via Hashing , 1999, VLDB.

[10]  E. Riley,et al.  IL-10: The Master Regulator of Immunity to Infection , 2008, The Journal of Immunology.

[11]  Yue Zhao,et al.  Inferring Binding Energies from Selected Binding Sites , 2009, PLoS Comput. Biol..

[12]  Mark P. McPike,et al.  Acyclic Identification of Aptamers for Human alpha-Thrombin Using Over-Represented Libraries and Deep Sequencing , 2011, PloS one.

[13]  Kemin Wang,et al.  Whole Cell-SELEX Aptamers for Highly Specific Fluorescence Molecular Imaging of Carcinomas In Vivo , 2013, PloS one.

[14]  Boris Schling The Boost C++ Libraries , 2011 .

[15]  Zuben E. Sauna,et al.  Aptamers as a Sensitive Tool to Detect Subtle Modifications in Therapeutic Proteins , 2012, PloS one.

[16]  Alexandr Andoni,et al.  Near-Optimal Hashing Algorithms for Approximate Nearest Neighbor in High Dimensions , 2006, 2006 47th Annual IEEE Symposium on Foundations of Computer Science (FOCS'06).

[17]  Liqing Zhang,et al.  Performance comparison between k-tuple distance and four model-based distances in phylogenetic tree reconstruction , 2008, Nucleic acids research.