Nucleotide-time alignment for molecular recorders

Using a DNA polymerase to record intracellular calcium levels has been proposed as a novel neural recording technique, promising massive-scale, single-cell resolution monitoring of large portions of the brain. This technique relies on local storage of neural activity in strands of DNA, followed by offline analysis of that DNA. In simple implementations of this scheme, the time when each nucleotide was written cannot be determined directly by post-hoc DNA sequencing; the timing data must be estimated instead. Here, we use a Dynamic Time Warping-based algorithm to perform this estimation, exploiting correlations between neural activity and observed experimental variables to translate DNA-based signals to an estimate of neural activity over time. This algorithm improves the parallelizability of traditional Dynamic Time Warping, allowing several-fold increases in computation speed. The algorithm also provides a solution to several critical problems with the molecular recording paradigm: determining recording start times and coping with DNA polymerase pausing. The algorithm can generally locate DNA-based records to within <10% of a recording window, allowing for the estimation of unobserved incorporation times and latent neural tunings. We apply our technique to an in silico motor control neuroscience experiment, using the algorithm to estimate both timings of DNA-based data and the directional tuning of motor cortical cells during a center-out reaching task. We also use this algorithm to explore the impact of polymerase characteristics on system performance, determining the precision of a molecular recorder as a function of its kinetic and error-generating properties. We find useful ranges of properties for DNA polymerase-based recorders, providing guidance for future protein engineering attempts. This work demonstrates a useful general extension to dynamic alignment algorithms, as well as direct applications of that extension toward the development of molecular recorders, providing a necessary stepping stone for future biological work.

[1]  Eamonn J. Keogh,et al.  Accelerating Dynamic Time Warping Subsequence Search with GPUs and FPGAs , 2010, 2010 IEEE International Conference on Data Mining.

[2]  Jonathan D. Victor,et al.  Dynamic programming algorithms for comparing multineuronal spike trains via cost-based metrics and alignments , 2007, Journal of Neuroscience Methods.

[3]  Seiichi Nakagawa,et al.  Speaker-Independent English Consonant and Japanese Word Recognition by a Stochastic Dynamic Time Warping Method , 1988 .

[4]  Konrad Kording,et al.  The Database for Reaching Experiments and Models , 2013, PloS one.

[5]  Robert S. Anderssen,et al.  Dawn and Dusk Set States of the Circadian Oscillator in Sprouting Barley (Hordeum vulgare) Seedlings , 2015, PloS one.

[6]  Philipp J. Keller,et al.  Whole-brain functional imaging at cellular resolution using light-sheet microscopy , 2013, Nature Methods.

[7]  Marc W Slutzky,et al.  Statistical assessment of the stability of neural movement representations. , 2011, Journal of neurophysiology.

[8]  Aaron E. Rosenberg,et al.  Performance tradeoffs in dynamic time warping algorithms for isolated word recognition , 1980 .

[9]  Biing-Hwang Juang,et al.  Fundamentals of speech recognition , 1993, Prentice Hall signal processing series.

[10]  S. Chiba,et al.  Dynamic programming algorithm optimization for spoken word recognition , 1978 .

[11]  Antoine M. van Oijen,et al.  Real-time single-molecule observation of rolling-circle DNA replication , 2009, Nucleic acids research.

[12]  Michael Nelson,et al.  A macroscopic kinetic model for DNA polymerase elongation and high-fidelity nucleotide selection , 2005, Comput. Biol. Chem..

[13]  Limin Xiao,et al.  Parallelizing Dynamic Time Warping Algorithm Using Prefix Computations on GPU , 2013, 2013 IEEE 10th International Conference on High Performance Computing and Communications & 2013 IEEE International Conference on Embedded and Ubiquitous Computing.

[14]  Eli Brenner,et al.  Structure learning and the Occam's razor principle: a new view of human function acquisition , 2014, Front. Comput. Neurosci..

[15]  R. Yuste,et al.  The Brain Activity Map Project and the Challenge of Functional Connectomics , 2012, Neuron.

[16]  Sebastian Magierowski,et al.  Internal Readout System for Molecular Recorders , 2015, IEEE Transactions on Molecular, Biological and Multi-Scale Communications.

[17]  Stephen R. Quake,et al.  Single molecule measurement of the “speed limit” of DNA polymerase , 2009, Proceedings of the National Academy of Sciences.

[18]  Thomas L. Dean,et al.  On the Technology Prospects and Investment Opportunities for Scalable Neuroscience , 2013, 1307.7302.

[19]  Donald J. Berndt,et al.  Using Dynamic Time Warping to Find Patterns in Time Series , 1994, KDD Workshop.

[20]  T. Gasser,et al.  Alignment of curves by dynamic time warping , 1997 .

[21]  Theodore J. Perkins Maximum likelihood trajectories for continuous-time Markov chains , 2009, NIPS.

[22]  R. Bambara,et al.  Site-specific pausing of deoxyribonucleic acid synthesis catalyzed by four forms of Escherichia coli DNA polymerase III. , 1983, Biochemistry.

[23]  L. Miller,et al.  Accurate decoding of reaching movements from field potentials in the absence of spikes , 2012, Journal of neural engineering.

[24]  Samuel H. Wilson,et al.  Efficiency of Correct Nucleotide Insertion Governs DNA Polymerase Fidelity* , 2002, The Journal of Biological Chemistry.

[25]  Konrad P. Körding,et al.  Statistical Analysis of Molecular Signal Recording , 2013, PLoS Comput. Biol..

[26]  Konrad P. Körding,et al.  Puzzle Imaging: Using Large-Scale Dimensionality Reduction Algorithms for Localization , 2015, PloS one.

[27]  B.-H. Juang,et al.  On the hidden Markov model and dynamic time warping for speech recognition — A unified view , 1984, AT&T Bell Laboratories Technical Journal.

[28]  P. J. Sjöström,et al.  Functional specificity of local synaptic connections in neocortical networks , 2011, Nature.

[29]  F. Itakura,et al.  Minimum prediction residual principle applied to speech recognition , 1975 .

[30]  T. Kunkel DNA Replication Fidelity* , 2004, Journal of Biological Chemistry.

[31]  Ninghui Sun,et al.  Locality and Parallelism Optimization for Dynamic Programming Algorithm in Bioinformatics , 2006, ACM/IEEE SC 2006 Conference (SC'06).

[32]  Keith E. J. Tyo,et al.  Measuring Cation Dependent DNA Polymerase Fidelity Landscapes by Deep Sequencing , 2012, PloS one.

[33]  M. Chamberlin,et al.  Analysis and suppression of DNA polymerase pauses associated with a trinucleotide consensus. , 1996, Nucleic acids research.

[34]  Jan M. Rabaey,et al.  Physical principles for scalable neural recording , 2013, Front. Comput. Neurosci..