Controlling the structure of sequence-defined poly(phosphodiester)s for optimal MS/MS reading of digital information.

Digital polymers are monodisperse chains with a controlled sequence of co-monomers, defined as letters of an alphabet, and are used to store information at the molecular level. Reading such messages is hence a sequencing task that can be efficiently achieved by tandem mass spectrometry. To improve readability, the structure of sequence-controlled synthetic polymers can be optimized based on their fragmentation behavior. This strategy is described here for poly(phosphodiester)s, which were synthesized as monodisperse chains with more than 100 units but exhibited extremely complex dissociation spectra. In these polymers, two repeating units that differ by a simple H/CH3 variation were defined as the 0 and 1 bits of the ASCII code and were spaced by a phosphate moiety. They were readily ionized in negative ion mode electrospray but dissociated via cleavage at all phosphate bonds upon collisional activation. Although this fragmentation behavior allowed complete sequence coverage of digital poly(phosphodiester)s, it was not efficient for macromolecules with more than 50 co-monomers, and data interpretation was very tedious. The structure of these polymers was then modified by introducing alkoxyamine linkages at appropriate locations throughout the chain. A first design consisted of placing these low-dissociation-energy bonds between consecutive monomeric bits: while cleavage of this sole bond greatly simplified MS/MS spectra, efficient sequencing was limited to chains with up to about 50 units. In contrast, introducing alkoxyamine bonds between consecutive bytes (i.e., sets of eight co-monomers) was a more successful strategy. Long messages (so far, up to 8 bytes) could be read in MS3 experiments, where single-byte-containing fragments released during the first activation stage were further dissociated for sequencing. The whole sequence of such byte-truncated poly(phosphodiester)s could be easily reconstructed using a mass tagging system that makes it possible to determine the original location of each byte in the chain. Copyright © 2017 John Wiley & Sons, Ltd.
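The byte-level reading scheme described above can be illustrated with a minimal sketch. It only models the information flow, not the chemistry or any real mass values. Each ASCII character becomes one byte of eight 0/1 co-monomers. Each byte carries a position tag, which here is a plain integer standing in for a distinct tag mass. The fragments are shuffled to mimic the loss of ordering after inter-byte cleavage, and the message is rebuilt by sorting on the tag. The function names `encode` and `decode` and the integer tag representation are assumptions made for illustration; they do not come from the paper.

```python
import random


def encode(message):
    """Split an ASCII message into tagged bytes.

    Returns a list of (tag, bits) pairs. `tag` is a hypothetical integer
    label standing in for a distinct mass tag; `bits` is a string of eight
    '0'/'1' characters, one per co-monomer.
    """
    fragments = []
    for position, char in enumerate(message):
        code = ord(char)
        if code > 127:
            raise ValueError("only ASCII characters are supported")
        fragments.append((position, format(code, "08b")))
    return fragments


def decode(fragments):
    """Rebuild the message from byte fragments received in any order."""
    ordered = sorted(fragments, key=lambda fragment: fragment[0])
    return "".join(chr(int(bits, 2)) for _, bits in ordered)


# An 8-byte message, matching the longest chains reported so far.
fragments = encode("Sequence")

# Inter-byte cleavage releases the bytes without preserving their order.
random.Random(0).shuffle(fragments)

# Sorting on the tag restores the original byte order.
print(decode(fragments))
```

In a real experiment, the bit string of each byte would itself be deduced from the fragment ions observed in the second activation stage; the sketch takes that step as given.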
