Encoding information in synthetic metabolomes

Biomolecular information systems offer exciting potential advantages and opportunities to complement conventional semiconductor technologies. Much attention has been paid to information-encoding polymers, but small molecules also play important roles in biochemical information systems. Downstream from DNA, the metabolome is an information-rich molecular system with diverse chemical dimensions which could be harnessed for information storage and processing. As a proof of principle of small-molecule postgenomic data storage, here we demonstrate a workflow for representing abstract data in synthetic mixtures of metabolites. Our approach leverages robotic liquid handling for writing digital information into chemical mixtures, and mass spectrometry for extracting the data. We present several kilobyte-scale image datasets stored in synthetic metabolomes, which can be decoded with accuracy exceeding 99% using multi-mass logistic regression. Cumulatively, >100,000 bits of digital image data was written into metabolomes. These early demonstrations provide insight into some of the benefits and limitations of small-molecule chemical information systems.

[1]  Gianni Panagiotou,et al.  Navigating the Human Metabolome for Biomarker Identification and Design of Pharmaceutical Molecules , 2010, Journal of biomedicine & biotechnology.

[2]  Mick Watson,et al.  A Review of Bioinformatics Tools for Bio-Prospecting from Metagenomic Sequence Data , 2017, Front. Genet..

[3]  Peter Dawyndt,et al.  Multifunctional sequence-defined macromolecules for chemical data storage , 2018, Nature Communications.

[4]  Christopher Rose,et al.  Inscribed matter as an energy-efficient means of communication with an extraterrestrial civilization , 2004, Nature.

[5]  John Hardy,et al.  Genome, transcriptome and proteome: the rise of omics data and their integration in biomedical sciences , 2016, Briefings Bioinform..

[6]  Yinjie J. Tang,et al.  Pathway Confirmation and Flux Analysis of Central Metabolic Pathways in Desulfovibrio vulgaris Hildenborough using Gas Chromatography-Mass Spectrometry and Fourier Transform-Ion Cyclotron Resonance Mass Spectrometry , 2006, Journal of bacteriology.

[7]  G. Vladimirov,et al.  Fourier transform ion cyclotron resonance (FT ICR) mass spectrometry: Theory and simulations. , 2016, Mass spectrometry reviews.

[8]  Martin Kircher,et al.  Deep proteome and transcriptome mapping of a human cancer cell line , 2011, Molecular systems biology.

[9]  Christopher Rose,et al.  Parallelized Linear Classification with Volumetric Chemical Perceptrons , 2018, 2018 IEEE International Conference on Rebooting Computing (ICRC).

[10]  Daniel Weindl,et al.  Complexity of dopamine metabolism , 2013, Cell Communication and Signaling.

[11]  Ian D. Wilson,et al.  Managing the challenge of chemically reactive metabolites in drug development , 2011, Nature Reviews Drug Discovery.

[12]  Fumio Matsuda,et al.  Technical Challenges in Mass Spectrometry-Based Metabolomics. , 2016, Mass spectrometry.

[13]  Gamage Upeksha Ganegoda,et al.  New Trends of Digital Data Storage in DNA , 2016, BioMed research international.

[14]  Douglas B. Kell,et al.  The metabolome 18 years on: a concept comes of age , 2016, Metabolomics.

[15]  Jos Hermans,et al.  Direct electrical quantification of glucose and asparagine from bodily fluids using nanopores , 2018, Nature Communications.

[16]  D. Kell,et al.  Mass Spectrometry Tools and Metabolite-specific Databases for Molecular Identification in Metabolomics , 2009 .

[17]  Simone Giannerini,et al.  DNA as information: at the crossroads between biology, mathematics, physics and chemistry , 2016, Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences.

[18]  Hui Sun,et al.  Mass spectrometry-based metabolomics: applications to biomarker and metabolic pathway research. , 2016, Biomedical chromatography : BMC.

[19]  Yaniv Erlich,et al.  DNA Fountain enables a robust and efficient storage architecture , 2016, Science.

[20]  M E Belov,et al.  Zeptomole-sensitivity electrospray ionization--Fourier transform ion cyclotron resonance mass spectrometry of proteins. , 2000, Analytical chemistry.

[21]  Bonnie A. Sheriff,et al.  A 160-kilobit molecular electronic memory patterned at 1011 bits per square centimetre , 2007, Nature.

[22]  Kazuki Saito,et al.  Modern plant metabolomics: advanced natural product gene discoveries, improved technologies, and future prospects. , 2015, Natural product reports.

[23]  Dietmar Schomburg,et al.  MetaboliteDetector: comprehensive analysis tool for targeted and nontargeted GC/MS based metabolome analysis. , 2009, Analytical chemistry.

[24]  Pan-Jun Kim,et al.  Global metabolic interaction network of the human gut microbiota for context-specific community-scale analysis , 2017, Nature Communications.

[25]  Emily Pentzer,et al.  Beyond binary: optical data storage with 0, 1, 2, and 3 in polymer films , 2017 .

[26]  Stefan Lorkowski,et al.  Complexity of vitamin E metabolism. , 2016, World journal of biological chemistry.

[27]  Z. Erdélyi,et al.  Encoding Information into Polyethylene Glycol Using an Alcohol-Isocyanate “Click” Reaction , 2020, International journal of molecular sciences.

[28]  Christopher E Arcadia,et al.  In Situ Nanopore Fabrication and Single-Molecule Sensing with Microscale Liquid Contacts. , 2017, ACS nano.

[29]  G A Nagana Gowda,et al.  Overview of mass spectrometry-based metabolomics: opportunities and challenges. , 2014, Methods in molecular biology.

[30]  Gang Fu,et al.  PubChem Substance and Compound databases , 2015, Nucleic Acids Res..

[31]  Leila Motiei,et al.  Message in a molecule , 2016, Nature Communications.

[32]  M. Mann,et al.  Quantitative, high-resolution proteomics for data-driven systems biology. , 2011, Annual review of biochemistry.

[33]  Gregory Timp,et al.  Reading the primary structure of a protein with 0.07 nm3 resolution using a subnanometre-diameter pore. , 2016, Nature nanotechnology.

[34]  David S. Wishart,et al.  HMDB 4.0: the human metabolome database for 2018 , 2017, Nucleic Acids Res..

[35]  U. Sauer,et al.  Frontiers of high-throughput metabolomics. , 2017, Current opinion in chemical biology.

[36]  G. Church,et al.  Next-Generation Digital Information Storage in DNA , 2012, Science.

[37]  John Parkinson,et al.  The conservation and evolutionary modularity of metabolism , 2009, Genome Biology.

[38]  Olgica Milenkovic,et al.  DNA punch cards for storing data on native DNA sequences via enzymatic nicking , 2020, Nature Communications.

[39]  B. Hammock,et al.  Mass spectrometry-based metabolomics. , 2007, Mass spectrometry reviews.

[40]  Christopher Rose,et al.  Multicomponent molecular memory , 2020, Nature Communications.