Encoding Information in Synthetic Metabolomes

Biomolecular information systems offer numerous potential advantages over conventional semiconductor technologies. Downstream from DNA, the metabolome is an information-rich molecular system with diverse chemical dimensions which could be harnessed for information storage and processing. As a proof of principle of postgenomic data storage, here we demonstrate a workflow for representing abstract data in synthetic metabolomes. Our approach leverages robotic liquid handling for writing digital information into chemical mixtures, and mass spectrometry for extracting the data. We present several kilobyte-scale image datasets stored in synthetic metabolomes, which are decoded with accuracy exceeding 98-99% using multi-mass logistic regression. Cumulatively, >100,000 bits of digital image data was written into metabolomes. These early demonstrations provide insight into the benefits and limitations of postgenomic chemical information systems.

[1]  Stefan Lorkowski,et al.  Complexity of vitamin E metabolism. , 2016, World journal of biological chemistry.

[2]  Yinjie J. Tang,et al.  Pathway Confirmation and Flux Analysis of Central Metabolic Pathways in Desulfovibrio vulgaris Hildenborough using Gas Chromatography-Mass Spectrometry and Fourier Transform-Ion Cyclotron Resonance Mass Spectrometry , 2006, Journal of bacteriology.

[3]  Gregory Timp,et al.  Reading the primary structure of a protein with 0.07 nm3 resolution using a subnanometre-diameter pore. , 2016, Nature nanotechnology.

[4]  David S. Wishart,et al.  HMDB 4.0: the human metabolome database for 2018 , 2017, Nucleic Acids Res..

[5]  Simone Giannerini,et al.  DNA as information: at the crossroads between biology, mathematics, physics and chemistry , 2016, Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences.

[6]  Hui Sun,et al.  Mass spectrometry-based metabolomics: applications to biomarker and metabolic pathway research. , 2016, Biomedical chromatography : BMC.

[7]  Gianni Panagiotou,et al.  Navigating the Human Metabolome for Biomarker Identification and Design of Pharmaceutical Molecules , 2010, Journal of biomedicine & biotechnology.

[8]  Peter Dawyndt,et al.  Multifunctional sequence-defined macromolecules for chemical data storage , 2018, Nature Communications.

[9]  Gamage Upeksha Ganegoda,et al.  New Trends of Digital Data Storage in DNA , 2016, BioMed research international.

[10]  John Parkinson,et al.  The conservation and evolutionary modularity of metabolism , 2009, Genome Biology.

[11]  G. Church,et al.  Next-Generation Digital Information Storage in DNA , 2012, Science.

[12]  Dietmar Schomburg,et al.  MetaboliteDetector: comprehensive analysis tool for targeted and nontargeted GC/MS based metabolome analysis. , 2009, Analytical chemistry.

[13]  Pan-Jun Kim,et al.  Global metabolic interaction network of the human gut microbiota for context-specific community-scale analysis , 2017, Nature Communications.

[14]  Mick Watson,et al.  A Review of Bioinformatics Tools for Bio-Prospecting from Metagenomic Sequence Data , 2017, Front. Genet..

[15]  D. Kell,et al.  Mass Spectrometry Tools and Metabolite-specific Databases for Molecular Identification in Metabolomics , 2009 .

[16]  Jos Hermans,et al.  Direct electrical quantification of glucose and asparagine from bodily fluids using nanopores , 2018, Nature Communications.

[17]  Fumio Matsuda,et al.  Technical Challenges in Mass Spectrometry-Based Metabolomics. , 2016, Mass spectrometry.

[18]  B. Hammock,et al.  Mass spectrometry-based metabolomics. , 2007, Mass spectrometry reviews.

[19]  Douglas B. Kell,et al.  The metabolome 18 years on: a concept comes of age , 2016, Metabolomics.

[20]  John Hardy,et al.  Genome, transcriptome and proteome: the rise of omics data and their integration in biomedical sciences , 2016, Briefings Bioinform..

[21]  G A Nagana Gowda,et al.  Overview of mass spectrometry-based metabolomics: opportunities and challenges. , 2014, Methods in molecular biology.

[22]  Martin Kircher,et al.  Deep proteome and transcriptome mapping of a human cancer cell line , 2011, Molecular systems biology.

[23]  Gang Fu,et al.  PubChem Substance and Compound databases , 2015, Nucleic Acids Res..

[24]  Leila Motiei,et al.  Message in a molecule , 2016, Nature Communications.

[25]  M. Mann,et al.  Quantitative, high-resolution proteomics for data-driven systems biology. , 2011, Annual review of biochemistry.

[26]  Kazuki Saito,et al.  Modern plant metabolomics: advanced natural product gene discoveries, improved technologies, and future prospects. , 2015, Natural product reports.

[27]  Christopher Rose,et al.  Inscribed matter as an energy-efficient means of communication with an extraterrestrial civilization , 2004, Nature.

[28]  G. Vladimirov,et al.  Fourier transform ion cyclotron resonance (FT ICR) mass spectrometry: Theory and simulations. , 2016, Mass spectrometry reviews.

[29]  Christopher Rose,et al.  Parallelized Linear Classification with Volumetric Chemical Perceptrons , 2018, 2018 IEEE International Conference on Rebooting Computing (ICRC).

[30]  Daniel Weindl,et al.  Complexity of dopamine metabolism , 2013, Cell Communication and Signaling.

[31]  Ian D. Wilson,et al.  Managing the challenge of chemically reactive metabolites in drug development , 2011, Nature Reviews Drug Discovery.

[32]  Emily Pentzer,et al.  Beyond binary: optical data storage with 0, 1, 2, and 3 in polymer films , 2017 .

[33]  U. Sauer,et al.  Frontiers of high-throughput metabolomics. , 2017, Current opinion in chemical biology.

[34]  Yaniv Erlich,et al.  DNA Fountain enables a robust and efficient storage architecture , 2016, Science.

[35]  M E Belov,et al.  Zeptomole-sensitivity electrospray ionization--Fourier transform ion cyclotron resonance mass spectrometry of proteins. , 2000, Analytical chemistry.

[36]  Bonnie A. Sheriff,et al.  A 160-kilobit molecular electronic memory patterned at 1011 bits per square centimetre , 2007, Nature.