Realizing the potential of blockchain technologies in genomics

Genomics data introduce a substantial computational burden as well as data privacy and ownership issues. Data sets generated by high-throughput sequencing platforms require immense amounts of computational resources to align to reference genomes and to call and annotate genomic variants. This problem is even more pronounced if reanalysis is needed for new versions of reference genomes, which may impose high loads to existing computational infrastructures. Additionally, after the compute-intensive analyses are completed, the results are either kept in centralized repositories with access control, or distributed among stakeholders using standard file transfer protocols. This imposes two main problems: (1) Centralized servers become gatekeepers of the data, essentially acting as an unnecessary mediator between the actual data owners and data users; and (2) servers may create single points of failure both in terms of service availability and data privacy. Therefore, there is a need for secure and decentralized platforms for data distribution with user-level data governance. A new technology, blockchain, may help ameliorate some of these problems. In broad terms, the blockchain technology enables decentralized, immutable, incorruptible public ledgers. In this Perspective, we aim to introduce current developments toward using blockchain to address several problems in omics, and to provide an outlook of possible future implications of the blockchain technology to life sciences.

[1]  Taghreed Justinia,et al.  Blockchain Technologies: Opportunities for Solving Real-World Problems in Healthcare and Biomedical Sciences , 2019, Acta informatica medica : AIM : journal of the Society for Medical Informatics of Bosnia & Herzegovina : casopis Drustva za medicinsku informatiku BiH.

[2]  M. Shabani,et al.  DNA Data Marketplace: An Analysis of the Ethical Concerns Regarding the Participation of the Individuals , 2019, Front. Genet..

[3]  Gerben Menschaert,et al.  Beyond Genes: Re-Identifiability of Proteomic Data and Its Implications for Personalized Medicine , 2019, Genes.

[4]  J. Gans,et al.  Initial Coin Offerings and the Value of Crypto Tokens , 2018 .

[5]  Prateek Saxena,et al.  Finding The Greedy, Prodigal, and Suicidal Contracts at Scale , 2018, ACSAC.

[6]  Mahdi H. Miraz,et al.  Applications of Blockchain Technology Beyond Cryptocurrency , 2018, ArXiv.

[7]  Usman W. Chohan,et al.  The Decentralized Autonomous Organization and Governance Issues , 2017 .

[8]  Peter W. Eklund,et al.  Proof of Stake Blockchain: Performance and Scalability for Groupware Communications , 2017, MEDES.

[9]  L. Ohno-Machado,et al.  A community effort to protect genomic data sharing, collaboration and outsourcing , 2017, npj Genomic Medicine.

[10]  Murat Kantarcioglu,et al.  Blockchain: A Graph Primer , 2017, ArXiv.

[11]  Ricardo Neisse,et al.  A Blockchain-based Approach for Data Accountability and Provenance Tracking , 2017, ARES.

[12]  Jan Hendrik Witte,et al.  The Blockchain: A Gentle Four Page Introduction , 2016, ArXiv.

[13]  Jan Witte,et al.  The Blockchain: A Gentle Introduction , 2016 .

[14]  Siraj Raval,et al.  Decentralized Applications: Harnessing Bitcoin's Blockchain Technology , 2016 .

[15]  J. McPherson,et al.  Coming of age: ten years of next-generation sequencing technologies , 2016, Nature Reviews Genetics.

[16]  Don Tapscott,et al.  Blockchain Revolution: How the Technology Behind Bitcoin Is Changing Money, Business, and the World , 2016 .

[17]  Feng Hao,et al.  Refund Attacks on Bitcoin's Payment Protocol , 2016, Financial Cryptography.

[18]  Elaine Shi,et al.  On Scaling Decentralized Blockchains - (A Position Paper) , 2016, Financial Cryptography Workshops.

[19]  Can Alkan,et al.  Coinami: A Cryptocurrency with DNA Sequence Alignment as Proof-of-work , 2016, ArXiv.

[20]  Aviv Zohar,et al.  Optimal Selfish Mining Strategies in Bitcoin , 2015, Financial Cryptography.

[21]  M. Schatz,et al.  Big Data: Astronomical or Genomical? , 2015, PLoS biology.

[22]  Jeremy Clark,et al.  SoK: Research Perspectives and Challenges for Bitcoin and Cryptocurrencies , 2015, 2015 IEEE Symposium on Security and Privacy.

[23]  Takeshi Koshiba,et al.  Secure pattern matching using somewhat homomorphic encryption , 2013, CCSW.

[24]  Jean-Pierre Hubaux,et al.  Protecting and evaluating genomic privacy in medical tests and personalized medicine , 2013, WPES.

[25]  S. Salzberg,et al.  Repetitive DNA and next-generation sequencing: computational challenges and solutions , 2011, Nature Reviews Genetics.

[26]  Raymond K. Auerbach,et al.  The real cost of sequencing: higher than you think! , 2011, Genome Biology.

[27]  Craig Gentry,et al.  Fully homomorphic encryption using ideal lattices , 2009, STOC '09.

[28]  Vijay S. Pande,et al.  Folding@home: Lessons from eight years of volunteer distributed computing , 2009, 2009 IEEE International Symposium on Parallel & Distributed Processing.

[29]  P. Flicek,et al.  The need for speed , 2009, Genome Biology.

[30]  Tatsuya Akutsu,et al.  Dynamic programming algorithms for RNA secondary structure prediction with pseudoknots , 2000, Discret. Appl. Math..

[31]  Whitfield Diffie,et al.  New Directions in Cryptography , 1976, IEEE Trans. Inf. Theory.

[32]  Mattias Scherer,et al.  Performance and Scalability of Blockchain Networks and Smart Contracts , 2017 .

[33]  Vitalik Buterin A NEXT GENERATION SMART CONTRACT & DECENTRALIZED APPLICATION PLATFORM , 2015 .

[34]  M. Metzker Sequencing technologies — the next generation , 2010, Nature Reviews Genetics.

[35]  Craig Gentry,et al.  A fully homomorphic encryption scheme , 2009 .

[36]  S. Nakamoto,et al.  Bitcoin: A Peer-to-Peer Electronic Cash System , 2008 .

[37]  William E. Hart,et al.  Robust Proofs of NP-Hardness for Protein Folding: General Lattices and Energy Potentials , 1997, J. Comput. Biol..