Realizing private and practical pharmacological collaboration

Sharing pharmaceutical research Increased collaboration will enhance our ability to predict new therapeutic drug candidates. Such data sharing is currently limited by concerns about intellectual property and competing commercial interests. Hie et al. introduce an end-to-end pipeline, using modern cryptographic tools, for secure pharmacological collaboration. Multiple entities can thus securely combine their private datasets to collectively obtain more accurate predictions of new drug-target interactions. The computational pipeline is practical, producing results with improved accuracy in a few days over a wide area network on a real dataset with more than a million interactions. Science, this issue p. 347 A computational protocol enables private pharmacological data to be securely combined. Although combining data from multiple entities could power life-saving breakthroughs, open sharing of pharmacological data is generally not viable because of data privacy and intellectual property concerns. To this end, we leverage modern cryptographic tools to introduce a computational protocol for securely training a predictive model of drug–target interactions (DTIs) on a pooled dataset that overcomes barriers to data sharing by provably ensuring the confidentiality of all underlying drugs, targets, and observed interactions. Our protocol runs within days on a real dataset of more than 1 million interactions and is more accurate than state-of-the-art DTI prediction methods. Using our protocol, we discover previously unidentified DTIs that we experimentally validated via targeted assays. Our work lays a foundation for more effective and cooperative biomedical research.

[1]  David Weininger,et al.  SMILES, a chemical language and information system. 1. Introduction to methodology and encoding rules , 1988, J. Chem. Inf. Comput. Sci..

[2]  Avi Wigderson,et al.  Completeness theorems for non-cryptographic fault-tolerant distributed computation , 1988, STOC '88.

[3]  Donald Beaver,et al.  Efficient Multiparty Protocols Using Circuit Randomization , 1991, CRYPTO.

[4]  D L Cheney,et al.  Design and structure-activity relationships of potent and selective inhibitors of blood coagulation factor Xa. , 1999, Journal of medicinal chemistry.

[5]  C. Supuran,et al.  Carbonic anhydrase inhibitors: synthesis of membrane-impermeant low molecular weight sulfonamides possessing in vivo selectivity for the membrane-bound versus cytosolic isozymes. , 2000, Journal of medicinal chemistry.

[6]  O. Witte,et al.  The BCR-ABL story: bench to bedside and back. , 2004, Annual review of immunology.

[7]  I-Lin Lu,et al.  Novel indole-based peroxisome proliferator-activated receptor agonists: design, SAR, structural biology, and biological activities. , 2005, Journal of medicinal chemistry.

[8]  J. Berger,et al.  Design and synthesis of potent and subtype-selective PPARα agonists , 2006 .

[9]  Cynthia Dwork,et al.  Calibrating Noise to Sensitivity in Private Data Analysis , 2006, TCC.

[10]  Kazuo Ohta,et al.  Multiparty Computation for Interval, Equality, and Comparison Without Bit-Decomposition Protocol , 2007, Public Key Cryptography.

[11]  N. Brooijmans,et al.  Novel purine and pyrazolo[3,4-d]pyrimidine inhibitors of PI3 kinase-alpha: Hit to lead studies. , 2010, Bioorganic & medicinal chemistry letters.

[12]  Charles C. Persinger,et al.  How to improve R&D productivity: the pharmaceutical industry's grand challenge , 2010, Nature Reviews Drug Discovery.

[13]  Octavian Catrina,et al.  Secure Computation with Fixed-Point Numbers , 2010, Financial Cryptography.

[14]  David Rogers,et al.  Extended-Connectivity Fingerprints , 2010, J. Chem. Inf. Model..

[15]  Patrick J. Curran,et al.  Discovery of novel imidazo[1,2-a]pyrazin-8-amines as Brk/PTK6 inhibitors. , 2011, Bioorganic & medicinal chemistry letters.

[16]  Masaki Kobayashi,et al.  An Mdm2 antagonist, Nutlin-3a, induces p53-dependent and proteasome-mediated poly(ADP-ribose) polymerase1 degradation in mouse fibroblasts. , 2011, Biochemical and biophysical research communications.

[17]  David S. Wishart,et al.  DrugBank 3.0: a comprehensive resource for ‘Omics’ research on drugs , 2010, Nucleic Acids Res..

[18]  Daniel Cressey,et al.  Traditional drug-discovery model ripe for reform , 2011, Nature.

[19]  Ivan Damgård,et al.  Multiparty Computation from Somewhat Homomorphic Encryption , 2012, IACR Cryptol. ePrint Arch..

[20]  W. Garten,et al.  Development of substrate analogue inhibitors for the human airway trypsin-like protease HAT. , 2011, Bioorganic & medicinal chemistry letters.

[21]  I. Khanna,et al.  Drug discovery in pharmaceutical industry: productivity challenges and trends. , 2012, Drug discovery today.

[22]  David Page,et al.  Area under the Precision-Recall Curve: Point Estimates and Confidence Intervals , 2013, ECML/PKDD.

[23]  Chee Keong Kwoh,et al.  Drug-target interaction prediction by learning from local information and neighbors , 2013, Bioinform..

[24]  Chang Liu,et al.  Predicting Drug–Target Interactions Using Probabilistic Matrix Factorization , 2013, J. Chem. Inf. Model..

[25]  Hao Ding,et al.  Collaborative matrix factorization with multiple similarities for predicting drug-target interactions , 2013, KDD.

[26]  Stratis Ioannidis,et al.  Privacy-preserving matrix factorization , 2013, CCS.

[27]  G. Giannini,et al.  Novel PARP-1 inhibitors based on a 2-propanoyl-3H-quinazolin-4-one scaffold. , 2014, Bioorganic & medicinal chemistry letters.

[28]  Xiang Zhang,et al.  Drug repositioning by integrating target information through a heterogeneous network model , 2014, Bioinform..

[29]  S. Reardon Pharma firms join NIH on drug development , 2014, Nature.

[30]  Ivan Damgård,et al.  Secure Multiparty Computation and Secret Sharing , 2015 .

[31]  Geoffrey E. Hinton,et al.  Deep Learning , 2015, Nature.

[32]  Sean K. Simmons,et al.  Enabling Privacy Preserving GWAS in Heterogeneous Human Populations. , 2016, RECOMB 2016.

[33]  Robert D. Finn,et al.  The Pfam protein families database: towards a more sustainable future , 2015, Nucleic Acids Res..

[34]  Damian Szklarczyk,et al.  STITCH 5: augmenting protein–chemical interaction networks with tissue and affinity data , 2015, Nucleic Acids Res..

[35]  Daniel R. Zerbino,et al.  Ensembl 2016 , 2015, Nucleic Acids Res..

[36]  J. Reindl,et al.  A New Nanobody-Based Biosensor to Study Endogenous PARP1 In Vitro and in Live Human Cells , 2016, PloS one.

[37]  Ian Goodfellow,et al.  Deep Learning with Differential Privacy , 2016, CCS.

[38]  Yao Lu,et al.  Oblivious Neural Network Predictions via MiniONN Transformations , 2017, IACR Cryptol. ePrint Arch..

[39]  Giuseppe Ateniese,et al.  Deep Models Under the GAN: Information Leakage from Collaborative Deep Learning , 2017, CCS.

[40]  Jian Peng,et al.  A network integration approach for drug-target interaction prediction and computational drug repositioning from heterogeneous information , 2017, Nature Communications.

[41]  Sarvar Patel,et al.  Practical Secure Aggregation for Privacy-Preserving Machine Learning , 2017, IACR Cryptol. ePrint Arch..

[42]  Farinaz Koushanfar,et al.  DeepSecure: Scalable Provably-Secure Deep Learning , 2017, 2018 55th ACM/ESDA/IEEE Design Automation Conference (DAC).

[43]  David J. Wu,et al.  Secure genome-wide association analysis using multiparty computation , 2018, Nature Biotechnology.