A Trusted Healthcare Data Analytics Cloud Platform

This paper presents a cloud-based system for health care applications. The system provides advanced privacy-preserving features, which are essential for health care applications that handle confidential data. We describe some of the bioinformatics applications for which the system is designed. Caching significantly improves performance, and enhanced clients that perform part of the computation are a key component of the system. The cloud, with its pay-as-you-go pricing and API-based deployment model, has become widely used for delivering and maintaining infrastructure technology for businesses. However, using the cloud for applications with strict privacy and compliance requirements poses significant challenges, and health care applications fall into this domain. This paper describes an architecture and solutions for handling these types of applications.
