The cloud4health Project: Secondary Use of Clinical Data with Secure Cloud-Based Text Mining Services

Advances in translational and personalized medicine require the integration of multiple patient-related resources across different organizations. Secure cloud environments for large-scale data processing, storage and integration are therefore needed. The integration of clinical patient data is likewise indispensable for translational research. Although operational e-health record systems are established in most hospitals, many clinical and phenotypically relevant parameters can only be found in unstructured texts such as medical records and reports. To meet these challenges, the cloud4health project established a cloud-based text mining platform that facilitates information extraction from biomedical texts in a secure cloud environment. To comply with privacy regulations, general technical requirements and security rules for such a cloud installation were developed and implemented. Different clinical use cases show the wide range of applications for specific text mining services in a secure cloud environment. As application examples, two use cases that apply text mining technologies to pathology and surgery reports are examined in detail.
