A Big Data Architecture for the Extraction and Analysis of EHR Data

In the current Italian eHealth scenario, a national IT platform has been designed and developed with the purpose of ensuring the interoperability between the various Electronic Health Record (EHR) systems that have been adopted in the different regions of the country, according to the requirements provided by Italian Laws. In this way, the healthcare providers and the policy makers can acquire and process the data of a patient despite its initial format and source, allowing an improved quality of patient care and optimizing the management of the financial resources. To further exploit this huge resource of health and social data, it is very important to allow the extraction of the complex information buried under the Big Data source enabled by the EHRs, providing the physicians, the researchers and public health policy makers with innovative instruments. Meeting this need is not a trivial task, due to the difficulties of processing different document formats and processing Natural Language text, alongside to the problems related to the data size. In this paper we propose a Big Data architecture that is able to extract information from the documents acquired by the EHRs, integrate and process them, providing a set of valuable data for both physicians and patients, as well as decision makers.

[1]  Rui Dai,et al.  Classifying medical relations in clinical text via convolutional neural networks , 2018, Artif. Intell. Medicine.

[2]  Yasin N. Silva,et al.  SQL: From Traditional Databases to Big Data , 2016, SIGCSE.

[3]  Naveen K. Chilamkurti,et al.  A data integration platform for patient-centered e-healthcare and clinical decision support , 2019, Future Gener. Comput. Syst..

[4]  Achim Lang,et al.  e-Health policy and deployment activities in Europe. , 2011, Telemedicine journal and e-health : the official journal of the American Telemedicine Association.

[5]  Joseph K. Bradley,et al.  Spark SQL: Relational Data Processing in Spark , 2015, SIGMOD Conference.

[6]  Scott Shenker,et al.  Spark: Cluster Computing with Working Sets , 2010, HotCloud.

[7]  Jeffrey Dean,et al.  Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.

[8]  Giuseppe De Pietro,et al.  A Technological Framework for EHR Interoperability: Experiences from Italy , 2016, ICT4AgeingWell.

[9]  Steven Bethard,et al.  A Survey on Recent Advances in Named Entity Recognition from Deep Learning models , 2018, COLING.

[10]  Giuseppe De Pietro,et al.  Deep neural network for hierarchical extreme multi-label text classification , 2019, Appl. Soft Comput..

[11]  Mario Ciampi,et al.  A clustering based methodology to support the translation of medical specifications to software models , 2018, Appl. Soft Comput..

[12]  Kenneth D. Mandl,et al.  SMART on FHIR: a standards-based, interoperable apps platform for electronic health records , 2016, J. Am. Medical Informatics Assoc..

[13]  Sandro José Rigo,et al.  Toward a Model for Personal Health Record Interoperability , 2019, IEEE Journal of Biomedical and Health Informatics.

[14]  Parisa Rashidi,et al.  Deep EHR: A Survey of Recent Advances in Deep Learning Techniques for Electronic Health Record (EHR) Analysis , 2017, IEEE Journal of Biomedical and Health Informatics.

[15]  Sharon E. Straus,et al.  Personal health records: a scoping review , 2011, J. Am. Medical Informatics Assoc..

[16]  Giuseppe De Pietro,et al.  Towards Interoperability of EHR Systems: The Case of Italy , 2016, ICT4AgeingWell.

[17]  Andrew McCallum,et al.  Dynamic conditional random fields: factorized probabilistic models for labeling and segmenting sequence data , 2004, J. Mach. Learn. Res..

[18]  Konstantinos Moustakas,et al.  Integrating an openEHR-based personalized virtual model for the ageing population within HBase , 2019, BMC Medical Informatics and Decision Making.

[19]  Guillaume Lample,et al.  Neural Architectures for Named Entity Recognition , 2016, NAACL.

[20]  Adam Wright,et al.  What makes an EHR "open" or interoperable? , 2015, J. Am. Medical Informatics Assoc..