Empowering personalized medicine with big data and semantic web technology: Promises, challenges, and use cases

In healthcare, big data tools and technologies have the potential to create significant value by improving outcomes while lowering costs for each individual patient. Diagnostic images, genetic test results and biometric information are increasingly generated and stored in electronic health records presenting us with challenges in data that is by nature high volume, variety and velocity, thereby necessitating novel ways to store, manage and process big data. This presents an urgent need to develop new, scalable and expandable big data infrastructure and analytical methods that can enable healthcare providers access knowledge for the individual patient, yielding better decisions and outcomes. In this paper, we briefly discuss the nature of big data and the role of semantic web and data analysis for generating “smart data” which offer actionable information that supports better decision for personalized medicine. In our view, the biggest challenge is to create a system that makes big data robust and smart for healthcare providers and patients that can lead to more effective clinical decision-making, improved health outcomes, and ultimately, managing the healthcare costs. We highlight some of the challenges in using big data and propose the need for a semantic data-driven environment to address them. We illustrate our vision with practical use cases, and discuss a path for empowering personalized medicine using big data and semantic web technology.

[1]  Alan R. Aronson,et al.  Effective mapping of biomedical text to the UMLS Metathesaurus: the MetaMap program , 2001, AMIA.

[2]  Sanjay Ghemawat,et al.  MapReduce: Simplified Data Processing on Large Clusters , 2004, OSDI.

[3]  Amit P. Sheth,et al.  Semantics-Empowered Approaches to Big Data Processing for Physical-Cyber-Social Applications , 2013, AAAI Fall Symposia.

[4]  Joachim Roski,et al.  Creating value in health care through big data: opportunities and policy implications. , 2014, Health affairs.

[5]  Amit P. Sheth,et al.  PhylOnt: A domain-specific ontology for phylogeny analysis , 2012, 2012 IEEE International Conference on Bioinformatics and Biomedicine.

[6]  D. Juárez,et al.  Big data in pharmacy practice: current use, challenges, and the future , 2015, Integrated pharmacy research & practice.

[7]  Amit P. Sheth,et al.  Online Information Searching for Cardiovascular Diseases: An Analysis of Mayo Clinic Search Query Logs , 2014 .

[8]  Tripty Singh,et al.  A modern data architecture with apache Hadoop , 2015, 2015 International Conference on Green Computing and Internet of Things (ICGCIoT).

[9]  Casey Lynnette Overby,et al.  Personalized medicine: challenges and opportunities for translational bioinformatics. , 2013, Personalized medicine.

[10]  Suzette J. Bielinski,et al.  Applying semantic web technologies for phenome-wide scan using an electronic health record linked Biobank , 2012, Journal of Biomedical Semantics.

[11]  Tom White,et al.  Hadoop: The Definitive Guide , 2009 .

[12]  Maryam Panahiazar,et al.  Advancing data reuse in phyloinformatics using an ontology-driven Semantic Web approach , 2013, BMC Medical Genomics.

[13]  K. Jain Textbook of Personalized Medicine , 2009, Springer New York.

[14]  Amit P. Sheth,et al.  Kino: A Generic Document Management System for Biologists Using SA-REST and Faceted Search , 2011, 2011 IEEE Fifth International Conference on Semantic Computing.