iASiS: Towards Heterogeneous Big Data Analysis for Personalized Medicine

The vision of the iASiS project is to turn the wave of big biomedical data heading our way into actionable knowledge for decision makers. This is achieved by integrating data from disparate sources, including genomics, electronic health records and bibliography, and applying advanced analytics methods to discover useful patterns. The goal is to turn the large amounts of available data into actionable information that authorities can use to plan public health activities and policies. Integrating and analysing these heterogeneous sources of information will support better-informed decisions, allowing diagnosis and treatment to be personalised to each individual. The project offers a common representation schema for the heterogeneous data sources. The iASiS infrastructure converts clinical notes into usable data, combines them with genomic data, related bibliography, image data and more, and creates a global knowledge base. This facilitates the use of intelligent methods to discover useful patterns across different resources. Semantic integration of the data makes it possible to generate information that is rich, auditable and reliable. This information can be used to provide better care, reduce errors and increase confidence in sharing data, which in turn yields more insights and opportunities. The iASiS use cases explore data resources for two disease categories: dementia and lung cancer.
