Supporting Translational Research on Inherited Cardiomyopathies through Information Technology

OBJECTIVES The INHERITANCE project, funded by the European Commission, is aimed at studying genetic or inherited Dilated cardiomyopathies (DCM) and at understanding the impact and management of the disease within families that suffer from heart conditions that are caused by DCMs. The biomedical informatics research activity of the project aims at implementing information technology solutions to support the project team in the different phases of their research, in particular in genes screening prioritization and new gene-disease association discovery. METHODS In order to manage the huge quantity of scientific, clinical and patient data generated by the project several advanced biomedical informatics tools have been developed. The paper describes a layer of software instruments to support translation of the results of the project in clinical practice as well as to support the scientific discovery process. This layer includes data warehousing, intelligent querying of the phenotype data, integrated search of biological data and knowledge repositories, text mining of the relevant literature, and case based reasoning. RESULTS At the moment, a set of 1,394 patients and 9,784 observations has been stored into the INHERITANCE data warehouse. The literature database contains more than 1,100,000 articles retrieved from the Pubmed and generically related to cardiac diseases, already analyzed for extracting medical concepts and genes. CONCLUSIONS After two years of project the data warehouse has been completely set up and the text mining tools for automatic literature analysis have been implemented and tested. A first prototype of the decision support tool for knowledge discovery and gene prioritization is available, but a more complete release is still under development.

[1]  Blaz Zupan,et al.  On Quality of Different Annotation Sources for Gene Expression Analysis , 2009, AIME.

[2]  D. Lindberg,et al.  Unified Medical Language System , 2020, Definitions.

[3]  Tatiana A. Tatusova,et al.  Entrez Gene: gene-centered information at NCBI , 2004, Nucleic Acids Res..

[4]  Mobyen Uddin Ahmed,et al.  Case-Based Reasoning Systems in the Health Sciences: A Survey of Recent Trends and Developments , 2011, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[5]  Blaz Zupan,et al.  Text Mining approaches for Automated Literature Knowledge Extraction and Representation , 2010, MedInfo.

[6]  E. Perakslis,et al.  Effective knowledge management in translational medicine , 2010, Journal of Translational Medicine.

[7]  D. Lipman,et al.  National Center for Biotechnology Information , 2019, Springer Reference Medizin.

[8]  James J. Cimino,et al.  Towards the development of a conceptual distance metric for the UMLS , 2004, J. Biomed. Informatics.

[9]  Eloisa Arbustini,et al.  Classification of the cardiomyopathies: a position statement from the European Society Of Cardiology Working Group on Myocardial and Pericardial Diseases. , 2007, European heart journal.

[10]  Frank Leymann,et al.  Web Services , 2004, Informatik-Spektrum.

[11]  Christine E Seidman,et al.  The genetic basis for cardiac remodeling. , 2005, Annual review of genomics and human genetics.

[12]  George Hripcsak,et al.  Inter-patient distance metrics using SNOMED CT defining relationships , 2006, J. Biomed. Informatics.

[13]  B. Barnes,et al.  Review: The Data Warehouse Toolkit (Second Edition) , 2003 .

[14]  Riccardo Bellazzi,et al.  An ICT infrastructure to integrate clinical and molecular data in oncology research , 2012, BMC Bioinformatics.

[15]  Griffin M. Weber,et al.  Serving the enterprise and beyond with informatics for integrating biology and the bedside (i2b2) , 2010, J. Am. Medical Informatics Assoc..

[16]  Martijn J. Schuemie,et al.  Structuring and extracting knowledge for the support of hypothesis generation in molecular biology , 2009, BMC Bioinformatics.

[17]  Y. Pinto,et al.  Primary prevention of sudden death in patients with lamin A/C gene mutations. , 2006, The New England journal of medicine.

[18]  L. Tavazzi,et al.  Long-term outcome and risk stratification in dilated cardiolaminopathies. , 2008, Journal of the American College of Cardiology.