Rare Diseases Internet Information Retrieval and Knowledge Discovery

A rare or orphan disease (RD) is that which affects only an insignificant percentage of population, so traditionally has not received the effort devoted to common diseases. In common diseases there are healing or palliative treatments while there are over 4000 RD without a known cure. In Europe, RD affect between 27 and 36 million people (Rodwell & Aymé, 2013). Any progress in this field has a great social impact that translates directly into an improved quality of life of those affected by RD (Dragusin et al., 2011). Therefore, any advance in knowledge generation in the context of RD is welcome. One of the most important developments has come from data mining (see (Groth, 1998), and (Tomar & Agarwal, 2013)). The reduced amount of data makes the application of data mining techniques not easy in the field of RDs. Although for one disease the number of patients and data is reduced, they are numerous as a whole. Several initiatives collect data from RD patients around the world using the power of Internet (Orphadata, 2015). Given singularity of RDs, it is necessary to provide the information systems, the capacity to collect data and also promote cross-flow of information between the agents (patients and researchers). In this situation, it is interesting the development of projects like the one sponsored by the Avenzoar Chair (University of Seville, Spain), known as ER2.0 Project (see details in Rabasco et al. (2013a) and Rabasco et al. (2013b)). The underlying idea is to make it real to collaborate, share, and take decisions in a web 2.0 manner between patients and researchers in RDs. This chapter contains, first, a review of the information networks associated with RD, highlighting the registry, typology, associations and consortia related as well as the existing relationships between Rare Diseases Internet Information Retrieval and Knowledge Discovery

[1]  Petra Perner Intelligent data analysis in medicine - Recent advances , 2006, Artif. Intell. Medicine.

[2]  P. Facchin,et al.  The importance of helplines in National Plans , 2014, Orphanet Journal of Rare Diseases.

[3]  Divya Tomar,et al.  A survey on Data Mining approaches for Healthcare , 2013, BSBT 2013.

[4]  H. Koh,et al.  Data mining applications in healthcare. , 2005, Journal of healthcare information management : JHIM.

[5]  Nikola Bogunovic,et al.  Electrocardiogram analysis using a combination of statistical, geometric, and nonlinear heart rate variability features , 2011, Artif. Intell. Medicine.

[6]  Domenica Taruscio,et al.  Rare Disease Registries Classification and Characterization: A Data Mining Approach , 2015, Public Health Genomics.

[7]  Marie-Christine Jaulent,et al.  Sequential pattern mining to discover relations between genes and rare diseases , 2012, 2012 25th IEEE International Symposium on Computer-Based Medical Systems (CBMS).

[8]  Varun Kumar,et al.  An Empirical Study of the Applications of Web Mining Techniques in Health Care , 2011 .

[9]  Seyed Kamaledin Setarehdan,et al.  Support vector machine-based arrhythmia classification using reduced features of heart rate variability signal , 2008, Artif. Intell. Medicine.

[10]  David A. Hanauer,et al.  Data Mining for Identifying Novel Associations and Temporal Relationships with Charcot Foot , 2014, Journal of diabetes research.

[11]  Patrik Eklund,et al.  DIAGAID: a connectionist approach to determine the diagnostic value of clinical data , 1991, Artif. Intell. Medicine.

[12]  Blaz Zupan,et al.  Predictive data mining in clinical medicine: Current issues and guidelines , 2008, Int. J. Medical Informatics.

[13]  R. Geetha Ramani,et al.  Data Mining in Clinical Data Sets: A Review , 2012 .

[14]  Marcel H. Schulz,et al.  Bayesian ontology querying for accurate and noise-tolerant semantic searches , 2012, Bioinform..

[15]  S. Mundlos,et al.  The Human Phenotype Ontology , 2010, Clinical genetics.

[16]  M. Tuchman,et al.  A longitudinal study of urea cycle disorders. , 2014, Molecular genetics and metabolism.

[17]  R. Moxley,et al.  If you build a rare disease registry, will they enroll and will they use it? Methods and data from the National Registry of Myotonic Dystrophy (DM) and Facioscapulohumeral Muscular Dystrophy (FSHD). , 2012, Contemporary clinical trials.

[18]  Ole Winther,et al.  Rare Disease Diagnosis as an Information Retrieval Task , 2011, ICTIR.

[19]  Michael Brudno,et al.  PhenoTips: Patient Phenotyping Software for Clinical and Research Use , 2013, Human mutation.

[20]  Alex Alves Freitas,et al.  A constrained-syntax genetic programming system for discovering classification rules: application to medical data sets , 2004, Artif. Intell. Medicine.

[21]  N. Bogunovic,et al.  An overview of free software tools for general data mining , 2014, 2014 37th International Convention on Information and Communication Technology, Electronics and Microelectronics (MIPRO).

[22]  Jane Hunter,et al.  Mining Skeletal Phenotype Descriptions from Scientific Literature , 2013, PloS one.

[23]  B. Aronow,et al.  Orphan Diseases, Bioinformatics and Drug Discovery , 2012 .

[24]  Illhoi Yoo,et al.  Data Mining in Healthcare and Biomedicine: A Survey of the Literature , 2012, Journal of Medical Systems.

[25]  Jane Hunter,et al.  Decision Support Methods for Finding Phenotype — Disorder Associations in the Bone Dysplasia Domain , 2012, PloS one.

[26]  Kimberly J. Johnson,et al.  Development of an international internet-based neurofibromatosis Type 1 patient registry. , 2013, Contemporary clinical trials.

[27]  S. D. Gheware,et al.  Data Mining: Task, Tools, Techniques and Applications , 2014 .

[28]  Peter N. Robinson,et al.  Querying phenotype-genotype relationships on patient datasets using semantic web technology: the example of cerebrotendinous xanthomatosis , 2012, BMC Medical Informatics and Decision Making.

[29]  Kemal Polat,et al.  Detection of ECG Arrhythmia using a differential expert system approach based on principal component analysis and least square support vector machine , 2007, Appl. Math. Comput..

[30]  Warren T. Jones,et al.  Research Paper: Association Rules and Data Mining in Hospital Infection Control and Public Health Surveillance , 1998, J. Am. Medical Informatics Assoc..

[31]  G. Iskrov,et al.  National Registries of Rare Diseases in Europe: An Overview of the Current Situation and Experiences , 2014, Public Health Genomics.

[32]  Armando Blanco,et al.  DrugNet: Network-based drug-disease prioritization by integrating heterogeneous data , 2015, Artif. Intell. Medicine.

[33]  Monica Mazzucato,et al.  A population-based registry as a source of health indicators for rare diseases: the ten-year experience of the Veneto Region’s rare diseases registry , 2014, Orphanet Journal of Rare Diseases.