Inferring characteristic phenotypes via class association rule mining in the bone dysplasia domain

Finding, capturing and describing characteristic features represents a key aspect in disorder definition, diagnosis and management. This process is particularly challenging in the case of rare disorders, due to the sparse nature of data and expertise. From a computational perspective, finding characteristic features is associated with some additional major challenges, such as formulating a computationally tractable definition, devising appropriate inference algorithms or defining sound validation mechanisms. In this paper we aim to deal with each of these problems in the context provided by the skeletal dysplasia domain. We propose a clear definition for characteristic phenotypes, we experiment with a novel, class association rule mining algorithm and we discuss our lessons learned from both an automatic and human-based validation of our approach.

[1]  L. Ohno-Machado Journal of Biomedical Informatics , 2001 .

[2]  Peter I. Cowling,et al.  MCAR: multi-class classification based on association rule , 2005, The 3rd ACS/IEEE International Conference onComputer Systems and Applications, 2005..

[3]  Mark A. Musen,et al.  The Open Biomedical Annotator , 2009, Summit on translational bioinformatics.

[4]  Jiawei Han,et al.  Frequent pattern mining: current status and future directions , 2007, Data Mining and Knowledge Discovery.

[5]  Ramakrishnan Srikant,et al.  Fast Algorithms for Mining Association Rules in Large Databases , 1994, VLDB.

[6]  Jian Pei,et al.  CMAR: accurate and efficient classification based on multiple class-association rules , 2001, Proceedings 2001 IEEE International Conference on Data Mining.

[7]  Bernhard Pfeifer,et al.  A new rule-based algorithm for identifying metabolic markers in prostate cancer using tandem mass spectrometry , 2008, Bioinform..

[8]  Adam Wright,et al.  An automated technique for identifying associations between medications, laboratory results and problems , 2010, J. Biomed. Informatics.

[9]  Ji Hoon Kang,et al.  Association Rule Mining and Network Analysis in Oriental Medicine , 2013, PloS one.

[10]  Susan Jensen Mining Medical Data for Predictive and Sequential patterns : PKDD 2001 , .

[11]  Carol Friedman,et al.  Mining multi-item drug adverse effect associations in spontaneous reporting systems , 2010, BMC Bioinformatics.

[12]  Durga Toshniwal,et al.  Association Rule for Classification of Type-2 Diabetic Patients , 2010, 2010 Second International Conference on Machine Learning and Computing.

[13]  José A. Reyes,et al.  Prediction of protein-protein interaction types using association rule based classification , 2009, BMC Bioinformatics.

[14]  Qinbao Song,et al.  A Weighted Voting-Based Associative Classification Algorithm , 2010, Comput. J..

[15]  Elpiniki I. Papageorgiou,et al.  A new methodology for Decisions in Medical Informatics using fuzzy cognitive maps based on fuzzy rule-extraction techniques , 2011, Appl. Soft Comput..

[16]  P. Robinson,et al.  The Human Phenotype Ontology: a tool for annotating and analyzing human hereditary disease. , 2008, American journal of human genetics.

[17]  Chris Mungall,et al.  Phenotype ontologies: the bridge between genomics and evolution. , 2007, Trends in ecology & evolution.

[18]  Sheila Unger,et al.  Nosology and Classification of Genetic Skeletal Disorders: 2010 Revision , 2011, American journal of medical genetics. Part A.

[19]  M. Cevdet Ince,et al.  An expert system for detection of breast cancer based on association rules and neural network , 2009, Expert Syst. Appl..