Automatic classification of epilepsy types using ontology-based and genetics-based machine learning

OBJECTIVES In the presurgical analysis for drug-resistant focal epilepsies, the definition of the epileptogenic zone, which is the cortical area where ictal discharges originate, is usually carried out by using clinical, electrophysiological and neuroimaging data analysis. Clinical evaluation is based on the visual detection of symptoms during epileptic seizures. This work aims at developing a fully automatic classifier of epileptic types and their localization using ictal symptoms and machine learning methods. METHODS We present the results achieved by using two machine learning methods. The first is an ontology-based classification that can directly incorporate human knowledge, while the second is a genetics-based data mining algorithm that learns or extracts the domain knowledge from medical data in implicit form. RESULTS The developed methods are tested on a clinical dataset of 129 patients. The performance of the methods is measured against the performance of seven clinicians, whose level of expertise is high/very high, in classifying two epilepsy types: temporal lobe epilepsy and extra-temporal lobe epilepsy. When comparing the performance of the algorithms with that of a single clinician, who is one of the seven clinicians, the algorithms show a slightly better performance than the clinician on three test sets generated randomly from 99 patients out of the 129 patients. The accuracy obtained for the two methods and the clinician is as follows: first test set 65.6% and 75% for the methods and 56.3% for the clinician, second test set 66.7% and 76.2% for the methods and 61.9% for the clinician, and third test set 77.8% for the methods and the clinician. When compared with the performance of the whole population of clinicians on the rest 30 patients out of the 129 patients, where the patients were selected by the clinicians themselves, the mean accuracy of the methods (60%) is slightly worse than the mean accuracy of the clinicians (61.6%). Results show that the methods perform at the level of experienced clinicians, when both the methods and the clinicians use the same information. CONCLUSION Our results demonstrate that the developed methods form important ingredients for realizing a fully automatic classification of epilepsy types and can contribute to the definition of signs that are most important for the classification.

[1]  H. Lüders,et al.  Semiological Seizure Classification * , 1998, Epilepsia.

[2]  A. Covanis,et al.  Aura in temporal lobe epilepsy: clinical and electroencephalographic correlation. , 1983, Journal of neurology, neurosurgery, and psychiatry.

[3]  Vincenzo Catania,et al.  Genetic Tuning of Fuzzy Rule Deep Structures for Efficient Knowledge Extraction from Medical Data , 2006, 2006 IEEE International Conference on Systems, Man and Cybernetics.

[4]  N. Satyanandam Data Mining Machine Learning Approaches and Medical Diagnose Systems : A Survey , 2012 .

[5]  Geoffrey I. Webb,et al.  Encyclopedia of Machine Learning , 2011, Encyclopedia of Machine Learning.

[6]  Nadia Colombo,et al.  Temporal lobe epilepsy: neuropathological and clinical correlations in 243 surgically treated patients. , 2009, Epileptic disorders : international epilepsy journal with videotape.

[7]  Olivier Bodenreider,et al.  The Unified Medical Language System (UMLS): integrating biomedical terminology , 2004, Nucleic Acids Res..

[8]  J. R. Landis,et al.  The measurement of observer agreement for categorical data. , 1977, Biometrics.

[9]  Tong Heng Lee,et al.  Evolutionary computing for knowledge discovery in medical diagnosis , 2003, Artif. Intell. Medicine.

[10]  W. G. Cochran The $\chi^2$ Test of Goodness of Fit , 1952 .

[11]  Mohammad Reza Khayyambashi,et al.  Extracting Temporal Rules from Medical Data , 2009, 2009 International Conference on Computer Technology and Development.

[12]  Bernd Stieger,et al.  Ontology-based classification of unstructured information , 2010, 2010 Fifth International Conference on Digital Information Management (ICDIM).

[13]  José L. V. Mejino,et al.  A reference ontology for biomedical informatics: the Foundational Model of Anatomy , 2003, J. Biomed. Informatics.

[14]  Lucila Ohno-Machado,et al.  A Comparison of Machine Learning Methods for the Diagnosis of Pigmented Skin Lesions , 2001, J. Biomed. Informatics.

[15]  Fatemeh Hosseinkhah,et al.  Challenges in Data Mining on Medical Databases , 2009 .

[16]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[17]  N. F. Noy,et al.  Ontology Development 101: A Guide to Creating Your First Ontology , 2001 .

[18]  Sulabha S. Apte,et al.  Improved Study of Heart Disease Prediction System using Data Mining Classification Techniques , 2012 .

[19]  Rodolfo Ostos,et al.  Machine Learning Approach to Extract Diagnostic and Prognostic Thresholds: Application in Prognosis of Cardiovascular Mortality , 2012, Comput. Math. Methods Medicine.

[20]  Lino Nobili,et al.  Surgery for temporal lobe epilepsy in children: relevance of presurgical evaluation and analysis of outcome. , 2013, Journal of neurosurgery. Pediatrics.

[21]  L. Darrell Whitley,et al.  The GENITOR Algorithm and Selection Pressure: Why Rank-Based Allocation of Reproductive Trials is Best , 1989, ICGA.

[22]  Ian Horrocks,et al.  f-SWRL: A Fuzzy Extension of SWRL , 2005, ICANN.

[23]  Xin Yao,et al.  Neural-Based Learning Classifier Systems , 2008, IEEE Transactions on Knowledge and Data Engineering.

[24]  Alex A. Freitas,et al.  A survey of evolutionary algorithms for data mining and knowledge discovery , 2003 .

[25]  L. Nobili,et al.  Sleep-related hyperkinetic seizures: always a frontal onset? , 2005, Neurological Sciences.

[26]  Kwong-Sak Leung,et al.  Medical Data Mining Using Evolutionary Computation , 2022 .

[27]  Daniel Karlsson A Design and Prototype for a Decision-Support System in the Field of Urinary Tract Infections - Application of OpenGALEN Techniques for Indexing Medical Information , 2001, MedInfo.

[28]  Jian Wang,et al.  Automatic Seizure Detection Based on Support Vector Machines with Genetic Algorithms , 2006, SEAL.

[29]  Daniel L. Rubin,et al.  Using ontologies linked with geometric models to reason about penetrating injuries , 2006, Artif. Intell. Medicine.

[30]  Bernhard Ganter,et al.  Conceptual Structures: Logical, Linguistic, and Computational Issues , 2000, Lecture Notes in Computer Science.

[31]  Soni Jyoti,et al.  Predictive Data Mining for Medical Diagnosis: An Overview of Heart Disease Prediction , 2011 .

[32]  Pierre Jannin,et al.  Surgical models for computer-assisted neurosurgery , 2007, NeuroImage.

[33]  John F. Sowa,et al.  Ontology, Metadata, and Semiotics , 2000, ICCS.

[34]  Guilherme Carvalhal Ribas,et al.  Study of fetal and postnatal morphological development of the brain sulci. , 2013, Journal of neurosurgery. Pediatrics.

[35]  Ron Kohavi,et al.  A Study of Cross-Validation and Bootstrap for Accuracy Estimation and Model Selection , 1995, IJCAI.

[36]  Narayan Ganesan,et al.  Application of Neural Networks in Diagnosing Cancer Disease using Demographic Data , 2010 .

[37]  Tim Kovacs,et al.  Genetics-Based Machine Learning , 2012, Handbook of Natural Computing.

[38]  Pridi Siregar,et al.  Model-based diagnosis of brain disorders: a prototype framework , 1995, Artif. Intell. Medicine.

[39]  Giovanni Acampora,et al.  Ontology-based intelligent fuzzy agent for diabetes application , 2009, 2009 IEEE Symposium on Intelligent Agents.

[40]  O Bodenreider,et al.  Biomedical ontologies in action: role in knowledge management, data integration and decision support. , 2008, Yearbook of medical informatics.

[41]  I. Jensen,et al.  TEMPORAL LOBE EPILEPSY , 1976, Acta Neurochirurgica.

[42]  D. Darling,et al.  A Test of Goodness of Fit , 1954 .

[43]  Rogers P Pole The GALEN High Level Ontology , 1996 .

[44]  L Tassi,et al.  Surgical treatment of drug-resistant nocturnal frontal lobe epilepsy. , 2007, Brain : a journal of neurology.

[45]  Tommaso Di Noia,et al.  An end stage kidney disease predictor based on an artificial neural networks ensemble , 2013, Expert Syst. Appl..

[46]  Kevin Donnelly,et al.  SNOMED-CT: The advanced terminology and coding system for eHealth. , 2006, Studies in health technology and informatics.

[47]  Binod Prasad,et al.  A Comparative Study of Machine Learning Algorithms as Expert Systems in Medical Diagnosis (Asthma) , 2011 .

[48]  Holger Knublauch,et al.  The Protégé OWL Plugin: An Open Development Environment for Semantic Web Applications , 2004, SEMWEB.

[49]  Reinhold Haux,et al.  A Bayesian expert system for clinical detecting coronary artery disease , 2009 .

[50]  Christian S. Perone,et al.  Pyevolve: a Python open-source framework for genetic algorithms , 2009, SEVO.

[51]  Jason H. Moore,et al.  Learning classifier systems: a complete introduction, review, and roadmap , 2009 .

[52]  Harleen Kaur,et al.  The impact of data mining techniques on medical diagnostics , 2006, Data Sci. J..

[53]  F. Elevitch SNOMED CT: electronic health record enhances anesthesia patient safety. , 2005, AANA journal.

[54]  Ivan Bratko,et al.  Microarray data mining with visual programming , 2005, Bioinform..

[55]  Vili Podgorelec,et al.  Evolutionary design of decision trees for medical application , 2012, WIREs Data Mining Knowl. Discov..

[56]  H. Lan,et al.  SWRL : A semantic Web rule language combining OWL and ruleML , 2004 .

[57]  Jan Jantzen,et al.  An Aphasia Database on the Internet: A Model for Computer-Assisted Analysis in Aphasiology , 2000, Brain and Language.

[58]  Florin Gorunescu,et al.  Data Mining - Concepts, Models and Techniques , 2011, Intelligent Systems Reference Library.

[59]  Lei Zhang,et al.  Ontology-based Clustering Algorithm with Feature Weights , 2010 .

[60]  Philippe Kahane,et al.  Definition and localization of the epileptogenic zone The Bancaud and Talairach view on the epileptogenic zone : a working hypothesis , 2006 .