Paper information - Classification of epidemiological data: a comparison of genetic algorithm and decision tree approaches

This paper describes an application of genetic algorithms (GAs) to the classification of epidemiological data, which is often hard to classify because of noise and other factors. For such complex data, which requires a large number of very specific rules to achieve high accuracy, smaller rule sets composed of more general rules may be preferable even if they are less accurate. The GA presented in the paper lets the user encourage smaller rule sets by setting a parameter. The rule sets it finds are compared with those created by standard decision-tree algorithms. The results illustrate tradeoffs among the number of rules, descriptive accuracy, predictive accuracy, and accuracy in describing and predicting positive examples across different rule sets.
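The abstract does not say how the size parameter enters the GA. As a rough illustration only, one common way to favor smaller rule sets is to subtract a per-rule cost from accuracy in the fitness function. The sketch below assumes that form; the names `fitness` and `size_penalty`, the rule representation, and the four-example toy dataset are all hypothetical and are not taken from the paper.

```python
from typing import Callable, Sequence

# Assumption: a rule is a predicate over one example (a dict of attributes).
Rule = Callable[[dict], bool]


def predicts_positive(rules: Sequence[Rule], example: dict) -> bool:
    """A rule set labels an example positive if any of its rules fires."""
    return any(rule(example) for rule in rules)


def accuracy(rules: Sequence[Rule], examples: Sequence[dict],
             labels: Sequence[bool]) -> float:
    """Fraction of examples whose predicted label matches the true label."""
    correct = sum(predicts_positive(rules, x) == y
                  for x, y in zip(examples, labels))
    return correct / len(examples)


def fitness(rules: Sequence[Rule], examples: Sequence[dict],
            labels: Sequence[bool], size_penalty: float) -> float:
    """Hypothetical fitness: accuracy minus a fixed cost per rule."""
    return accuracy(rules, examples, labels) - size_penalty * len(rules)


# Made-up toy data, for illustration only.
examples = [
    {"age": 62, "smoker": True},
    {"age": 45, "smoker": True},
    {"age": 70, "smoker": False},
    {"age": 30, "smoker": False},
]
labels = [True, True, True, False]

# One general rule versus a larger, more accurate rule set.
general = [lambda x: x["smoker"]]
specific = [lambda x: x["smoker"], lambda x: x["age"] > 65]

for penalty in (0.0, 0.3):
    print(penalty,
          fitness(general, examples, labels, penalty),
          fitness(specific, examples, labels, penalty))
```

With no penalty the two-rule set scores higher because it classifies every toy example correctly; with a penalty of 0.3 per rule the single general rule scores higher despite its lower accuracy, which is the kind of tradeoff the abstract describes.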

Clare Bates Congdon

[1] J. C. Bean. Genetics and random keys for sequencing and optimization, 1993.

[2] Usama M. Fayyad, et al. Multi-Interval Discretization of Continuous-Valued Attributes for Classification Learning, 1993, IJCAI.

[3] Richard C. Strohman, et al. Ancient Genomes, Wise Bodies, Unhealthy People: Limits of a Genetic Paradigm in Biology and Medicine, 2015, Perspectives in Biology and Medicine.

[4] Clare Bates Congdon, et al. A comparison of genetic algorithms and other machine learning systems on a complex classification task from common disease research, 1995.

[5] Catherine Blake, et al. UCI Repository of machine learning databases, 1998.

[6] James C. Bean, et al. Genetic Algorithms and Random Keys for Sequencing and Optimization, 1994, INFORMS J. Comput.

[7] U. Fayyad. On the induction of decision trees for multiple concept learning, 1991.

[8] James C. Bean, et al. Random keys for job shop scheduling, 1993.