论文信息 - The cost-minimizing inverse classification problem: a genetic algorithm approach

The cost-minimizing inverse classification problem: a genetic algorithm approach

Abstract We consider the inverse problem in classification systems described as follows. Given a set of prototype cases representing a set of categories, a similarity function, and a new case classified in some category, we find the cost-minimizing changes to the attribute values such that the case is reclassified as a member of a (different) preferred category. The problem is “inverse” because the usual mapping is from a case to its unknown category. The increased application of classification systems in business suggests that this inverse problem can be of significant benefit to decision makers as a form of sensitivity analysis. Analytic approaches to this inverse problem are difficult to formulate as the constraints are either not available or difficult to determine. To investigate this inverse problem, we develop several genetic algorithms and study their performance as problem difficulty increases. We develop a real genetic algorithm with feasibility control, a traditional binary genetic algorithm, and a steepest ascent hill climbing algorithm. In a series of simulation experiments, we compare the performance of these algorithms to the optimal solution as the problem difficulty increases (more attributes and classes). In addition, we analyze certain algorithm effects (level of feasibility control, operator design, and fitness function) to determine the best approach. Our results indicate the viability of the real genetic algorithm and the importance of feasibility control as the problem difficulty increases.

Michael V. Mannino | Murlidhar V. Koushik | M. Mannino

[1] Francesco Ricci,et al. Advanced metrics for class-driven similarity search , 1999, Proceedings. Tenth International Workshop on Database and Expert Systems Applications. DEXA 99.

[2] Jim Antonisse,et al. A New Interpretation of Schema Notation that Overtums the Binary Encoding Constraint , 1989, ICGA.

[3] Hyun Myung,et al. Evolutionary programming techniques for constrained optimization problems , 1997, IEEE Trans. Evol. Comput..

[4] John J. Grefenstette,et al. Optimization of Control Parameters for Genetic Algorithms , 1986, IEEE Transactions on Systems, Man, and Cybernetics.

[5] Katta G. Murty,et al. Exterior point algorithms for nearest points and convex quadratic programs , 1992, Math. Program..

[6] Ray Bareiss,et al. Concept Learning and Heuristic Classification in WeakTtheory Domains , 1990, Artif. Intell..

[7] David Avis,et al. A pivoting algorithm for convex hulls and vertex enumeration of arrangements and polyhedra , 1991, SCG '91.

[8] Zbigniew Michalewicz,et al. A Survey of Constraint Handling Techniques in Evolutionary Computation Methods , 1995 .

[9] ZakarauskasPierre,et al. Complexity Analysis for Partitioning Nearest Neighbor Searching Algorithms , 1996 .

[10] Khaled S. Al-Sultan,et al. A Tabu search approach to the clustering problem , 1995, Pattern Recognit..

[11] Steven Orla Kimbrough,et al. On automating candle lighting analysis: insight from search with genetic algorithms and approximate models , 1994, 1994 Proceedings of the Twenty-Seventh Hawaii International Conference on System Sciences.

[12] Herbert Edelsbrunner,et al. Algorithms in Combinatorial Geometry , 1987, EATCS Monographs in Theoretical Computer Science.

[13] Zbigniew Michalewicz,et al. Genetic Algorithms + Data Structures = Evolution Programs , 1996, Springer Berlin Heidelberg.