Mining constraint relationships and redundancies with association analysis for optimization problem formulation

Constraints are necessary in optimization problems to steer optimization algorithms away from solutions which are not feasible or practical. However, redundant constraints are often added, which needlessly complicate the problem's description. This article introduces a probabilistic method to identify redundant inequality constraints for black-box optimization problems. The method uses Jaccard similarity to find item groups where the occurrence of a single item implies the occurrence of all other items in the group. The remaining groups are then mined with association analysis. Furthermore, unnecessary constraints are classified as redundant owing to co-occurrence, implication or covering. These classifications are presented as rules (in readable text), to indicate the relationships among constraints. The algorithm is applied to mathematical problems and to the engineering design of a pressure vessel. It was found that the rules are informative and correct, based on the available samples. Limitations of the proposed methods are also discussed.

[1]  Nicolas Pasquier,et al.  Discovering Frequent Closed Itemsets for Association Rules , 1999, ICDT.

[2]  Tomasz Imielinski,et al.  Database Mining: A Performance Perspective , 1993, IEEE Trans. Knowl. Data Eng..

[3]  Gautam Mitra,et al.  Analysis of mathematical programming problems prior to applying the simplex algorithm , 1975, Math. Program..

[4]  P. Sumathi,et al.  A Comparative Study of Redundant Constraints Identification Methods in Linear Programming Problems , 2010 .

[5]  Douglass J. Wilde,et al.  Globally optimal design , 1978 .

[6]  Stanley Zionts,et al.  Techniques for Removing Nonbinding Constraints and Extraneous Variables from Linear Programming Problems , 1966 .

[7]  Ling Zhou,et al.  Efficient association rule mining among both frequent and infrequent items , 2007, Comput. Math. Appl..

[8]  Redundancy in nonlinear systems: A set covering approach. , 1999 .

[9]  Jaideep Srivastava,et al.  Selecting the right interestingness measure for association patterns , 2002, KDD.

[10]  J. Boot,et al.  On Trivial and Binding Constraints in Programming Problems , 1962 .

[11]  Philip S. Yu,et al.  AC-Close: Efficiently Mining Approximate Closed Itemsets by Core Pattern Recovery , 2006, Sixth International Conference on Data Mining (ICDM'06).

[12]  Pang-Ning Tan,et al.  Introduction To Data Mining”, Person Education, 2007 , 2015 .

[13]  Kemper Lewis,et al.  Foraging-Directed Adaptive Linear Programming : An Algorithm For Sloving Nonlinear Mixed Discrete/Continuous Design Problems , 1996 .

[14]  Jianhong Wu,et al.  Association Bundle - A New Pattern for Association Analysis , 2006, Sixth IEEE International Conference on Data Mining - Workshops (ICDMW'06).

[15]  J. F. McDonald,et al.  A degenerate extreme point strategy for the classification of linear constraints as redundant or necessary , 1989 .

[16]  Vipin Kumar,et al.  Association Analysis Techniques for Bioinformatics Problems , 2009, BICoB.

[17]  Stanley Zionts,et al.  Redundancy in Mathematical Programming , 1983 .

[18]  Sridhar Ramaswamy,et al.  Cyclic association rules , 1998, Proceedings 14th International Conference on Data Engineering.

[19]  M. Steinbach,et al.  Global teleconnections of climate to terrestrial carbon flux , 2003 .

[20]  G. G. Wang,et al.  Mode-pursuing sampling method for global optimization on expensive black-box functions , 2004 .

[21]  Tomasz Imielinski,et al.  Mining association rules between sets of items in large databases , 1993, SIGMOD Conference.

[22]  Jian Pei,et al.  Mining Frequent Patterns without Candidate Generation: A Frequent-Pattern Tree Approach , 2006, Sixth IEEE International Conference on Data Mining - Workshops (ICDMW'06).

[23]  Daniel T. Larose,et al.  Discovering Knowledge in Data: An Introduction to Data Mining , 2005 .

[24]  Hui Xiong,et al.  Hyperclique pattern discovery , 2006, Data Mining and Knowledge Discovery.

[25]  Robert L. Smith,et al.  Hit-and-run algorithms for the identification of nonredundant linear inequalities , 1987, Math. Program..

[26]  S. Shekhar,et al.  Global Teleconnections of Ocean Climate to Terrestrial Carbon Flux , 2002 .

[27]  M. Karwan,et al.  Redundancy in mathematical programming : a state-of-the-art survey , 1983 .