A Concept-based Feature Extraction Approach

A concept has a perceived property and a set of constituents. The goal of this investigation is about extraction of meaningful relationships, if any, between the perceived property and the constituent’s attributes. Such meaningful relationships (features) may be used as a prediction tool. The presented methodology for extracting the features is based on the concept expansion. To the best of our knowledge, feature extractions based on a concept expansion approach, for use in data mining, has not been reported in the literature. The goal was met by introducing the b-concept, conceptualizing a universe of objects using b-concept, and generating the complete gamma-expansion (CGE) of the b-concepts. The features were extracted from CGEs as anchor prediction (AP) rules. The AP rules were crystalized by a sequence of horizontal-vertical reductions. The prediction powers of the AP rules and their crystalized version were investigated by: (i) using 10 pairs of training and test sets, and (ii) comparing their performances with the performance of the well-known ID3 approach over the same training and test sets. The results revealed that the AP rules and ID3 have similar performances. However, the crystallized prediction rules have a superior performance over the AP rules and ID3. The average of the correct prediction is up by 17%, the average of the false positive is down by 13%, and the average of false negative is up by 3%. In addition, the number of test objects that cannot be predicted is down by 7%. Keywords-b-concept; Concept expansion; Concept Analysis; Data Mining; Prediction Systems; and Crystallizing Prediction Rules.

[1]  Bernhard Ganter,et al.  Formal Concept Analysis: Mathematical Foundations , 1998 .

[2]  Rudolf Wille,et al.  Restructuring Lattice Theory: An Approach Based on Hierarchies of Concepts , 2009, ICFCA.

[3]  Vyvyan Evans,et al.  Lexical concepts, cognitive models and meaning-construction , 2006 .

[4]  L. Horsten Cuts and Clouds. Vagueness, its nature, and its logic , 2010 .

[5]  Zhendong Niu,et al.  Concept Based Query Expansion , 2013, 2013 Ninth International Conference on Semantics, Knowledge and Grids.

[6]  Ray R. Hashemi,et al.  Association Analysis of Alumni Giving: A Formal Concept Analysis , 2009, Int. J. Intell. Inf. Technol..

[7]  J. Ross Quinlan,et al.  Induction of Decision Trees , 1986, Machine Learning.

[8]  Lawrence W. Barsalou,et al.  Continuity of the conceptual system across species , 2005, Trends in Cognitive Sciences.

[9]  Vijay V. Raghavan,et al.  Association Mining and Formal Concept Analysis , 1998 .

[10]  Huan Liu,et al.  Efficient Feature Selection via Analysis of Relevance and Redundancy , 2004, J. Mach. Learn. Res..

[11]  W. Bruce Croft,et al.  Latent concept expansion using markov random fields , 2007, SIGIR.

[12]  Maokang Luo,et al.  Rough concept lattices and domains , 2009, Ann. Pure Appl. Log..

[13]  M. Danson Fuzzy concepts, scanty evidence, policy distance: the case for rigour and policy relevance in critical regional studies , 2007 .

[14]  Ray R. Hashemi,et al.  An Android Based Medication Reminder System: A Concept Analysis Approach , 2011, ICCS.

[15]  Ray R. Hashemi,et al.  Formal concept analysis for investigation of normal accidents , 2004, Int. J. Gen. Syst..