Some Aspect of Rule Discovery in Data Bases

Rule Discovery in Databases integrates machine learning, probabilistic techniques and database concepts to learn a range comprehensible knowledge in sparse, noisy and redundant data. The discovery enables the learning of rules from data and extract their underlying structure. In this paper, we present the probabilistic index and the notion of minimal set of discovered rules which enhance runtime performance, improve discovery accuracy, resist noise, converges with the size of the sample, and eliminates coarse and redundant rules. This index can be used within the framework of an incremental discovery system. In other words, in this paper, we describe the rule intensity measurement which is an index that answers the question ‘What is the probability of having a rule of the form ‘IF premise THEN Conclusion’; the premise and conclusion are conjunctions of propositions ?’