One of the main difficulties of pattern mining is to deal with items of different nature in the same itemset, which can occur in any domain except basket analysis. Indeed, if we consider the analysis of any transactional database composed by several entities and relationships, it is easy to understand that the equality function may be different for each element, which difficult the identification of frequent patterns. This situation is just one example of the need for using domain knowledge to manage the discovery process, but several other, no less important can be enumerated, such the need to consider patterns at higher levels of abstraction or the ability to deal with structured data. In this paper, we show how the Onto4AR framework can be explored to overcome these situations in a natural way, illustrating its use in the analysis of two distinct case studies. In the first one, exploring a cinematographic dataset, we capture patterns that characterize kinds of movies in accordance to the actors present in their casts and their roles. In the second one, identifying molecular fragments, we find structured patterns, including chains, rings and stars.
[1]
Ramakrishnan Srikant,et al.
Mining Association Rules with Item Constraints
,
1997,
KDD.
[2]
Tomasz Imielinski,et al.
Mining association rules between sets of items in large databases
,
1993,
SIGMOD Conference.
[3]
Roberto J. Bayardo,et al.
The many roles of constraints in data mining
,
2002
.
[4]
Cláudia Antunes,et al.
Using Context-Free Grammars to Constrain Apriori-based Algorithms for Mining Temporal Association Rules
,
2002
.
[5]
Cláudia Antunes,et al.
Constraint Relaxations for Discovering Unknown Sequential Patterns
,
2004,
KDID.
[6]
Thomas R. Gruber,et al.
A translation approach to portable ontology specifications
,
1993,
Knowl. Acquis..
[7]
Cláudia Antunes.
Onto4AR: a framework for mining association rules
,
2007
.
[8]
Steffen Staab,et al.
Ontology Learning for the Semantic Web
,
2002,
IEEE Intell. Syst..
[9]
Kyuseok Shim,et al.
SPIRIT: Sequential Pattern Mining with Regular Expression Constraints
,
1999,
VLDB.