Recent advances in computing, communications, digital storage, and high-throughput data acquisition make it possible to gather and store enormous volumes of data. This creates unprecedented opportunities for large-scale knowledge discovery from huge databases. Data mining (DM) technology has emerged as a means of performing this discovery. Many researchers are working on efficient data mining techniques, methods, and algorithms, and many of these have been developed and applied in a wide range of application fields [1]. Unfortunately, most data mining researchers concentrate on the technical problems of developing data mining models and methods, and pay little attention to the basic issues of data mining [2].
In this talk, some basic issues of data mining are addressed. What is data mining? What is the product of a data mining process? What are we doing in a data mining process? What rule should we obey in a data mining process? By analyzing existing data mining methods and domain-driven (or user-driven) data mining models [3-5], we find that a data mining process should be regarded as a process of knowledge transformation. Based on this understanding, a conceptual data mining model, domain-oriented data-driven data mining (3DM), is proposed [2]. The relationship between traditional domain-driven (or user-driven) data mining models and the proposed 3DM model is also analyzed. Several domain-oriented data-driven data mining algorithms are proposed for mining knowledge such as default rules [6], decision trees [7], and concept lattices [8] from databases. Experimental results for these algorithms are also presented to illustrate the efficiency of the 3DM data mining algorithms and the performance of the knowledge they acquire.
[1] Yiyu Yao, et al. Interactive classification using a granule network. Fourth IEEE Conference on Cognitive Informatics (ICCI 2005), 2005.
[2] Wang Yan, et al. Concept Lattice Based Data-Driven Uncertain Knowledge Acquisition. 2007.
[3] Chengqi Zhang, et al. Domain-Driven Data Mining: Methodologies and Applications. AMT, 2006.
[4] Yiyu Yao, et al. Web Intelligence Meets Brain Informatics. WImBI, 2006.
[5] Wang Guo-yin, et al. A Self-Learning Model under Uncertain Condition. 2003.
[6] Yu Wu, et al. Data-driven decision tree learning algorithm based on rough set theory. Proceedings of the 2005 International Conference on Active Media Technology (AMT 2005), 2005.
[7] Ch Chen, et al. Pattern recognition and artificial intelligence. 1976.
[8] Chengqi Zhang, et al. Domain-driven in-depth pattern discovery: A practical methodology. 2005.
[9] Guoyin Wang, et al. Domain-Oriented Data-Driven Data Mining (3DM): Simulation of Human Knowledge Understanding. WImBI, 2006.
[10] Philip S. Yu, et al. Top 10 algorithms in data mining. Knowledge and Information Systems, 2007.