Classification algorithms for knowledge mapping of expert in Energy industry

Data classification is one technique of data mining using for creating knowledge map. The knowledge map describes who has what tacit knowledge and where the knowledge is collected. Knowledge map further helps employees to learn the jobs and expertise in organization. In many organizations there is a lack of guidelines for knowledge management and knowledge mapping. The objectives of this paper is to propose a classification algorithm for creating knowledge map of experts in an organization. The data set using in this paper are in the domain of energy experts. The performances of four classification algorithms are measured by comparing their prediction powers on expert classification. Four candidate algorithms include two types of Decision Trees (ID3 and C4.5), and two Rule-based (OneR and Prism). Results show that C4.5 algorithm is the best one, while ID3 is the last one in classification for knowledge mapping. Future research and implication are also suggested.

[1]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[2]  Geoffrey Holmes,et al.  A diagnostic tool for tree based supervised classification learning algorithms , 1999, ICONIP'99. ANZIIS'99 & ANNES'99 & ACNN'99. 6th International Conference on Neural Information Processing. Proceedings (Cat. No.99EX378).

[3]  Gerhard Knolmayer,et al.  Document Classification Methods for Organizing Explicit Knowledge , 2002 .

[4]  Martin J. Eppler Toward a Pragmatic Taxonomy of Knowledge Maps: Classification Principles, Sample Typologies, and Application Examples , 2006, Tenth International Conference on Information Visualisation (IV'06).

[5]  Salvatore Ruggieri,et al.  Efficient C4.5 , 2002, IEEE Trans. Knowl. Data Eng..

[6]  Sébastien Thomassey,et al.  A hybrid sales forecasting system based on clustering and decision trees , 2006, Decis. Support Syst..

[7]  Martin J. Eppler Making knowledge visible through intranet knowledge maps: concepts, elements, cases , 2001, Proceedings of the 34th Annual Hawaii International Conference on System Sciences.

[8]  Jean Tague-Sutcliffe Measuring the informativeness of a retrieval process , 1992, SIGIR '92.

[9]  Michael Negnevitsky,et al.  Artificial Intelligence: A Guide to Intelligent Systems , 2001 .

[10]  Tzung-Pei Hong,et al.  Two-phase PRISM learning algorithms , 1997, 1997 IEEE International Conference on Systems, Man, and Cybernetics. Computational Cybernetics and Simulation.

[11]  Robert R. Korfhage,et al.  Information Storage and Retrieval , 1963 .

[12]  Vipin Kumar,et al.  Introduction to Data Mining , 2022, Data Mining and Machine Learning Applications.

[13]  Fu-Ren Lin,et al.  Knowledge map creation and maintenance for virtual communities of practice , 2006, Inf. Process. Manag..

[14]  Richard J. Roiger,et al.  Data Mining: A Tutorial Based Primer , 2002 .

[15]  Aiko M. Hormann,et al.  Programs for Machine Learning. Part I , 1962, Inf. Control..

[16]  Remo Aslak Burkhard,et al.  Beyond Excel and Powerpoint: knowledge maps for the transfer and creation of knowledge in organizations , 2005, Ninth International Conference on Information Visualisation (IV'05).

[17]  Vijay V. Raghavan,et al.  A critical investigation of recall and precision as measures of retrieval system performance , 1989, TOIS.

[18]  Zhe Huang,et al.  Applying AI technology and rough set theory to mine association rules for supporting knowledge management , 2003, Proceedings of the 2003 International Conference on Machine Learning and Cybernetics (IEEE Cat. No.03EX693).