This paper presents an original solution that offers necessary functionalities for design, implementation or simple evaluation of various text mining techniques based on Java library called JBOWL. This library was designed as open source API to support different phases of the whole text mining process and offers a wide range of relevant classification and clustering algorithms. JBOWL is particularly useful for enhancing existing software applications with text mining capabilities, as well as for support of practical education of text mining and its exploitation. In this paper we present two particular cases where JBOWL has been successfully integrated and tailored for specific way of exploitation. First case presents integration of JBOWL within collaborative application called KP-Lab System and the second one is a web-based system for education purposes. The proposed solution supports the whole text mining process, starting from creation of a corpus of relevant documents, application of various pre-processing methods, up to creation of text mining models in a form of classifiers and evaluation of the obtained models. The execution of different tasks in the same time is supported by task-based execution engine, which provides middleware-like transparent layer for distributed execution. Evaluation of developed solution was realized within the university course called Knowledge management. This course is organized at the Department of Cybernetics and Artificial Intelligence, Faculty of Electrical Engineering and Informatics, Technical University of Kosice. The paper also describes performed experiments and their results.
[1]
Nicolas Spyratos,et al.
Analyses of Knowledge Creation Processes Based on Different Types of Monitored Data
,
2009,
ISMIS.
[2]
Jan. Paralic,et al.
Java Library for Support of Text Mining and Retrieval
,
.
[3]
Steffen Staab,et al.
An Extensible Ontology Software Environment
,
2004,
Handbook on Ontologies.
[4]
Shusaku Tsumoto,et al.
Foundations of Intelligent Systems, 15th International Symposium, ISMIS 2005, Saratoga Springs, NY, USA, May 25-28, 2005, Proceedings
,
2005,
ISMIS.
[5]
F. Babic,et al.
Distributed task-based execution engine for support of text-mining processes
,
2009,
2009 7th International Symposium on Applied Machine Intelligence and Informatics.
[6]
Jozef Wagner,et al.
TRIALOGICAL LEARNING IN PRACTICE
,
2008
.
[7]
Ján Paralic,et al.
Knowledge Enhanced E-government Portal
,
2003,
KMGov.
[8]
Michael W. Berry,et al.
Survey of Text Mining
,
2003,
Springer New York.
[9]
Karol Furdik,et al.
Meta-learning Method for Authomatic Selection of Algorithms for Text Classification
,
2008
.
[10]
M. Sarnovsky,et al.
Text mining workflows construction with support of ontologies
,
2008,
2008 6th International Symposium on Applied Machine Intelligence and Informatics.