Ants algorithm is a universal and flexible solution which was first designed for solving optimization problem such as Traveling Salesman Problem. Analogy between finding the shortest way by ants and finding documents most alike, became a stimulus of ant based text document clustering method. This method consist of two phases, which are finding documents most alike (trial phase) and clusters making (dividing phase). In this paper, we implemented ant based document clustering method on 253 news documents in Indonesian language. Beside that, we developed enhanced confix stripping stemmer as an improvement of confix stripping stemmer for stemming news documents in Indonesian language. Result of the experiments proved that ants algorithm can be applied for classification of news document in Indonesian language, with the best Fmeasure achieved from experiments was 0.86. The experiments also showed that enhanced confix stripping stemmer had been succesfully solved confix stripping stemmer’s problems and reduce terms size up to 32.66%, while confix stripping stemmer only reduce 30.95%.
[1]
Gerald Salton,et al.
Automatic text processing
,
1988
.
[2]
Marco Dorigo,et al.
Ant system: optimization by a colony of cooperating agents
,
1996,
IEEE Trans. Syst. Man Cybern. Part B.
[3]
Paul J. Deitel,et al.
Java How to Program, Fifth Edition
,
2002
.
[4]
Lukasz Machnik,et al.
Documents Clustering techniques
,
2004,
Ann. UMCS Informatica.
[5]
Łukasz Machnik.
Documents clustering method based on Ants Algorithms
,
2006
.
[6]
Lukasz Machnik,et al.
ACO documents clustering - details of processing and results of experiments
,
2006,
Ann. UMCS Informatica.
[7]
Abdelmalek Amine,et al.
Evaluation and comparison of concept based and n-grams based text clustering using SOM
,
2008
.