Heuristic Organization and Domain Analysis of Software Repositories

This paper addresses the problem of software classification. Building on an algorithm that classifies software by clustering a software repository, we propose a method that performs automatic domain analysis on a given software collection. The method relies on observations made after performing a tedious manual domain analysis. It also requires a heuristic that can compute the distance between two software components without any intervention from the reuser.
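
To make the role of the distance heuristic concrete, the following is a minimal sketch and not the method described in this paper. It assumes, purely for illustration, that a component is given as source text, that the distance between two components is the Jaccard distance between their sets of identifiers, and that clustering is single-link merging under a fixed threshold. The names `identifiers`, `distance`, `cluster`, and `threshold` are hypothetical.

```python
from __future__ import annotations

import re
from itertools import combinations


def identifiers(source: str) -> set[str]:
    """Lower-cased identifier-like tokens of three or more characters."""
    return {tok.lower() for tok in re.findall(r"[A-Za-z_][A-Za-z0-9_]{2,}", source)}


def distance(a: str, b: str) -> float:
    """Assumed heuristic: Jaccard distance between identifier sets.

    Returns 0.0 for identical sets and 1.0 for disjoint sets. It needs no
    input from the reuser, only the text of the two components.
    """
    ia, ib = identifiers(a), identifiers(b)
    if not ia and not ib:
        return 0.0
    return 1.0 - len(ia & ib) / len(ia | ib)


def cluster(components: dict[str, str], threshold: float) -> list[set[str]]:
    """Single-link clustering: merge two groups while any cross pair of
    components is closer than `threshold` (an assumed tuning parameter)."""
    groups = [{name} for name in components]
    merged = True
    while merged:
        merged = False
        for g1, g2 in combinations(groups, 2):
            if any(
                distance(components[x], components[y]) < threshold
                for x in g1
                for y in g2
            ):
                groups.remove(g1)
                groups.remove(g2)
                groups.append(g1 | g2)
                merged = True
                break  # restart the scan over the updated group list
    return groups


if __name__ == "__main__":
    # Toy repository; the component names and bodies are made up.
    repo = {
        "stack": "def push(stack, item): stack.append(item)",
        "queue": "def enqueue(queue, item): queue.append(item)",
        "matrix": "def transpose(matrix): return zip(*matrix)",
    }
    for group in cluster(repo, threshold=0.6):
        print(sorted(group))
```

In this toy repository the two container-like components fall into one cluster and the matrix component stays alone; each resulting cluster can then be read as a candidate domain. Any other automatically computable distance could be substituted for `distance` without changing `cluster`.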