Statistical Methods for Performance Evaluation of WEB Document Classification

The principal aim of this paper is to make a review of main statistical methods for classifying documents that could be easily adapted in the context of Web document retrieval. After presenting the most popular methods of classification we will also define the most accurate indicators for assessment of classifiers performance. Thus we will refer to the recall, precision, fscore, sensitivity and specificity. We will also describe how these indicators can be calculated in the context of Web documents.