Text mining and extracting features of documents

Features of each document are extracted 230 on the basis of a term-document matrix 100, updated by term-document matrix updating means 210, and of a basis vector calculated by basis vector calculating means 230, the basis vector spanning a space of effective features. Execution of these steps is repeated until a predetermined requirement given by a user is satisfied.
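
The repeated sequence of basis vector calculation, feature extraction, and matrix updating may be illustrated by the following minimal sketch. It is an illustration under stated assumptions, not the claimed implementation: the abstract does not say how the basis vector is calculated or how the matrix is updated, so the sketch assumes power iteration for the basis vector and subtraction of the extracted component (deflation) for the update, and it takes the user-given requirement to be a fixed number of features. The names `dominant_basis_vector`, `extract_features`, and `n_features` are hypothetical.

```python
import math


def dominant_basis_vector(E, iters=200):
    """Approximate the dominant left singular vector of E (terms x documents).

    Assumption: power iteration on E * E^T. The source does not specify
    how the basis vector is actually calculated.
    """
    n_terms, n_docs = len(E), len(E[0])
    w = [1.0 / math.sqrt(n_terms)] * n_terms  # uniform unit-length start
    for _ in range(iters):
        # y = E^T w: one value per document
        y = [sum(E[t][d] * w[t] for t in range(n_terms)) for d in range(n_docs)]
        # v = E y: one value per term
        v = [sum(E[t][d] * y[d] for d in range(n_docs)) for t in range(n_terms)]
        norm = math.sqrt(sum(x * x for x in v))
        if norm == 0.0:
            break  # nothing left to extract from E
        w = [x / norm for x in v]
    return w


def extract_features(term_doc, n_features):
    """Return one feature list per document, n_features values each.

    Assumption: the stopping requirement is a fixed feature count, and the
    matrix update is deflation (removing the component just extracted).
    """
    E = [row[:] for row in term_doc]  # working copy; the input is not modified
    n_terms, n_docs = len(E), len(E[0])
    features = [[] for _ in range(n_docs)]
    for _ in range(n_features):
        # Step 1: calculate a basis vector from the current matrix.
        w = dominant_basis_vector(E)
        # Step 2: the feature of each document is its projection onto w.
        y = [sum(E[t][d] * w[t] for t in range(n_terms)) for d in range(n_docs)]
        for d in range(n_docs):
            features[d].append(y[d])
        # Step 3: update the matrix by subtracting the extracted component.
        for t in range(n_terms):
            for d in range(n_docs):
                E[t][d] -= w[t] * y[d]
    return features


if __name__ == "__main__":
    # Rows are terms, columns are documents (toy data, not from the source).
    term_doc = [[2.0, 0.0],
                [0.0, 1.0]]
    print(extract_features(term_doc, n_features=2))
```

Only the current matrix, one basis vector, and the accumulated features are held at any time in this sketch. The uniform starting vector is a simplification: power iteration can stall if the start happens to be orthogonal to the dominant direction, so a practical version would need a more careful initialization and a convergence check.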