Evaluation of NLP systems
暂无分享,去创建一个
Computational linguistics as a science has had its evaluation methods since its early days: A concordance program can be evaluated according to its ability to find all occurrences, to list them properly, to have a flexible user interface etc., frequency programs *nay be evaluated according to their statistics, the possibility of lemmatisation, parsers are evaluated according to their efficiency etc. When we contemplate one component at a time and want a technical evaluation, we normally have no problem defining the evaluation criteria.
[1] Ralph Grishman,et al. Message Understanding Conference- 6: A Brief History , 1996, COLING.
[2] Margaret King,et al. Machine translation today , 1987 .
[3] Margaret King,et al. Evaluating natural language processing systems , 1996, CACM.
[4] David G. Hendry,et al. Spelling mistakes: how well do correctors perform? , 1993, CHI '93.