Comparison of Approaches for Evaluating Answer Validation
[1] Sanda M. Harabagiu, et al. Methods for Using Textual Entailment in Open-Domain Question Answering, 2006, ACL.
[2] Sanda M. Harabagiu, et al. The Structure and Performance of an Open-Domain Question Answering System, 2000, ACL.
[3] Charles P. Friedman, et al. Evaluation Methods in Medical Informatics, 1997, Computers and Medicine.
[4] M. Felisa Verdejo, et al. Testing the Reasoning for Question Answering Validation, 2008, J. Log. Comput.
[5] J. Hanley, et al. The meaning and use of the area under a receiver operating characteristic (ROC) curve, 1982, Radiology.
[6] Eduard H. Hovy, et al. Question Answering in Webclopedia, 2000, TREC.
[7] Tetsuya Sakai, et al. On the reliability of information retrieval metrics based on graded relevance, 2007, Inf. Process. Manag.
[8] Tom Fawcett, et al. Robust Classification for Imprecise Environments, 2000, Machine Learning.
[9] Ellen M. Voorhees, et al. Evaluating Evaluation Measure Stability, 2000, SIGIR 2000.
[10] Robert C. Holte, et al. What ROC Curves Can't Do (and Cost Curves Can), 2004, ROCAI.
[11] Dragomir R. Radev, et al. Question-answering by predictive annotation, 2000, SIGIR '00.