Paper Information - Learning to Extract Answers in Question Answering: Experimental Studies

Learning to Extract Answers in Question Answering: Experimental Studies

Question Answering (QA) systems are complex programs able to answer a question posed in natural language. Their source of information is either a given corpus or, as assumed here, the Web. To achieve their goal, these systems perform various subtasks, the last of which, called answer extraction, is very similar to an Information Extraction task. The main objective of this study is to adapt machine learning techniques defined for Information Extraction tasks to the slightly different task of answer extraction in QA systems. The specificities of QA systems are identified and exploited in this adaptation. Three algorithms, assuming an increasing abstraction of natural language texts, are tested and compared.

Isabelle Tellier | Marc Tommasi | Patrick Marty | Florent Jousse