A requirement analysis for an open set of human language technology tasks

This work presents a requirement analysis and a design proposal for a general architecture for a specified, yet open set of human language technology (HLT) tasks --- the set chosen is dubbed information refinement. Apart from using information refinement as a means to focus the requirement analysis and accompanying design proposal, the analysis and proposal are based on a survey of a number of projects that have had great impact on the realisation of today's HLT architectures, as well as on the experiences gained from a long-term case study aiming at composing a general purpose tool-kit for Swedish. The analysis and design are currently used in an ongoing effort at SICS to implement an open and general architecture for information refinement.

[1]  Fredrik Olsson,et al.  Exploiting Syntax when Detecting Protein Names in Text , 2002 .

[2]  Preben Hansen,et al.  Information Access and Refinement - a Research Theme , 2001 .

[3]  Kim Topley Core Java Foundation Classes , 1998 .

[4]  Pierre Hansen,et al.  The Information Seeking and Retrieval process at the Swedish Patent-and Registration Office , 2000, SIGIR 2000.

[5]  Thierry Declerck,et al.  Linguistic engineering using ALEP , 2000 .

[6]  Kristofer Franzén Adapting an English Information Extraction System to Swedish , 1999, NODALIDA.

[7]  Fredrik Olsson,et al.  Experiences of Language Engineering Algorithm Reuse , 2000, LREC.

[8]  Ralph Grishman,et al.  TIPSTER Text Phase II Architecture Design Version 2.1p 19 June 1996 , 1996, TIPSTER.

[9]  Alistair Cockburn,et al.  Structuring Use Cases with Goals , 2000 .

[10]  Fredrik Olsson Requirements and design considerations for an open and general architecture for information refinement , 2002 .

[11]  Fredrik Olsson,et al.  Exploring Key Phrases for Browsing an Online News Feed in a Mobile Context , 2001 .

[12]  Joseph Polifroni,et al.  Organization, communication, and control in the GALAXY-II conversational system , 1999, EUROSPEECH.

[13]  HAMISH CUNNINGHAM,et al.  Software architecture for language engineering , 2000 .

[14]  Timo Järvinen,et al.  A non-projective dependency parser , 1997, ANLP.

[15]  H. Alshawi,et al.  The Core Language Engine , 1994 .

[16]  Victor Zue,et al.  GALAXY-II: a reference architecture for conversational system development , 1998, ICSLP.

[17]  Joseph Polifroni,et al.  Galaxy-II as an Architecture for Spoken Dialogue Evaluation , 2000, LREC.

[18]  Mark Liberman,et al.  A formal framework for linguistic annotation , 1999, Speech Commun..

[19]  Mark Liberman,et al.  ATLAS: A Flexible and Extensible Architecture for Linguistic Annotation , 2000, LREC.

[20]  李幼升,et al.  Ph , 1989 .

[21]  Preben Hansen,et al.  The information seeking and retrieval process at the Swedish patent- and registration office: moving from lab-based to real life work-task environment , 2000, SIGIR 2000.

[22]  Diana Maynard,et al.  JAPE: a Java Annotation Patterns Engine , 2000 .