The traditional English text chunking approach identifies phrases by using only one model and phrases with the same types of features. It has been shown that the limitations of using only one model are that: the use of the same types of features is not suitable for all phrases, and data sparseness may also result. In this paper, the Distributed Multi-Agent based architecture approach is proposed and applied in the identification of English phrases. This strategy put phrases into agents according to their sensitive features and identifies different phrases in parallel, where the main features are: one, easy and quick communication between phrases; two, avoidance of data sparseness. By applying and testing the approach on the public training and test corpus, the F score for arbitrary phrases identification using Distributed Multi-Agent strategy achieves 95.70% compared to the previous best F score of 94.17%.
[1]
Robert C. Berwick,et al.
Principle-Based Parsing
,
1987
.
[2]
Steven Abney,et al.
Parsing By Chunks
,
1991
.
[3]
Wojciech Skut,et al.
A Maximum-Entropy Partial Parser for Unrestricted Text
,
1998,
VLC@COLING/ACL.
[4]
Steven P. Abney.
Partial parsing via finite-state cascades
,
1996,
Natural Language Engineering.
[5]
Ilker Hamzaoglu,et al.
Web Based Parallel/Distributed Medical Data Mining Using Software Agents
,
1997,
AMIA.
[6]
Sabine Buchholz,et al.
Introduction to the CoNLL-2000 Shared Task Chunking
,
2000,
CoNLL/LLL.
[7]
Tong Zhang,et al.
Text Chunking based on a Generalization of Winnow
,
2002,
J. Mach. Learn. Res..
[8]
Yuji Matsumoto,et al.
Use of Support Vector Learning for Chunk Identification
,
2000,
CoNLL/LLL.