Part-of-Speech Tagging Using Evolutionary Computation

Part-of-speech tagging is a task of considerable importance in the field of natural language processing. Its purpose is to automatically tag the words of a text with labels that designate the appropriate parts-of-speech. The approach proposed in this paper divides the problem into two tasks: a learning task and an optimization task. Algorithms from the field of evolutionary computation were adopted to tackle each of those tasks. We emphasize the use of swarm intelligence, not only for the good results achieved, but also because it is one of the first applications of such algorithms to this problem. This approach was designed with the aim of being easily extended to other natural language processing tasks that share characteristics with the part-of-speech tagging problem. The results obtained in two different English corpora are among the best published.

[1]  Irene Pimenta Rodrigues,et al.  BioPOS: Biologically Inspired Algorithms for POS Tagging , 2012 .

[2]  Eric Brill,et al.  Transformation-Based Error-Driven Learning and Natural Language Processing: A Case Study in Part-of-Speech Tagging , 1995, CL.

[3]  Ewan Klein,et al.  Natural Language Processing with Python , 2009 .

[4]  Goldberg,et al.  Genetic algorithms , 1993, Robust Control Systems with Genetic Algorithms.

[5]  Tiago Ferra de Sousa,et al.  Particle Swarm based Data Mining Algorithms for classification tasks , 2004, Parallel Comput..

[6]  Alexander Gelbukh,et al.  Computational Linguistics and Intelligent Text Processing , 2015, Lecture Notes in Computer Science.

[7]  Enrique Alba,et al.  Natural language tagging with genetic algorithms , 2006, Inf. Process. Lett..

[8]  Malcolm I. Heywood,et al.  Use of a genetic algorithm in brill's transformation-based part-of-speech tagger , 2005, GECCO '05.

[9]  Lourdes Araujo Part-of-Speech Tagging with Evolutionary Algorithms , 2002, CICLing.

[10]  Cícero Nogueira dos Santos,et al.  Portuguese Part-of-Speech Tagging Using Entropy Guided Transformation Learning , 2008, PROPOR.

[11]  Tiago Ferra de Sousa,et al.  Swarm optimisation as a new tool for data mining , 2003, Proceedings International Parallel and Distributed Processing Symposium.

[12]  Thiago Alexandre Salgueiro Pardo,et al.  Computational Processing of the Portuguese Language - 11th International Conference, PROPOR 2014, São Carlos/SP, Brazil, October 6-8, 2014. Proceedings , 2014, Lecture Notes in Computer Science.