Comparing the Performance of Different NLP Toolkits in Formal and Social Media Text

Nowadays, there are many toolkits available for performing common natural language processing tasks, which enable the development of more powerful applications without having to start from scratch. In fact, for English, there is no need to develop tools such as tokenizers, part-of-speech (POS) taggers, chunkers or named entity recognizers (NER). The current challenge is to select which one to use, out of the range of available tools. This choice may depend on several aspects, including the kind and source of text, where the level, formal or informal, may influence the performance of such tools. In this paper, we assess a range of natural language processing toolkits with their default configuration, while performing a set of standard tasks (e.g. tokenization, POS tagging, chunking and NER), in popular datasets that cover newspaper and social network text. The obtained results are analyzed and, while we could not decide on a single toolkit, this exercise was very helpful to narrow our choice.

[1]  Evgeny Kharlamov,et al.  Controlled Query Evaluation for Datalog and OWL 2 Profile Ontologies , 2015, IJCAI.

[2]  Geylani Kardas,et al.  Model-driven development of multiagent systems: a survey and evaluation , 2013, The Knowledge Engineering Review.

[3]  Arto Salomaa,et al.  Automata-Theoretic Aspects of Formal Power Series , 1978, Texts and Monographs in Computer Science.

[4]  Bryan Ford,et al.  Parsing expression grammars: a recognition-based syntactic foundation , 2004, POPL '04.

[5]  Erik F. Tjong Kim Sang,et al.  Introduction to the CoNLL-2003 Shared Task: Language-Independent Named Entity Recognition , 2003, CoNLL.

[6]  Mihai Surdeanu,et al.  The Stanford CoreNLP Natural Language Processing Toolkit , 2014, ACL.

[7]  Ingo Br,et al.  Prolog programming for artificial intelligence , 1990 .

[8]  Jaroslav Porubän,et al.  Automatic derivation of domain terms and concept location based on the analysis of the identifiers , 2010, ArXiv.

[9]  Sumit Gulwani,et al.  Automated Grading of DFA Constructions , 2013, IJCAI.

[10]  Michael Wooldridge,et al.  Programming Multi-Agent Systems in AgentSpeak using Jason (Wiley Series in Agent Technology) , 2007 .

[11]  Iván García-Magariño,et al.  Towards the integration of the agent-oriented modeling diversity with a powertype-based language , 2014, Comput. Stand. Interfaces.

[12]  Walter Daelemans,et al.  Pattern for Python , 2012, J. Mach. Learn. Res..

[13]  David A. Ferrucci,et al.  UIMA: an architectural approach to unstructured information processing in the corporate research environment , 2004, Natural Language Engineering.

[14]  Rafael H. Bordini,et al.  BDI agent programming in AgentSpeak using Jason , 2006 .

[15]  S. Doaitse Swierstra Combinator Parsers - From Toys to Tools , 2000, Electron. Notes Theor. Comput. Sci..

[16]  Moharram Challenger,et al.  On the use of a domain-specific modeling language in the development of multiagent systems , 2014, Eng. Appl. Artif. Intell..

[17]  Atif M. Memon,et al.  GUI ripping: reverse engineering of graphical user interfaces for testing , 2003, 10th Working Conference on Reverse Engineering, 2003. WCRE 2003. Proceedings..

[18]  Mariela Inés Cortés,et al.  MAS-ML 2.0: Supporting the modelling of multi-agent systems with different agent architectures , 2015, J. Syst. Softw..

[19]  Maria João Varanda Pereira,et al.  Conclave: Ontology-Driven Measurement of Semantic Relatedness between Source Code Elements and Problem Domain Concepts , 2014, ICCSA.

[20]  José Paulo Leal,et al.  Mooshak: a Web‐based multi‐site programming contest system , 2003, Softw. Pract. Exp..

[21]  Vinay Vachharajani,et al.  A Proposed Architecture for Automated Assessment of Use Case Diagrams , 2014 .

[22]  Nelma Moreira,et al.  FAdo and GUItar : Tools for Automata Manipulation and Visualization , 2009 .

[23]  Richard J. Lipton,et al.  A Probabilistic Remark on Algebraic Program Testing , 1978, Inf. Process. Lett..

[24]  Zarina Shukur,et al.  A Design of an Assessment System for UML Class Diagram , 2007, 2007 International Conference on Computational Science and its Applications (ICCSA 2007).

[25]  Dietmar Seipel,et al.  Knowledge Engineering for Hybrid Deductive Databases , 2017, WLP / WFLP.

[26]  Raphaël Troncy,et al.  NERD meets NIF: Lifting NLP Extraction Results to the Linked Data Cloud , 2012, LDOW.

[27]  Kalina Bontcheva,et al.  TwitIE: An Open-Source Information Extraction Pipeline for Microblog Text , 2013, RANLP.

[28]  Antonio Vallecillo A Journey through the Secret Life of Models , 2008, Perspectives Workshop: Model Engineering of Complex Systems.

[29]  André Almeida,et al.  GUItar and FAgoo: Graphical interface for automata visualization, editing, and interaction , 2010 .

[30]  Andrea Omicini,et al.  MAS Meta-models on Test: UML vs. OPM in the SODA Case Study , 2005, CEEMAS.

[31]  Douglas C. Schmidt,et al.  Guest Editor's Introduction: Model-Driven Engineering , 2006, Computer.

[32]  Wesley De Neve,et al.  Leveraging Existing Tools for Named Entity Recognition in Microposts , 2013, #MSM.

[33]  Zachary Parker,et al.  Comparing NoSQL MongoDB to an SQL DB , 2013, ACMSE '13.

[34]  Marjan Mernik,et al.  Domain-Specific Languages: A Systematic Mapping Study , 2017, SOFSEM.

[35]  Daan Leijen,et al.  Parsec: direct style monadic parser combinators for the real world , 2001 .

[36]  Jakob Nielsen,et al.  A mathematical model of the finding of usability problems , 1993, INTERCHI.

[37]  R. Nigel Horspool Recursive Ascent-Descent Parsers , 1990, CC.

[38]  Eljas Soisalon-Soininen,et al.  Parsing Theory: Volume II Lr(k) and Ll(k) Parsing , 1990 .

[39]  Jacques Sakarovitch,et al.  Introducing VAUCANSON , 2004, Theor. Comput. Sci..

[40]  Derick Wood,et al.  Grail: A C++ Library for Automata and Expressions , 1994, J. Symb. Comput..

[41]  Nuno Oliveira,et al.  Comparing general-purpose and domain-specific languages: An empirical study , 2010, Comput. Sci. Inf. Syst..

[42]  Laks V. S. Lakshmanan,et al.  On a theory of probabilistic deductive databases , 2001, Theory and Practice of Logic Programming.

[43]  Michael E. Bratman,et al.  Intention, Plans, and Practical Reason , 1991 .

[44]  S. Doaitse Swierstra,et al.  Higher order attribute grammars , 1989, PLDI '89.

[45]  Brian Henderson-Sellers,et al.  Agent-oriented methodologies , 2005 .

[46]  Joachim Baumeister,et al.  Declaratively Querying and Visualizing Knowledge Bases in Xml , 2004, INAP/WLP.

[47]  Arie van Deursen,et al.  Crawling Ajax-Based Web Applications through Dynamic Analysis of User Interface State Changes , 2012, TWEB.

[48]  Per Runeson,et al.  Guidelines for conducting and reporting case study research in software engineering , 2009, Empirical Software Engineering.

[49]  David T. Barnard,et al.  Producing a Top-Down Parse Order with Bottom-up Parsing , 1995, Inf. Process. Lett..

[50]  Valeria Seidita,et al.  A Notation for Modeling Jason-Like BDI Agents , 2012, 2012 Sixth International Conference on Complex, Intelligent, and Software Intensive Systems.

[51]  V. S. Subrahmanian,et al.  Theory of Generalized Annotated Logic Programming and its Applications , 1992, J. Log. Program..

[52]  Brendan T. O'Connor,et al.  Part-of-Speech Tagging for Twitter: Annotation, Features, and Experiments , 2010, ACL.

[53]  Ted J. Biggerstaff,et al.  The concept assignment problem in program understanding , 1993, [1993] Proceedings Working Conference on Reverse Engineering.

[54]  Joachim Baumeister,et al.  Anomalies in ontologies with rules , 2010, J. Web Semant..

[55]  Lori L. Pollock,et al.  A field study of how developers locate features in source code , 2016, Empirical Software Engineering.

[56]  Stephen G. Kobourov,et al.  Spring Embedders and Force Directed Graph Drawing Algorithms , 2012, ArXiv.

[57]  Bostjan Slivnik,et al.  The embedded left LR parser , 2011, 2011 Federated Conference on Computer Science and Information Systems (FedCSIS).

[58]  R. Fabregat,et al.  A web-based e-learning tool for UML class diagrams , 2010, IEEE EDUCON 2010 Conference.

[59]  Jack Minker,et al.  Semantic Query Optimization in Expert Systems and Database Systems , 1984, Expert Database Workshop.

[60]  Ivan Lukovic,et al.  A DSL for modeling application-specific functionalities of business applications , 2015, Comput. Lang. Syst. Struct..

[61]  Bob Boiko,et al.  Content Management Bible , 2001 .

[62]  Tong Zhang,et al.  Named Entity Recognition through Classifier Combination , 2003, CoNLL.

[63]  Paulo S. C. Alencar,et al.  A Generative Approach for Multi-agent System Development , 2004, SELMAS.

[64]  Siu Cheung Hui,et al.  Probabilistic Equivalence Verification Approach for Automatic Mathematical Solution Assessment , 2013, IJCAI.

[65]  J. Jackson,et al.  Identifying Top Java Errors for Novice Programmers , 2005, Proceedings Frontiers in Education 35th Annual Conference.

[66]  Kevin Schneider,et al.  UI traces: Supporting the maintenance of interactive software , 2009, 2009 IEEE International Conference on Software Maintenance.

[67]  Bogdan Dit,et al.  Feature location in source code: a taxonomy and survey , 2013, J. Softw. Evol. Process..

[68]  Meritxell González Bermúdez An analysis of Twitter corpora and the differences between formal and colloquial tweets , 2015 .

[69]  Carlo Zaniolo,et al.  Logic and Databases: A History of Deductive Databases , 2014, Computational Logic.

[70]  Pablo Gamallo,et al.  Yet Another Suite of Multilingual NLP Tools , 2015, SLATE.

[71]  Paul Roe,et al.  Static Analysis of Students' Java Programs , 2004, ACE.

[72]  Abdul Sattar,et al.  A code snippet library for CS1 , 2012, INROADS.

[73]  Roel Vertegaal,et al.  SnipMatch: using source code context to enhance snippet retrieval and parameterization , 2012, UIST.

[74]  Chris J. Hinde,et al.  A diagram drawing tool for semi–automatic assessment of conceptual database diagrams , 2006 .

[75]  José Paulo Leal,et al.  A Structural Approach to Assess Graph-Based Exercises , 2015, SLATE.

[76]  W. Bruce Croft,et al.  Query expansion using local and global document analysis , 1996, SIGIR '96.

[77]  Sufian Idris,et al.  A Design of an Assessment System for UML Class Diagram , 2007 .

[78]  Peter C. Chapin Formal languages I , 1973, CSC '73.

[79]  Atif M. Memon,et al.  GUITAR: an innovative tool for automated testing of GUI-driven software , 2014, Automated Software Engineering.

[80]  Ivan Luković,et al.  DSLs in Action with Model Based Approaches to Information System Development , 2014 .

[81]  Fabrizio Sebastiani,et al.  Machine learning in automated text categorization , 2001, CSUR.

[82]  Ramez Elmasri,et al.  Fundamentals of Database Systems , 1989 .

[83]  Susan H. Rodger,et al.  JFLAP: An Interactive Formal Languages and Automata Package , 2006 .

[84]  Edward Vanhoutte,et al.  An Introduction to the TEI and the TEI Consortium , 2004, Lit. Linguistic Comput..

[85]  Michaela Bacíková,et al.  Defining Domain Language of Graphical User Interfaces , 2013, SLATE.

[86]  Franco Zambonelli,et al.  A Study of Some Multi-agent Meta-models , 2004, AOSE.

[87]  Marian H. Nodine,et al.  A Metamodel for Agents, Roles, and Groups , 2004, AOSE.

[88]  Steven Bird,et al.  NLTK: The Natural Language Toolkit , 2002, ACL.

[89]  Mitchell P. Marcus,et al.  Text Chunking using Transformation-Based Learning , 1995, VLC@ACL.

[90]  Terence Parr,et al.  LL(*): the foundation of the ANTLR parser generator , 2011, PLDI '11.

[91]  Xavier Gómez Guinovart,et al.  Retreading Dictionaries for the 21st Century , 2013, SLATE.

[92]  Zarina Shukur,et al.  The Design of ADAT: A Tool for Assessing Automata-Based Assignments , 2008 .

[93]  Anand S. Rao,et al.  Decision Procedures for BDI Logics , 1998, J. Log. Comput..

[94]  Klaus Fischer,et al.  A platform-independent metamodel for multiagent systems , 2008, Autonomous Agents and Multi-Agent Systems.

[95]  W. van der Hoek,et al.  Multiagent System Technologies, 7th German Conference, MATES 2009, Hamburg, Germany, September 9-11, 2009. Proceedings , 2009, MATES.

[97]  Andrian Marcus,et al.  An information retrieval approach to concept location in source code , 2004, 11th Working Conference on Reverse Engineering.

[98]  Luc De Raedt,et al.  ProbLog: A Probabilistic Prolog and its Application in Link Discovery , 2007, IJCAI.

[99]  Sergej Chodarev,et al.  Profile-driven source code exploration , 2015, 2015 Federated Conference on Computer Science and Information Systems (FedCSIS).

[100]  Paul Walton Purdom,et al.  Semantic routines and LR(k) parsers , 1980, Acta Informatica.

[101]  Mauricio A. Saca Refactoring improving the design of existing code , 2017, 2017 IEEE 37th Central America and Panama Convention (CONCAPAN XXXVII).

[102]  Roy T. Fielding,et al.  Principled design of the modern Web architecture , 2000, Proceedings of the 2000 International Conference on Software Engineering. ICSE 2000 the New Millennium.

[104]  Daniel Moldt,et al.  Goal Representation for BDI Agent Systems , 2004, PROMAS.

[105]  GORDON FRASER,et al.  A Large-Scale Evaluation of Automated Unit Test Generation Using EvoSuite , 2014, ACM Trans. Softw. Eng. Methodol..

[106]  Boštjan Slivnik,et al.  Producing the left parse during bottom-up parsing , 2005, Inf. Process. Lett..

[107]  Miryung Kim,et al.  An ethnographic study of copy and paste programming practices in OOPL , 2004, Proceedings. 2004 International Symposium on Empirical Software Engineering, 2004. ISESE '04..

[108]  Jeffrey D. Ullman,et al.  Principles of Database and Knowledge-Base Systems, Volume II , 1988, Principles of computer science series.

[109]  Oren Etzioni,et al.  Named Entity Recognition in Tweets: An Experimental Study , 2011, EMNLP.

[110]  Christian Hahn A domain specific modeling language for multiagent systems , 2008, AAMAS.

[111]  A. S. Roa,et al.  AgentSpeak(L): BDI agents speak out in a logical computable language , 1996 .

[112]  Samet Atdag,et al.  A comparison of named entity recognition tools applied to biographical texts , 2013, 2nd International Conference on Systems and Computer Science.

[113]  Nelma Moreira,et al.  Testing the Equivalence of Regular Languages , 2009, DCFS.

[114]  Roger S. Pressman,et al.  Software Engineering: A Practitioner's Approach , 1982 .

[115]  Gary Court,et al.  JSON Schema: core definitions and terminology , 2013 .

[116]  Koushik Sen,et al.  CodeHint: dynamic and interactive synthesis of code snippets , 2014, ICSE.

[117]  Dominic Widdows,et al.  Orthogonal Negation in Vector Spaces for Modelling Word-Meanings and Document Retrieval , 2003, ACL.

[118]  Jean-Claude Royer,et al.  Aspect-Oriented, Model-Driven Software Product Lines: The AMPLE Way , 2011 .

[119]  José Paulo Leal,et al.  Enki: A Pedagogical Services Aggregator for Learning Programming Languages , 2016, ITiCSE.

[120]  Raphaël Troncy,et al.  Benchmarking the Extraction and Disambiguation of Named Entities on the Semantic Web , 2014, LREC.

[121]  Joe Futrelle,et al.  Reasoning about Provenance with OWL and SWRL Rules , 2008, AAAI Spring Symposium: AI Meets Business Rules and Process Management.

[122]  Adrian Johnstone,et al.  BRNGLR: a cubic Tomita-style GLR parsing algorithm , 2007, Acta Informatica.

[123]  Kalina Bontcheva,et al.  GATE: an Architecture for Development of Robust HLT applications , 2002, ACL.

[124]  Kajal T. Claypool,et al.  XSnippet: mining For sample code , 2006, OOPSLA '06.

[125]  Moharram Challenger,et al.  A Domain Specific Metamodel for Semantic Web Enabled Multi-Agent Systems , 2011, CAiSE Workshops.

[126]  Preslav Nakov,et al.  SemEval-2015 Task 10: Sentiment Analysis in Twitter , 2015, *SEMEVAL.

[127]  Maurice van Keulen,et al.  Concept Extraction Challenge: University of Twente at #MSM2013 , 2013, #MSM.

[128]  ChengXiang Zhai,et al.  A study of methods for negative relevance feedback , 2008, SIGIR '08.

[129]  José João Almeida,et al.  Processing XML: a rewriting system approach , 2010 .

[130]  Ruzica Piskac,et al.  Interactive Synthesis of Code Snippets , 2011, CAV.

[131]  Tobias Blanke,et al.  Comparison of named entity recognition tools for raw OCR text , 2012, KONVENS.

[132]  Alfred V. Aho,et al.  Compilers: Principles, Techniques, and Tools , 1986, Addison-Wesley series in computer science / World student series edition.

[133]  Matthias Jarke,et al.  Logic Programming and Databases , 1984, Expert Database Workshop.

[134]  Eljas Soisalon-Soininen,et al.  Parsing Theory - Volume I: Languages and Parsing , 1988, EATCS Monographs on Theoretical Computer Science.

[135]  Beatrice Santorini,et al.  Building a Large Annotated Corpus of English: The Penn Treebank , 1993, CL.

[136]  Alfred V. Aho,et al.  The Theory of Parsing, Translation, and Compiling , 1972 .

[137]  André L. Santos,et al.  GUI-driven code tracing , 2012, 2012 IEEE Symposium on Visual Languages and Human-Centric Computing (VL/HCC).

[138]  Jorge J. Gómez-Sanz,et al.  FAML: A Generic Metamodel for MAS Development , 2009, IEEE Transactions on Software Engineering.

[139]  Andrea Omicini,et al.  Artifacts in the A&A meta-model for multi-agent systems , 2008, Autonomous Agents and Multi-Agent Systems.

[140]  Rainer Koschke,et al.  On the Comprehension of Program Comprehension , 2014, TSEM.

[141]  Walter Daelemans,et al.  MBT: A Memory-Based Part of Speech Tagger-Generator , 1996, VLC@COLING.

[142]  P. Garrett,et al.  Basic complex analysis , 2022 .

[143]  Nelma Moreira,et al.  XML description for automata manipulations , 2010 .