MaltOptimizer: Fast and effective parser optimization

Statistical parsers often require careful parameter tuning and feature selection. This is a nontrivial task for application developers who are not interested in parsing for its own sake, and it can be time-consuming even for experienced researchers. In this paper we present MaltOptimizer, a tool developed to automatically explore parameters and features for MaltParser, a transition-based dependency parsing system that can be used to train parser's given treebank data. MaltParser provides a wide range of parameters for optimization, including nine different parsing algorithms, an expressive feature specification language that can be used to define arbitrarily rich feature models, and two machine learning libraries, each with their own parameters. MaltOptimizer is an interactive system that performs parser optimization in three stages. First, it performs an analysis of the training set in order to select a suitable starting point for optimization. Second, it selects the best parsing algorithm and tunes the parameters of this algorithm. Finally, it performs feature selection and tunes machine learning parameters. Experiments on a wide range of data sets show that MaltOptimizer quickly produces models that consistently outperform default settings and often approach the accuracy achieved through careful manual optimization.

[1]  Chih-Jen Lin,et al.  LIBLINEAR: A Library for Large Linear Classification , 2008, J. Mach. Learn. Res..

[2]  van Gerardus Noord,et al.  Proceedings of the 23rd International Conference on Computational Linguistics (COLING 2010) , 2010 .

[3]  Eneko Agirre,et al.  Improving Dependency Parsing with Semantic Classes , 2011, ACL.

[4]  Joakim Nivre,et al.  MaltParser: A Data-Driven Parser-Generator for Dependency Parsing , 2006, LREC.

[5]  Carlos Gómez-Rodríguez,et al.  Dependency Parsing with Undirected Graphs , 2012, EACL 2012.

[6]  Joakim Nivre,et al.  Algorithms for Deterministic Incremental Dependency Parsing , 2008, CL.

[7]  Evelina Andersson,et al.  Cross-Framework Evaluation for Statistical Parsing , 2012, EACL.

[8]  Joakim Nivre,et al.  An Efficient Algorithm for Projective Dependency Parsing , 2003, IWPT.

[9]  Joakim Nivre,et al.  Characterizing the Errors of Data-Driven Dependency Parsing Models , 2007, EMNLP.

[10]  Joakim Nivre,et al.  Incrementality in Deterministic Dependency Parsing , 2004 .

[11]  Nerea Ezeiza,et al.  Exploiting the Contribution of Morphological Information to Parsing: the BASQUE TEAM system in the SPRML'2013 Shared Task , 2013, SPMRL@EMNLP.

[12]  Michael A. Covington,et al.  A Fundamental Algorithm for Dependency Parsing , 2004 .

[13]  Mojgan Seraji,et al.  Dependency Parsers for Persian , 2012, ALR@COLING.

[14]  Nizar Habash,et al.  Overview of the SPMRL 2013 Shared Task: A Cross-Framework Evaluation of Parsing Morphologically Rich Languages , 2013, SPMRL@EMNLP.

[15]  Joakim Nivre,et al.  An Improved Oracle for Dependency Parsing with Online Reordering , 2009, IWPT.

[16]  Andrew McCallum,et al.  Transition-based Dependency Parsing with Selectional Branching , 2013, ACL.

[17]  Philipp Koehn,et al.  Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL) , 2007 .

[18]  Joakim Nivre,et al.  Proceedings of the Fifth Workshop on Treebanks and Linguistic Theories , 2006 .

[19]  Anssi Yli-Jyrä,et al.  Multiplanarity - a model for dependency structures in treebanks , 2003 .

[20]  Walter Daelemans,et al.  Simultaneous feature selection and parameter optimization for memory-based natural language processing , 2000 .

[21]  Miguel Ballesterosy,et al.  Optimizing Planar and 2-Planar parsers with MaltOptimizer , 2012 .

[22]  Joakim Nivre,et al.  Non-Projective Dependency Parsing in Expected Linear Time , 2009, ACL.

[23]  Shashi Narayan,et al.  Proceedings of the 24th International Conference on Computational Linguistics (COLING) , 2012, International Conference on Computational Linguistics.

[24]  Joakim Nivre,et al.  Labeled Pseudo-Projective Dependency Parsing with Support Vector Machines , 2006, CoNLL.

[25]  Louisa Sadler,et al.  Structural Non-Correspondence in Translation , 1991, EACL.

[26]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[27]  Thomas G. Dietterich What is machine learning? , 2020, Archives of Disease in Childhood.

[28]  Koby Crammer,et al.  Online Passive-Aggressive Algorithms , 2003, J. Mach. Learn. Res..

[29]  Alicia Burga,et al.  Exploring Morphosyntactic Annotation over a Spanish Corpus for Dependency Parsing , 2013, DepLing.

[30]  Nur Izura Udzir,et al.  A Study on Feature Selection and Classification Techniques for Automatic Genre Classification of Traditional Malay Music , 2008, ISMIR.

[31]  Joakim Nivre Incremental Non-Projective Dependency Parsing , 2007, HLT-NAACL.

[32]  Fernando Pereira,et al.  Discriminative learning and spanning tree algorithms for dependency parsing , 2006 .

[33]  Joakim Nivre,et al.  MaltOptimizer: An Optimization Tool for MaltParser , 2012, EACL.

[34]  Antal van den Bosch Wrapped progressive sampling search for optimizing learning algorithm parameters , 2005 .

[35]  Andrew McCallum,et al.  Efficiently Inducing Features of Conditional Random Fields , 2002, UAI.

[36]  Fernando Pereira,et al.  Multilingual Dependency Analysis with a Two-Stage Discriminative Parser , 2006, CoNLL.

[37]  Miguel Ballesteros,et al.  Effective Morphological Feature Selection with MaltOptimizer at the SPMRL 2013 Shared Task , 2013, SPMRL@EMNLP.

[38]  Joakim Nivre,et al.  A Transition-Based System for Joint Part-of-Speech Tagging and Labeled Non-Projective Dependency Parsing , 2012, EMNLP.

[39]  Volkan Cirik,et al.  The AI-KU System at the SPMRL 2013 Shared Task : Unsupervised Features for Dependency Parsing , 2013, SPMRL@EMNLP.

[40]  Eneko Agirre,et al.  Contribution of Complex Lexical Information to Solve Syntactic Ambiguity in Basque , 2012, COLING.

[41]  Joakim Nivre,et al.  Transition-based Dependency Parsing with Rich Non-local Features , 2011, ACL.

[42]  Isabelle Guyon,et al.  An Introduction to Variable and Feature Selection , 2003, J. Mach. Learn. Res..

[43]  Rada Mihalcea,et al.  Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Langu , 2011, ACL 2011.

[44]  Miguel Ballesteros Martínez Exploring automatic feature selection for transition-based dependency parsing , 2013 .

[45]  Joakim Nivre,et al.  Single Malt or Blended? A Study in Multilingual Parser Optimization , 2007, EMNLP.

[46]  Sabine Buchholz,et al.  CoNLL-X Shared Task on Multilingual Dependency Parsing , 2006, CoNLL.

[47]  Melba M. Crawford,et al.  Adaptive feature selection for hyperspectral data analysis , 2004, SPIE Remote Sensing.

[48]  Joakim Nivre,et al.  A Transition-Based Parser for 2-Planar Dependency Structures , 2010, ACL.

[49]  Jörg Tiedemann,et al.  Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12) , 2012 .

[50]  Abhimanyu Das,et al.  Submodular meets Spectral: Greedy Algorithms for Subset Selection, Sparse Approximation and Dictionary Selection , 2011, ICML.

[51]  Johan Hall,et al.  Transition-Based Natural Language Parsing with Dependency and Constituency Representations , 2008 .

[52]  Joakim Nivre,et al.  Pseudo-Projective Dependency Parsing , 2005, ACL.

[53]  Joakim Nivre,et al.  MaltParser: A Language-Independent System for Data-Driven Dependency Parsing , 2007, Natural Language Engineering.

[54]  Marco Carlo Passarotti,et al.  Will a Parser Overtake Achilles? First experiments on parsing the Ancient Greek Dependency Treebank , 2012 .

[55]  Pierre Nugues,et al.  Automatic Discovery of Feature Sets for Dependency Parsing , 2010, COLING.

[56]  Sebastian Riedel,et al.  The CoNLL 2007 Shared Task on Dependency Parsing , 2007, EMNLP.

[57]  Nello Cristianini,et al.  Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics (EACL) , 2012 .

[58]  Joakim Nivre,et al.  MaltOptimizer: A System for MaltParser Optimization , 2012, LREC.

[59]  Joakim Nivre,et al.  Feature Description for the Transition-Based Parser for Joint Part-of-Speech Tagging and Labeled Non-Projective Dependency Parsing , 2012 .