Software and Data for Corpus Pattern Analysis

This report describes the tools and resources developed to support Corpus Pattern Analysis (CPA)—a corpus-based method for building patterns dictionaries. The tools are an annotation of concordance in Sketch Engine, a special CPA editor for editing Pattern Dictionary of English Verbs (PDEV), dedicated servlets based on the Dictionary Editing and Browsing platform and a public interface for browsing the PDEV. The resources are SemEval 2015 Task 15 dataset and LEMON API.

[1]  Constantin Orasan,et al.  Barbecued Opakapaka: Using Semantic Preferences for Ontology Population , 2015, RANLP.

[2]  James Pustejovsky,et al.  Semantic Coercion in Language: Beyond Distributional Analysis , 2012 .

[3]  Adam Kilgarriff,et al.  Semi-Automatic Dictionary Drafting , 2010, A Way with Words.

[4]  Dong Yu,et al.  BLCUNLP: Corpus Pattern Analysis for Verbs Based on Dependency Chain , 2015, SemEval@NAACL-HLT.

[5]  Octavian Popescu Buildind a Resource of Patterns Using Semantic Types , 2012, LREC.

[6]  Adam Kilgarriff,et al.  GDEX: Automatically Finding Good Dictionary Examples in a Corpus , 2008 .

[7]  Elisabetta Jezek,et al.  Opposition Relations among Verb Frames , 2015, EVENTS@HLP-NAACL.

[8]  Octavian Popescu Learning Corpus Patterns Using Finite State Automata , 2013, IWCS.

[9]  Ted Pedersen Duluth: Word Sense Discrimination in the Service of Lexicography , 2015, SemEval@NAACL-HLT.

[10]  Asunción Gómez-Pérez,et al.  Interchanging lexical resources on the Semantic Web , 2012, Language Resources and Evaluation.

[11]  Adam Kilgarriff,et al.  Introduction to the Special Issue on SENSEVAL , 2000, Comput. Humanit..

[12]  James Pustejovsky,et al.  Constructing a Corpus-based Ontology Using Model Bias , 2006, FLAIRS.

[13]  Eckhard Bick,et al.  Tailored Feature Extraction for Lexical Disambiguation of English Verbs Based on Corpus Pattern Analysis , 2012, COLING.

[14]  Elisabetta Jezek,et al.  What lexical sets tell us about conceptual categories , 2010 .

[15]  Haofen Wang,et al.  The GuanXi network: a new multilingual LLOD for Language Learning applications , 2017, NLPLOD@RANLP.

[16]  Ismaïl El Maarouf,et al.  An empirical classification of verbs based on Semantic Types: the case of the 'poison' verbs , 2013, JSSP.

[17]  Silvie Cinková,et al.  A database of semantic clusters of verb usages , 2012, LREC.

[18]  Elisabetta Jezek,et al.  T-PAS; A resource of Typed Predicate Argument Structures for linguistic analysis and semantic processing , 2014, LREC.

[19]  James Pustejovsky,et al.  The Generative Lexicon , 1995, CL.

[20]  Patrick Hanks How people use words to make meanings : Semantic types meet valencies , 2012 .

[21]  Adam Kilgarriff,et al.  SemEval-2015 Task 15: A CPA dictionary-entry-building task , 2015, *SEMEVAL.

[22]  Patrick Hanks,et al.  Lexical Analysis: Norms and Exploitations , 2013 .

[23]  Martha Palmer,et al.  Inducing Example-based Semantic Frames from a Massive Amount of Verb Uses , 2014, EACL.

[24]  Piek T. J. M. Vossen,et al.  A Distributed Database System for Developing Ontological and Lexical Resources in Harmony , 2008, CICLing.

[25]  James Pustejovsky,et al.  Automated Induction of Sense in Context , 2004, COLING.

[26]  Martha Palmer,et al.  Mapping CPA Patterns onto OntoNotes Senses , 2014, LREC.

[27]  Silvie Cinková,et al.  Managing Uncertainty in Semantic Tagging , 2012, EACL.

[28]  Vít Baisa,et al.  Automatic classification of semantic patterns from the Pattern Dictionary of English Verbs , 2013, JSSP.

[29]  Pavel Rychlý,et al.  Manatee/Bonito - A Modular Corpus Manager , 2007, RASLAN.

[30]  John Sinclair,et al.  Corpus, Concordance, Collocation , 1991 .

[31]  John B. Lowe,et al.  The Berkeley FrameNet Project , 1998, ACL.

[32]  Charles J. Fillmore,et al.  Frames and the semantics of understanding , 1985 .

[33]  Yorick Wilks,et al.  A Preferential, Pattern-Seeking, Semantics for Natural Language Inference , 1975, Artif. Intell..

[34]  Irene Renau,et al.  Using CPA to represent Spanish pronominal verbs in a learners’ dictionary , 2012 .

[35]  Patrick Hanks Corpus pattern analysis , 2004 .

[36]  Silvie Cinková,et al.  Optimizing semantic granularity for NLP – report on a lexicographic experiment , 2012 .

[37]  Vít Baisa,et al.  Disambiguating Verbs by Collocation: Corpus Lexicography meets Natural Language Processing , 2014, LREC.

[38]  Geoffrey Leech,et al.  100 Million Words of English:The British National Corpus (BNC) , 1992 .

[39]  James Pustejovsky,et al.  A Pattern Dictionary for Natural Language Processing , 2005 .

[40]  Ken Litkowski,et al.  Pattern Dictionary of English Prepositions , 2014, ACL.

[41]  Araceli Alonso Campos,et al.  Corpus Pattern Analysis in determining specialised uses of verbal lexical units , 2013 .

[42]  Adam Kilgarriff,et al.  The Sketch Engine: ten years on , 2014 .