IISCNLP at SemEval-2016 Task 2: Interpretable STS with ILP based Multiple Chunk Aligner

Interpretable semantic textual similarity (iSTS) task adds a crucial explanatory layer to pairwise sentence similarity. We address various components of this task: chunk level semantic alignment along with assignment of similarity type and score for aligned chunks with a novel system presented in this paper. We propose an algorithm, iMATCH, for the alignment of multiple non-contiguous chunks based on Integer Linear Programming (ILP). Similarity type and score assignment for pairs of chunks is done using a supervised multiclass classification technique based on Random Forrest Classifier. Results show that our algorithm iMATCH has low execution time and outperforms most other participating systems in terms of alignment score. Of the three datasets, we are top ranked for answer- students dataset in terms of overall score and have top alignment score for headlines dataset in the gold chunks track.

[1]  Steven Bethard,et al.  Back to Basics for Monolingual Alignment: Exploiting Word Similarity and Contextual Evidence , 2014, TACL.

[2]  Eneko Agirre,et al.  UBC: Cubes for English Semantic Textual Similarity and Supervised Approaches for Interpretable STS , 2015, SemEval@NAACL-HLT.

[3]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[4]  Harold W. Kuhn,et al.  The Hungarian method for the assignment problem , 1955, 50 Years of Integer Programming.

[5]  Christian Hänig,et al.  ExB Themis: Extensive Feature Extraction from Word Alignments for Semantic Textual Similarity , 2015, *SEMEVAL.

[6]  Steven Bethard,et al.  Feature-Rich Two-Stage Logistic Regression for Monolingual Alignment , 2015, EMNLP.

[7]  Chris Callison-Burch,et al.  A Lightweight and High Performance Monolingual Word Aligner , 2013, ACL.

[8]  Nitin Madnani,et al.  TER-Plus: paraphrase, semantic, and alignment enhancements to Translation Edit Rate , 2009, Machine Translation.

[9]  Eneko Agirre,et al.  SemEval-2016 Task 2: Interpretable Semantic Textual Similarity , 2016, *SEMEVAL.

[10]  Kevin Knight,et al.  A New Objective Function for Word Alignment , 2009, ILP 2009.

[11]  Houda Bouamor,et al.  Monolingual Alignment by Edit Rate Computation on Sentential Paraphrase Pairs , 2011, ACL.

[12]  Mihai Surdeanu,et al.  The Stanford CoreNLP Natural Language Processing Toolkit , 2014, ACL.

[13]  Vasile Rus,et al.  NeRoSim: A System for Measuring and Interpreting Semantic Textual Similarity , 2015, *SEMEVAL.

[14]  Laurence A. Wolsey,et al.  Integer and Combinatorial Optimization , 1988 .

[15]  Iain Dunning,et al.  PuLP : A Linear Programming Toolkit for Python , 2011 .