Distributional Neural Networks for Automatic Resolution of Crossword Puzzles

Automatic resolution of Crossword Puzzles (CPs) depends heavily on the quality of the answer candidate lists that a retrieval system produces for each clue in the puzzle grid. Previous work has shown that such lists can be generated by applying Information Retrieval (IR) search algorithms to databases of previously solved CPs, and then reranked with tree kernels (TKs) applied to a syntactic tree representation of the clues. In this paper, we create a labelled dataset of 2 million clues, on which we apply an innovative Distributional Neural Network (DNN) for reranking clue pairs. Our DNN is computationally efficient and can therefore exploit such large datasets, showing a large improvement over the TK approach when the latter uses a small training set. In contrast, when data is scarce, TKs outperform DNNs.
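
To make the retrieve-then-rerank setting concrete, the following is a minimal sketch of how clue pairs might be scored and turned into an answer candidate list. It is not the DNN or the TK model described here: the similarity function is a plain bag-of-words cosine used as a stand-in for a learned scorer, and the names `bow`, `cosine`, and `rank_answers`, as well as the toy clue database, are hypothetical and chosen only for illustration.

```python
import math
from collections import Counter


def bow(text):
    """Lowercased bag-of-words counts for a clue (illustrative tokenisation)."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two bag-of-words vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def rank_answers(target_clue, retrieved):
    """Rerank retrieved (clue, answer) pairs against the target clue.

    `retrieved` stands for the output of an IR step over previously
    solved puzzles. The cosine score is only a placeholder for a
    learned pairwise model; a real reranker would replace it.
    """
    target = bow(target_clue)
    scored = sorted(
        retrieved,
        key=lambda pair: cosine(target, bow(pair[0])),
        reverse=True,
    )
    # Collapse duplicate answers, keeping each at its best-ranked position.
    ranked = []
    for _, answer in scored:
        if answer not in ranked:
            ranked.append(answer)
    return ranked


if __name__ == "__main__":
    # Toy stand-in for clues retrieved from a database of solved puzzles.
    candidates = [
        ("Feline pet", "CAT"),
        ("Italian capital city", "ROME"),
        ("Capital of Italy", "ROME"),
    ]
    print(rank_answers("Capital city of Italy", candidates))
```

In this sketch the pairwise scorer is the only component that a trained model would replace; the retrieval step and the aggregation of duplicate answers are kept deliberately simple.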
