Book Recommendation Beyond the Usual Suspects - Embedding Book Plots Together with Place and Time Information

Content-based recommendation of books and other media is usually based on semantic similarity measures. While metadata can be compared easily, measuring the semantic similarity of narrative literature is challenging. Keyword-based approaches tend to retrieve books from the same series, or return no results at all in sparser libraries. We propose to represent plots as dense vectors, which supports semantic search for similar plots even if they have no words in common. Further, we propose to embed plots, places, and times in the same embedding space, which enables arithmetic on these aspects. For example, a book with a similar plot but set in a different, user-specified place can be retrieved. We evaluate our approach on a set of 16,000 book synopses spanning 500 years of literature and 200 genres, and compare it to a keyword-based baseline.
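The arithmetic on plot and place vectors described above can be illustrated with a minimal sketch. It assumes that books and places are already embedded in one shared space. The four-dimensional vectors, the book titles, and the helper names `cosine`, `shift_setting`, and `most_similar` are all hypothetical and hand-made for illustration; they are not the learned embeddings or the method evaluated in the paper.

```python
import math

# Toy shared space (assumption): dimensions are [mystery, romance, london, paris].
# Real embeddings would be learned from synopses, not written by hand.
PLACES = {
    "london": [0.0, 0.0, 1.0, 0.0],
    "paris": [0.0, 0.0, 0.0, 1.0],
}
BOOKS = {
    "mystery_in_london": [1.0, 0.0, 1.0, 0.0],
    "mystery_in_paris": [1.0, 0.0, 0.0, 1.0],
    "romance_in_london": [0.0, 1.0, 1.0, 0.0],
    "romance_in_paris": [0.0, 1.0, 0.0, 1.0],
}


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is zero)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def shift_setting(book_vec, old_place_vec, new_place_vec):
    """Build a query vector: book - old place + new place."""
    return [b - o + n for b, o, n in zip(book_vec, old_place_vec, new_place_vec)]


def most_similar(query, candidates, exclude=()):
    """Rank candidate titles by cosine similarity to the query, best first."""
    scored = [
        (title, cosine(query, vec))
        for title, vec in candidates.items()
        if title not in exclude
    ]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Ask for a plot like "mystery_in_london", but set in Paris.
query = shift_setting(BOOKS["mystery_in_london"], PLACES["london"], PLACES["paris"])
ranking = most_similar(query, BOOKS, exclude={"mystery_in_london"})
print(ranking[0][0])
```

In this toy setup the top-ranked title is the mystery set in Paris, because subtracting the old place and adding the new one moves the query along the place dimensions while leaving the plot dimensions unchanged. With learned embeddings the aspects would not be cleanly separated into individual dimensions, so results would be approximate.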
