Semantic Recommendations of Books Using Recurrent Neural Networks

Digital transformations led to the development of supportive technologies, new tools for smart education, and emergent branches of research in the domain of digital library services. This paper introduces a content-based recommender system for Romanian books. The reference documents are old and were digitized via Optical Character Recognition (OCR), a process that generated noise in the conversion. The current prototype version of our system is trained on a corpus of 50 OCRed books which are split into corresponding paragraphs; thus, recommendations of related books to the user’s input query are provided only with regards to these reference documents. The trained neural models consider a bidirectional RNN layer with LSTM or GRU cells over pre-trained Romanian FastText embeddings, followed by a global max-pooling layer. The study shows competitive results on predicting books given an input text, as the proposed model achieves an overall accuracy of around 90%.

[1]  Sophie Ahrens,et al.  Recommender Systems , 2012 .

[2]  Jeffrey Dean,et al.  Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.

[3]  Tommy W. S. Chow,et al.  Organizing Books and Authors by Multilayer SOM , 2016, IEEE Transactions on Neural Networks and Learning Systems.

[4]  Jeffrey Pennington,et al.  GloVe: Global Vectors for Word Representation , 2014, EMNLP.

[5]  Rabab Kreidieh Ward,et al.  Deep Sentence Embedding Using Long Short-Term Memory Networks: Analysis and Application to Information Retrieval , 2015, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[6]  Tomas Mikolov,et al.  Bag of Tricks for Efficient Text Classification , 2016, EACL.

[7]  Maria Soledad Pera,et al.  Analyzing Book-Related Features to Recommend Books for Emergent Readers , 2015, HT.

[8]  Loriene Roy,et al.  Content-based book recommending using learning for text categorization , 1999, DL '00.

[9]  Christopher Potts,et al.  Recursive Deep Models for Semantic Compositionality Over a Sentiment Treebank , 2013, EMNLP.

[10]  Maria Soledad Pera,et al.  SOLE-R: A Semantic and Linguistic Approach for Book Recommendations , 2014, 2014 IEEE 14th International Conference on Advanced Learning Technologies.

[11]  Diana Inkpen,et al.  A survey of book recommender systems , 2017, Journal of Intelligent Information Systems.

[12]  Mohammad S. Obaidat,et al.  Authorship verification using deep belief network systems , 2017, Int. J. Commun. Syst..

[13]  Liang Wang,et al.  DeepStyle: Learning User Preferences for Visual Recommendation , 2017, SIGIR.

[14]  Bruno Martins,et al.  Stylometric relevance-feedback towards a hybrid book recommendation algorithm , 2012, BooksOnline '12.

[15]  Yoshua Bengio,et al.  Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling , 2014, ArXiv.