Content-Based Filtering Recommendation in Abstract Search Using Neo4j

In this work, we focus on development of a content search on report documents and recommendation on related document from search result. The main contribution of this work is to model document content into graph. Document-Keyword graph was created to represent the relationship between document and its features. The data were stored as a connected graph in Ne04j graph database. The graph were used to filter keyword co-occurrence documents in order to reduce search space. The performance of the proposed model was evaluated with accuracy 0.77. To improve the accuracy, the model can be extended with collecting user selection as collaborative feedback to the system, or extended with domain specific ontology to analyze the semantic relationship of the documents.

[1]  Weimin Xu,et al.  Recipe recommendation considering the flavor of regional cuisines , 2016, 2016 International Conference on Progress in Informatics and Computing (PIC).

[2]  Jim Webber,et al.  Graph Databases: New Opportunities for Connected Data , 2013 .

[3]  Din J. Wasem,et al.  Mining of Massive Datasets , 2014 .

[4]  Guy Shani,et al.  A Survey of Accuracy Evaluation Metrics of Recommendation Tasks , 2009, J. Mach. Learn. Res..

[5]  Saman Haratizadeh,et al.  Graph-based collaborative ranking , 2016, Expert Syst. Appl..

[6]  F. O. Isinkaye,et al.  Recommendation systems: Principles, methods and evaluation , 2015 .

[7]  Rada Mihalcea,et al.  TextRank: Bringing Order into Text , 2004, EMNLP.

[8]  Minyong Shi,et al.  Analysis of film data based on Neo4j , 2017, 2017 IEEE/ACIS 16th International Conference on Computer and Information Science (ICIS).

[9]  Florian Boudin,et al.  How Document Pre-processing affects Keyphrase Extraction Performance , 2016, NUT@COLING.

[10]  Lada A. Adamic,et al.  Recipe recommendation using ingredient networks , 2011, WebSci '12.

[11]  Vimala Balakrishnan,et al.  Stemming and lemmatization: A comparison of retrieval performances , 2014 .