An adaptive fuzzy based recommender system for enterprise search

This thesis discusses relevance feedback including implicit parameters, explicit parameters and user query and how they could be used to build a recommender system to enhance the search performance in the enterprise. It presents an approach for the development of an adaptive fuzzy logic based recommender system for enterprise search. The system is designed to recommend documents and people based on the user query in a task specific search environment. The proposed approach provides a new mechanism for constructing and integrating a task, user and document profiles into a unified index thorough the use of relevance feedback and fuzzy rule based summarisation. The three profiles are fuzzy based and are created using the captured relevance feedback. In the task profile, each task was modelled as a sequence of weighted terms which were used by the users to complete the task. In the user profile, the user was modelled as a sequence of weighted terms which were used to search for the required information. In the document profile the document was modelled as a group of weighted terms which were used by the users to retrieve the document. Fuzzy sets and rules were used to calculate the term weight based on the term frequency in the user queries. An empirical research was carried out to capture the relevance feedback from 35 users on 20 predefined simulated enterprise search tasks and to investigate the correlation between the implicit and explicit relevance feedback. Based on the results, an adaptive linear predictive model was developed to estimate the document relevancy from the implicit feedback parameters. The predicted document relevancy was then used to train the fuzzy system which created and integrated the three profiles, as briefly described above. The captured data set was used to develop and train the fuzzy system. The proposed system achieved 89% accuracy performance classifying the relevant documents. With regard to the implementation, Apache Sorl, Apache Tikka, Oracle 11g and Java were used to develop a prototype system. The overall retrieval accuracy performance of the proposed system was tested by carrying out a comparative retrieval accuracy performance evaluation based on Precision (P), Recall (R) and ranking analysis. The values of P and R of the proposed system were compared with two other systems being the standard inverted index based Solr system and the semantic indexing based lucid system. The proposed system enhanced the value of P significantly where the average of P value has been increased from 0.00428 to 0.064 as compared with the

[1]  Lotfi A. Zadeh,et al.  MAKING COMPUTERS THINK LIKE PEOPLE , 1984 .

[2]  Andreas Stafylopatis,et al.  A hybrid movie recommender system based on neural networks , 2005, 5th International Conference on Intelligent Systems Design and Applications (ISDA'05).

[3]  Shaghayegh Sahebi,et al.  Cross-Domain Collaborative Recommendation in a Cold-Start Context: The Impact of User Profile Size on the Quality of Recommendation , 2013, UMAP.

[4]  Yunyao Li,et al.  Automatic suggestion of query-rewrite rules for enterprise search , 2012, SIGIR '12.

[5]  Malú Castellanos HotMiner: Discovering Hot Topics from Dirty Text , 2004 .

[6]  David Hawking,et al.  Challenges in Enterprise Search , 2004, ADC.

[7]  J. Mandel,et al.  The Statistical Analysis of Experimental Data. , 1965 .

[8]  David Smiley,et al.  Solr 1.4 Enterprise Search Server , 2009 .

[9]  Taghi M. Khoshgoftaar,et al.  A Survey of Collaborative Filtering Techniques , 2009, Adv. Artif. Intell..

[10]  Craig MacDonald,et al.  Voting techniques for expert search , 2008, Knowledge and Information Systems.

[11]  José M. Molina López,et al.  Agent-based collaborative filtering based on fuzzy recommendations , 2004, Int. J. Web Eng. Technol..

[12]  L. Tennakoon,et al.  Challenges in recruitment of research participants , 2003 .

[13]  Rahat Iqbal,et al.  An intelligent framework for monitoring student performance using fuzzy rule-based Linguistic Summarisation , 2012, 2012 IEEE International Conference on Fuzzy Systems.

[14]  Raymond J. Mooney,et al.  Content-boosted collaborative filtering for improved recommendations , 2002, AAAI/IAAI.

[15]  Christoph Mangold,et al.  u38: A Framework for Database-Supported Enterprise Document-Retrieval , 2006, 2006 10th International Database Engineering and Applications Symposium (IDEAS'06).

[16]  Lotfi A. Zadeh,et al.  Outline of a New Approach to the Analysis of Complex Systems and Decision Processes , 1973, IEEE Trans. Syst. Man Cybern..

[17]  Vimala Balakrishnan,et al.  Implicit user behaviours to improve post-retrieval document relevancy , 2014, Comput. Hum. Behav..

[18]  Morten Hertzum,et al.  The information-seeking practices of engineers: searching for documents as well as for people , 2000, Inf. Process. Manag..

[19]  Bracha Shapira,et al.  Recommender Systems Handbook , 2015, Springer US.

[20]  Debora Shaw,et al.  Handbook of usability testing: How to plan, design, and conduct effective tests , 1996 .

[21]  Alexander J. Smola,et al.  Scalable hierarchical multitask learning algorithms for conversion optimization in display advertising , 2014, WSDM.

[22]  Liliana Ardissono,et al.  Ubiquitous User Assistance in a Tourist Information Server , 2002, AH.

[23]  Ryen W. White,et al.  Text selections as implicit relevance feedback , 2012, SIGIR '12.

[24]  Gerald Reif,et al.  SemTree: Ontology-Based Decision Tree Algorithm for Recommender Systems , 2008, SEMWEB.

[25]  Martin F. Porter,et al.  An algorithm for suffix stripping , 1997, Program.

[26]  M. de Rijke,et al.  A language modeling framework for expert finding , 2009, Inf. Process. Manag..

[27]  Stefan Dlugolinsky,et al.  Approach for enterprise search and interoperability using lightweight semantic , 2014, IEEE 18th International Conference on Intelligent Engineering Systems INES 2014.

[28]  Prabhakar Raghavan,et al.  Social Networks: From the Web to the Enterprise , 2002, IEEE Internet Comput..

[29]  David Hawking,et al.  How Valuable is External Link Evidence When Searching Enterprise Webs? , 2004, ADC.

[30]  Anne E. James,et al.  Towards the Development of an Integrated Framework for Enhancing Enterprise Search Using Latent Semantic Indexing , 2011, ICCS.

[31]  Wei Wang,et al.  A class similarity based weight estimation algorithm for the personalized enterprise search engines , 2014 .

[32]  Yongtae Woo,et al.  A Hybrid Recommender System Combining Collaborative Filtering with Neural Network , 2002, AH.

[33]  Qiang Yang,et al.  Transfer learning for collaborative filtering via a rating-matrix generative model , 2009, ICML '09.

[34]  Yehuda Koren,et al.  Improved Neighborhood-based Collaborative Filtering , 2007 .

[35]  Douglas W. Oard,et al.  Modeling Information Content Using Observable Behavior , 2001 .

[36]  Zuhua Jiang,et al.  An inner-enterprise knowledge recommender system , 2010, Expert Syst. Appl..

[37]  Usman Afzal,et al.  Meven : An Enterprise Trust Recommender System , 2013 .

[38]  Rajeev Motwani,et al.  The PageRank Citation Ranking : Bringing Order to the Web , 1999, WWW 1999.

[39]  Judith Masthoff,et al.  Group Modeling: Selecting a Sequence of Television Items to Suit a Group of Viewers , 2004, User Modeling and User-Adapted Interaction.

[40]  Chris Cornelis,et al.  One-and-only item recommendation with fuzzy logic techniques , 2007, Inf. Sci..

[41]  Ian Witten,et al.  Data Mining , 2000 .

[42]  Ruslan Salakhutdinov,et al.  Bayesian probabilistic matrix factorization using Markov chain Monte Carlo , 2008, ICML '08.

[43]  Luis M. de Campos,et al.  A collaborative recommender system based on probabilistic inference from fuzzy observations , 2008, Fuzzy Sets Syst..

[44]  David Hawking,et al.  Query-independent evidence in home page finding , 2003, TOIS.

[45]  Michael Brückner,et al.  Context-Aware Tourism Recommender System Using Temporal Ontology and Naïve Bayes , 2015 .

[46]  J. Ruff,et al.  Information Overload: Causes, Symptoms and Solutions , 2002 .

[47]  Ronald R. Yager,et al.  Fuzzy logic methods in recommender systems , 2003, Fuzzy Sets Syst..

[48]  Eugene Agichtein,et al.  Beyond dwell time: estimating document relevance from cursor movements and other post-click searcher behavior , 2012, WWW.

[49]  M. E. Cintra,et al.  A comparative study on classic machine learning and fuzzy approaches for classification problems , 2009 .

[50]  Diane Kelly,et al.  Methods for Evaluating Interactive Information Retrieval Systems with Users , 2009, Found. Trends Inf. Retr..

[51]  Jonathan L. Herlocker,et al.  Click data as implicit relevance feedback in web search , 2007, Inf. Process. Manag..

[52]  Jiawei Han,et al.  Modeling and exploiting heterogeneous bibliographic networks for expertise ranking , 2012, JCDL '12.

[53]  Yang Song,et al.  Online behavioral genome sequencing from usage logs: decoding the search behaviors , 2014, WWW '14 Companion.

[54]  Zhi-Hua Zhou,et al.  Rule extraction: Using neural networks or for neural networks? , 2004, Journal of Computer Science and Technology.

[55]  Kemal A. Delic,et al.  Enterprise Knowledge Clouds: Next Generation KM Systems? , 2009, 2009 International Conference on Information, Process, and Knowledge Management.

[56]  Adam Prügel-Bennett,et al.  An Improved Switching Hybrid Recommender System Using Naive Bayes Classifier and Collaborative Filtering , 2010 .

[57]  Ryen W. White,et al.  Finding relevant documents using top ranking sentences: an evaluation of two alternative schemes , 2002, SIGIR '02.

[58]  Konstantinos G. Margaritis,et al.  Using SVD and demographic data for the enhancement of generalized Collaborative Filtering , 2007, Inf. Sci..

[59]  Nasser Yazdani,et al.  A3CRank: An adaptive ranking method based on connectivity, content and click-through data , 2010, Inf. Process. Manag..

[60]  Analía Amandi,et al.  Intelligent User Profiling , 2009, Artificial Intelligence: An International Perspective.

[61]  Ethem Alpaydin,et al.  Introduction to machine learning , 2004, Adaptive computation and machine learning.

[62]  Barry Smyth,et al.  Passive Profiling from Server Logs in an Online Recruitment Environment , 2001, IJCAI 2001.

[63]  Djoerd Hiemstra,et al.  University of Twente at the TREC 2008 Enterprise Track: Using the Global Web as an Expertise Evidence Source , 2008, TREC.

[64]  Yi-Cheng Zhang,et al.  Recommender Systems , 2012, ArXiv.

[65]  Mark T. Maybury,et al.  Expert Finding Systems , 2006 .

[66]  Shakhatreh,et al.  Fuzzy Logic and Its Applications , 2007 .

[67]  Dan Frankowski,et al.  Collaborative Filtering Recommender Systems , 2007, The Adaptive Web.

[68]  Jaime Teevan,et al.  Implicit feedback for inferring user preference: a bibliography , 2003, SIGF.

[69]  Barry Smyth,et al.  Exploiting Query Repetition and Regularity in an Adaptive Community-Based Web Search Engine , 2004, User Modeling and User-Adapted Interaction.

[70]  Zuhua Jiang,et al.  Recommender system based on workflow , 2009, Decis. Support Syst..

[71]  Michael Idinopulos,et al.  Do you Know who your Experts are , 2006 .

[72]  M. White Enterprise Search , 2012 .

[73]  Linda Banwell,et al.  Users and user study methodology: the JUBILEE project , 2003, Inf. Res..

[74]  Ebrahim Mamdani,et al.  Applications of fuzzy algorithms for control of a simple dynamic plant , 1974 .

[75]  Hisao Ishibuchi,et al.  Rule weight specification in fuzzy rule-based classification systems , 2005, IEEE Transactions on Fuzzy Systems.

[76]  Mingxuan Sun,et al.  Learning multiple-question decision trees for cold-start recommendation , 2013, WSDM.

[77]  Ricardo Colomo Palacios,et al.  ReSySTER: A hybrid recommender system for Scrum team roles based on fuzzy and rough sets , 2012, Int. J. Appl. Math. Comput. Sci..

[78]  J. Bobadilla,et al.  Recommender systems survey , 2013, Knowl. Based Syst..

[79]  David Heckerman,et al.  Empirical Analysis of Predictive Algorithms for Collaborative Filtering , 1998, UAI.

[80]  David Hawking,et al.  Panoptic Expert: Searching for experts not just for documents , 2001 .

[81]  Sylvain Arlot,et al.  A survey of cross-validation procedures for model selection , 2009, 0907.4728.

[82]  Xingshe Zhou,et al.  TV Program Recommendation for Multiple Viewers Based on user Profile Merging , 2006, User Modeling and User-Adapted Interaction.

[83]  Kristian J. Hammond,et al.  Watson: Anticipating and Contextualizing Information Needs , 1999 .

[84]  อนิรุธ สืบสิงห์,et al.  Data Mining Practical Machine Learning Tools and Techniques , 2014 .

[85]  Qiang Yang,et al.  Transfer Learning in Collaborative Filtering for Sparsity Reduction , 2010, AAAI.

[86]  J. K. Suresh,et al.  Ten Steps to Maturity in Knowledge Management: Lessons in Economy , 2006 .

[87]  Greg Linden,et al.  Amazon . com Recommendations Item-to-Item Collaborative Filtering , 2001 .

[88]  Ryen W. White,et al.  The Use of Implicit Evidence for Relevance Feedback in Web Retrieval , 2002, ECIR.

[89]  Mary Shapcott,et al.  Generating semantically enriched user profiles for Web personalization , 2007, TOIT.

[90]  Hani Hagras,et al.  A fuzzy based agent for group decision support of applicants ranking within recruitment systems , 2009, 2009 IEEE Symposium on Intelligent Agents.

[91]  Jianchang Mao,et al.  Enterprise Search: Tough Stuff , 2004, ACM Queue.

[92]  Stavri G. Nikolov,et al.  Enterprise Search in the European Union: A Techno-economic Analysis , 2012 .

[93]  ChengXiang Zhai,et al.  Probabilistic Models for Expert Finding , 2007, ECIR.

[94]  Harsha K. Kalutarage,et al.  Towards a knowledge-based approach for effective decision-making in railway safety , 2015, J. Knowl. Manag..

[95]  Filip Radlinski,et al.  Evaluating the accuracy of implicit feedback from clicks and query reformulations in Web search , 2007, TOIS.

[96]  Amnon Meisels,et al.  A Decision Tree Based Recommender System , 2010, IICS.

[97]  Richard Hill,et al.  Improving Cloud Datacentre Scalability, Agility and Performance Using OpenFlow , 2012, 2012 Fourth International Conference on Intelligent Networking and Collaborative Systems.

[98]  David Hawking,et al.  Does topic metadata help with Web search , 2007 .

[99]  Patrice Perny,et al.  Preference-based Search and Machine Learning for Collaborative Filtering: the "Film-Conseil" Movie Recommender System , 2001 .

[100]  Siraj Ahmed Shaikh,et al.  Operating heavy duty vehicles under extreme heat conditions: A fuzzy approach for smart gear-shifting strategy , 2013, 16th International IEEE Conference on Intelligent Transportation Systems (ITSC 2013).

[101]  Chong Wang,et al.  Collaborative topic modeling for recommending scientific articles , 2011, KDD.

[102]  Richard Hill,et al.  Gluing Grids and Clouds Together: A Service-Oriented Approach , 2011, EIDWT.

[103]  Alan Eckhardt Similarity of users' (content-based) preference models for Collaborative filtering in few ratings scenario , 2012, Expert Syst. Appl..

[104]  Steve Fox,et al.  Evaluating implicit measures to improve web search , 2005, TOIS.

[105]  Byeong Man Kim,et al.  Constructing User Profiles for Collaborative Recommender System , 2004, APWeb.

[106]  Benjamin Schrauwen,et al.  Deep content-based music recommendation , 2013, NIPS.

[107]  Robin D. Burke,et al.  Hybrid Web Recommender Systems , 2007, The Adaptive Web.

[108]  Rahat Iqbal,et al.  Task-specific information retrieval systems for software engineers , 2012, J. Comput. Syst. Sci..

[109]  Jan Gulliksen,et al.  Key principles for user-centred systems design , 2003, Behav. Inf. Technol..

[110]  Hui Gao,et al.  A Probabilistic Recommendation Method Inspired by Latent Dirichlet Allocation Model , 2014 .

[111]  Weiguo Fan,et al.  ExpertRank: A topic-aware expert finding algorithm for online knowledge communities , 2013, Decis. Support Syst..

[112]  Andrei Z. Broder,et al.  Towards the next generation of enterprise search technology , 2004, IBM Syst. J..

[113]  Rahat Iqbal,et al.  An intelligent framework for activity led learning in network planning and management , 2014, Int. J. Commun. Networks Distributed Syst..