Fuzzy rule based profiling approach for enterprise information seeking and retrieval

With the exponential growth of information available on the Internet and on organisational intranets, there is a need for profile-based information seeking and retrieval (IS&R) systems that can support users' context-aware information needs. This paper presents a new approach for enterprise IS&R systems that uses fuzzy logic to develop task, user and document profiles for modelling user information-seeking behaviour. Relevance feedback was captured from real users engaged in IS&R tasks and used to develop a linear regression model that predicts document relevancy from implicit relevance indicators. Fuzzy relevance profiles were created by applying term frequency-inverse document frequency (TF-IDF) analysis to the successful user queries. Fuzzy rule-based summarisation was then used to integrate the three profiles into a unified index reflecting the semantic weight of the query terms in relation to the task, user and document. The unified index was used to select the documents and experts most relevant to the query topic. The overall performance of the system was evaluated using standard precision and recall metrics, which show significant improvements in retrieving relevant documents in response to user queries.
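
As a rough illustration of two of the building blocks named above, the sketch below computes TF-IDF term weights and the precision and recall of a retrieved set. It is a minimal sketch under assumptions, not the authors' implementation: the abstract does not state which TF-IDF variant was used, so the common form (relative term frequency multiplied by the natural log of the inverse document frequency) is assumed, and the function names `tf_idf` and `precision_recall` are hypothetical. The fuzzy profiling and rule-based summarisation steps are not reproduced here.

```python
import math
from collections import Counter


def tf_idf(docs):
    """Return one {term: weight} dict per tokenised document.

    Assumed variant: tf = count / document length, idf = ln(N / df).
    """
    n_docs = len(docs)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for doc in docs for term in set(doc))
    weights = []
    for doc in docs:
        counts = Counter(doc)
        total = len(doc)
        weights.append(
            {t: (c / total) * math.log(n_docs / df[t]) for t, c in counts.items()}
        )
    return weights


def precision_recall(retrieved, relevant):
    """Return (precision, recall) for a retrieved set against a relevant set."""
    retrieved, relevant = set(retrieved), set(relevant)
    hits = len(retrieved & relevant)
    precision = hits / len(retrieved) if retrieved else 0.0
    recall = hits / len(relevant) if relevant else 0.0
    return precision, recall


if __name__ == "__main__":
    # Toy, made-up queries; not data from the study.
    queries = [
        ["fuzzy", "rule", "search"],
        ["fuzzy", "profile", "search"],
        ["enterprise", "search", "search"],
    ]
    for w in tf_idf(queries):
        print(w)
    print(precision_recall(["d1", "d2", "d3", "d4"], ["d1", "d3", "d5"]))
```

Under this variant, a term that occurs in every query (here `search`) receives a weight of zero, so only terms that discriminate between queries contribute to a profile.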
