A new fuzzy logic based ranking function for efficient Information Retrieval system

Abstract The relevant documents from large data sets are retrieved with the help of ranking function in Information Retrieval system. In this paper, a new fuzzy logic based ranking function is proposed and implemented to enhance the performance of Information Retrieval system. The proposed ranking function is based on the computation of different terms of term-weighting schema such as term frequency, inverse document frequency and normalization. Fuzzy logic is used at two levels to compute relevance score of a document with respect to the query in present work. All the experiments are performed on CACM and CISI benchmark data sets. The experimental results reveal that the performance of our proposed ranking function is much better than the fuzzy based ranking function developed by Rubens along with other widely used ranking function Okapi-BM25 in terms of precision, recall and F-measure.

[1]  Luca Viganò,et al.  Automated analysis of RBAC policies with temporal constraints and static role hierarchies , 2015, SAC.

[2]  Lotfi A. Zadeh,et al.  Toward a theory of fuzzy information granulation and its centrality in human reasoning and fuzzy logic , 1997, Fuzzy Sets Syst..

[3]  F. W. Lancaster,et al.  Information Retrieval Today , 1993 .

[4]  Ian H. Witten,et al.  Managing Gigabytes: Compressing and Indexing Documents and Images , 1999 .

[5]  Michael McGill,et al.  Introduction to Modern Information Retrieval , 1983 .

[6]  Abraham Kandel,et al.  Fuzzy inference and its applicability to control systems , 1992 .

[7]  K. Iyakutti,et al.  A Genetic Algorithm based on Cosine Similarity for Relevant Document Retrieval , 2013 .

[8]  Weiguo Fan,et al.  Effective information retrieval using genetic algorithms based matching functions adaptation , 2000, Proceedings of the 33rd Annual Hawaii International Conference on System Sciences.

[9]  Donna K. Harman,et al.  Overview of the First Text REtrieval Conference (TREC-1) , 1992, TREC.

[10]  Michio Sugeno,et al.  Industrial Applications of Fuzzy Control , 1985 .

[11]  Lotfi A. Zadeh,et al.  Fuzzy Sets , 1996, Inf. Control..

[12]  Ian H. Witten,et al.  Managing gigabytes (2nd ed.): compressing and indexing documents and images , 1999 .

[13]  Martti Juhola,et al.  On principal component analysis, cosine and Euclidean measures in information retrieval , 2007, Inf. Sci..

[14]  Bahgat A. Abdel Latef,et al.  Using Genetic Algorithm to Improve Information Retrieval Systems , 2008 .

[15]  Gerard Salton,et al.  Term-Weighting Approaches in Automatic Text Retrieval , 1988, Inf. Process. Manag..

[16]  Noureddine Mouaddib,et al.  A fuzzy information retrieval and management system and its applications , 1996, SAC '96.

[17]  Gerard Salton,et al.  Document Length Normalization , 1995, Inf. Process. Manag..

[18]  Shi-Jay Chen,et al.  Fuzzy Information Retrieval Based On A New Similarity Measure Of Generalized Fuzzy Numbers , 2011, Intell. Autom. Soft Comput..

[19]  Oscar Cordón,et al.  Fuzzy logic and multiobjective evolutionary algorithms as soft computing tools for persistent query learning in text retrieval environments , 2004, 2004 IEEE International Conference on Fuzzy Systems (IEEE Cat. No.04CH37542).

[20]  Karen Spärck Jones A statistical interpretation of term specificity and its application in retrieval , 2021, J. Documentation.

[21]  Shyi-Ming Chen,et al.  Fuzzy risk analysis based on similarity measures of generalized fuzzy numbers , 2003, IEEE Trans. Fuzzy Syst..

[22]  Shuaiqiang Wang,et al.  An immune programming-based ranking function discovery approach for effective information retrieval , 2010, Expert Syst. Appl..

[23]  Oscar Cordón,et al.  A review on the application of evolutionary computation to information retrieval , 2003, Int. J. Approx. Reason..

[24]  Weiguo Fan,et al.  A generic ranking function discovery framework by genetic programming for information retrieval , 2004, Inf. Process. Manag..

[25]  Michel Beigbeder,et al.  Fuzzy Proximity Ranking with Boolean Queries , 2005, TREC.

[26]  Ebrahim H. Mamdani,et al.  An Experiment in Linguistic Synthesis with a Fuzzy Logic Controller , 1999, Int. J. Hum. Comput. Stud..

[27]  Ding-An Chiang,et al.  Fuzzy information in extended fuzzy relational databases , 1997, Fuzzy Sets Syst..

[28]  Kui Wu,et al.  A soft relevance framework in content-based image retrieval systems , 2005, IEEE Transactions on Circuits and Systems for Video Technology.

[29]  Hichem Frigui,et al.  Interactive image retrieval using fuzzy sets , 2001, Pattern Recognit. Lett..

[30]  Endre Pap,et al.  Multicriteria-multistages linguistic evaluation and ranking of machine tools , 1999, Fuzzy Sets Syst..

[31]  Michio Sugeno,et al.  An introductory survey of fuzzy control , 1985, Inf. Sci..

[32]  Volkmar H. Haase,et al.  Access to Knowledge: Better Use of the Internet , 2002 .

[33]  T. Ross Fuzzy Logic with Engineering Applications , 1994 .

[34]  Chuen-Tsai Sun,et al.  Neuro-fuzzy And Soft Computing: A Computational Approach To Learning And Machine Intelligence [Books in Brief] , 1997, IEEE Transactions on Neural Networks.

[35]  Gerard Salton,et al.  Automatic Text Processing: The Transformation, Analysis, and Retrieval of Information by Computer , 1989 .

[36]  Neil Rubens The Application of Fuzzy Logic to the Construction of the Ranking Function of Information Retrieval Systems , 2006, ArXiv.

[37]  Emanuele Della Valle,et al.  An Introduction to Information Retrieval , 2013 .

[38]  G. Furnas,et al.  Pictures of relevance: a geometric analysis of similarity measures , 1987 .

[39]  Chuen-Chien Lee FUZZY LOGIC CONTROL SYSTEMS: FUZZY LOGIC CONTROLLER - PART I , 1990 .

[40]  Hinrich Schütze,et al.  Introduction to information retrieval , 2008 .

[41]  M. de Rijke,et al.  Result diversification based on query-specific cluster ranking , 2011, J. Assoc. Inf. Sci. Technol..

[42]  Shyi-Ming Chen,et al.  Document retrieval using fuzzy-valued concept networks , 2001, IEEE Trans. Syst. Man Cybern. Part B.

[43]  Stephen E. Robertson The probabilistic character of relevance , 1977, Inf. Process. Manag..

[44]  Wei-Pang Yang,et al.  Learning to Rank for Information Retrieval Using Genetic Programming , 2007 .

[45]  Karen Sparck Jones A statistical interpretation of term specificity and its application in retrieval , 1972 .

[46]  P. S. Joshi,et al.  Wireless Speed Control Of An Induction Motor Using PWM Technique With GSM , 2013 .