Differential Evolution-Based Fusion and Its Properties for Web Search

In recent years, data fusion has been applied to many different application areas such as neural networks, classification, multi-sensor systems, image processing, information retrieval, Web search among others. Linear combination is a popular data fusion method due to its flexibility. Proper weight assignment is a key issue for its success. In this paper, we apply the differential evolution optimization method to find suitable weights in the search space. Experiments are carried out with authoritative TREC data and we find it is a good method for the task and can improve fusion performance significantly than the best component results and other heuristic data fusion methods. Moreover, we have two findings. One finding is compared with other fusion methods, differential evolution based method performs better when more component search engines are involved in the fusion process. The second is a relatively large number of queries (e.g. over 100 queries) should be used as training data in order to obtain reliable weights.

[1]  Shengli Wu,et al.  Linear combination of component results in information retrieval , 2012, Data Knowl. Eng..

[2]  Shengli Wu,et al.  Assigning appropriate weights for the linear combination data fusion method in information retrieval , 2009, Inf. Process. Manag..

[3]  Mohamed Farah,et al.  An outranking approach for rank aggregation in information retrieval , 2007, SIGIR.

[4]  Lei Yang,et al.  Query log analysis of an electronic health record search engine. , 2011, AMIA ... Annual Symposium proceedings. AMIA Symposium.

[5]  R. Storn,et al.  On the usage of differential evolution for function optimization , 1996, Proceedings of North American Fuzzy Information Processing.

[6]  Donna Harman,et al.  The Second Text Retrieval Conference (TREC-2) , 1995, Inf. Process. Manag..

[7]  Georgios John Fakas A novel keyword search paradigm in relational databases: Object summaries , 2011, Data Knowl. Eng..

[8]  Javed A. Aslam,et al.  Models for metasearch , 2001, SIGIR '01.

[9]  G. T. Tsao,et al.  Fuzzy-Decision-Making Problems of Fuel Ethanol Production Using a Genetically Engineered Yeast , 1998 .

[10]  Edward A. Fox,et al.  Combination of Multiple Searches , 1993, TREC.

[11]  B. Achiriloaie,et al.  VI REFERENCES , 1961 .

[12]  Javed A. Aslam,et al.  Condorcet fusion for improved retrieval , 2002, CIKM '02.

[13]  Garrison W. Cottrell,et al.  Fusion Via a Linear Combination of Scores , 1999, Information Retrieval.

[14]  Stefan Janaqi,et al.  Generalization of the strategies in differential evolution , 2004, 18th International Parallel and Distributed Processing Symposium, 2004. Proceedings..

[15]  Feng-Sheng Wang,et al.  A mixed-coding scheme of evolutionary algorithms to solve mixed-integer nonlinear programming problems☆ , 2004 .

[16]  Moni Naor,et al.  Rank aggregation methods for the Web , 2001, WWW '01.

[17]  M. M. Ali,et al.  A numerical study of some modified differential evolution algorithms , 2006, Eur. J. Oper. Res..