Improving Domain Searches through Customized Search Engines

Search engines are ubiquitous tools for seeking information from the Internet and, as such, have become an integral part of our information society. New search engines that combine ideas from separate search engines generally outperform the search engines from which they took ideas. Designers, however, may not be aware of the work of other search engine developers or such work may not be available in modules that can be incorporated into another search engine. This research presents an interoperability architecture for building customized search engines. Existing search engines are analyzed and decomposed into self-contained components that are classified into six categories. A prototype, called the Automated Software Development Environment for Information Retrieval, was developed to implement the interoperability architecture, and an assessment of its feasibility was carried out. The prototype resolves conflicts between components of separate search engines and demonstrates how design features across search engines can be integrated.

[1]  Craig Silverstein,et al.  Analysis of a Very Large Altavista Query Log" SRC Technical note #1998-14 , 1998 .

[2]  Mathias Lux,et al.  Bag of visual words revisited: an exploratory study on robust image retrieval exploiting fuzzy codebooks , 2010, MDMKDD '10.

[3]  Edward A. Fox,et al.  Automatic query formulations in information retrieval , 1983, J. Am. Soc. Inf. Sci..

[4]  Feng-Chia Li,et al.  Comparison of the Hybrid Credit Scoring Models Based on Various Classifiers , 2010, Int. J. Intell. Inf. Technol..

[5]  Peter Bruza,et al.  Interactive Internet search: keyword, directory and query reformulation mechanisms compared , 2000, SIGIR '00.

[6]  Takenobu Tokunaga,et al.  Combining multiple evidence from different types of thesaurus for query expansion , 1999, SIGIR '99.

[7]  William P. Birmingham,et al.  Architecture of a metasearch engine that supports user information needs , 1999, CIKM '99.

[8]  Salvatore T. March,et al.  Ontological Foundations for Active Information Systems , 2007, Int. J. Intell. Inf. Technol..

[9]  Jane Greenberg,et al.  Automatic query expansion via lexical-semantic relationships , 2001, J. Assoc. Inf. Sci. Technol..

[10]  Emi Ishita,et al.  A search engine for Japanese academic papers , 2010, JCDL '10.

[11]  Ronaldo dos Santos Mello,et al.  A Bottom-Up Approach for Integration of XML Sources , 2001, Workshop on Information Integration on the Web.

[12]  Yuxin Mao,et al.  A Semantic-Based Search Engine for Traditional Medical Informatics , 2009, 2009 Fourth International Conference on Computer Sciences and Convergence Information Technology.

[13]  Stephen E. Robertson,et al.  Okapi at TREC-3 , 1994, TREC.

[14]  Monika Henzinger,et al.  Analysis of a very large web search engine query log , 1999, SIGF.

[15]  Andreas Nürnberger,et al.  multi Searcher: can we support people to get information from text they can't read or understand? , 2010, SIGIR '10.

[16]  Michael Brinkmeier,et al.  PageRank revisited , 2006, TOIT.

[17]  Rob C. van Ommering Building product populations with software components , 2002, ICSE '02.

[18]  Luis Gravano,et al.  Learning search engine specific query transformations for question answering , 2001, WWW '01.

[19]  C. J. Prabhakar Analysis of Face Space for Recognition using Interval-Valued Subspace Technique , 2012 .

[20]  Mark Hansen,et al.  Using navigation data to improve IR functions in the context of web search , 2001, CIKM '01.

[21]  Adele E. Howe,et al.  Experiences with selecting search engines using metasearch , 1997, TOIS.

[22]  Erik Cuevas,et al.  Corner Detection Using Fuzzy Principles , 2013 .

[23]  Gerard Salton,et al.  Improving Retrieval Performance by Relevance Feedback , 1997 .

[24]  Sergey Brin,et al.  The Anatomy of a Large-Scale Hypertextual Web Search Engine , 1998, Comput. Networks.

[25]  Ryan Scherle,et al.  Towards context-based search engine selection , 2001, IUI '01.

[26]  Roi Blanco,et al.  Entity summarization of news articles , 2010, SIGIR.

[27]  Andreas Rauber,et al.  Uncovering Associations Between Documents , 2007 .

[28]  Ee-Peng Lim,et al.  On improving wikipedia search using article quality , 2007, WIDM '07.

[29]  Joon Ho Lee,et al.  Combining multiple evidence from different properties of weighting schemes , 1995, SIGIR '95.

[30]  P. Thangaraj,et al.  A Modified Watershed Segmentation Method to Segment Renal Calculi in Ultrasound Kidney Images , 2012, Int. J. Intell. Inf. Technol..

[31]  Philippe Lalanda,et al.  A Domain-Specific Software Architecture for Adaptive Intelligent Systems , 1995, IEEE Trans. Software Eng..

[32]  Hussein Suleman,et al.  A digital library component assembly environment , 2004 .

[33]  King-Lup Liu,et al.  Building efficient and effective metasearch engines , 2002, CSUR.

[34]  Mark Magennis,et al.  The potential and actual effectiveness of interactive query expansion , 1997, SIGIR '97.

[35]  Mounia Lalmas,et al.  Merging techniques for performing data fusion on the web , 2001, CIKM '01.

[36]  James Ze Wang,et al.  Automatic Linguistic Indexing of Pictures by a Statistical Modeling Approach , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[37]  Zibin Zheng,et al.  WSExpress: A QoS-aware Search Engine for Web Services , 2010, 2010 IEEE International Conference on Web Services.

[38]  V. Sugumaran The Inaugural Issue of the International Journal of Intelligent Information Technologies , 2005 .

[39]  Ankush Mittal,et al.  Bayesian Network Technologies: Applications and Graphical Models , 2007 .

[40]  W. Bruce Croft,et al.  Corpus-based stemming using cooccurrence of word variants , 1998, TOIS.

[41]  Juan R. Rabuñal,et al.  Encyclopedia of Artificial Intelligence (3 Volumes) , 2009, Encyclopedia of Artificial Intelligence.

[42]  Marc Alexa,et al.  Sketch-based 3D shape retrieval , 2010, SIGGRAPH '10.

[43]  Efthimis N. Efthimiadis,et al.  UCLA-Okapi at TREC-2: Query Expansion Experiments , 1993, TREC.

[44]  Kiyoaki Shirai,et al.  Machine Learning Approaches for Mood Classification of Songs toward Music Search Engine , 2009, 2009 International Conference on Knowledge and Systems Engineering.

[45]  Hsinchun Chen Machine learning for information retrieval: neural networks, symbolic learning, and genetic algorithms , 1995 .

[46]  Azadeh Shakery,et al.  DirichletRank: Solving the zero-one gap problem of PageRank , 2008, TOIS.

[47]  Richard N. Taylor,et al.  A comprehensive approach for the development of modular software architecture description languages , 2005, TSEM.

[48]  Garrison W. Cottrell,et al.  Automatic combination of multiple ranked retrieval systems , 1994, SIGIR '94.

[49]  T. V. Prabhakar,et al.  KhojYantra: an integrated MetaSearch engine with classification, clustering and ranking , 2000, Proceedings 2000 International Database Engineering and Applications Symposium (Cat. No.PR00789).

[50]  Ian H. Witten,et al.  The New Zealand Digital Library MELody inDEX , 1997, D Lib Mag..

[51]  Meredith Ringel Morris,et al.  CoSearch: a system for co-located collaborative web search , 2008, CHI.

[52]  Volker Wulf,et al.  Component-based technologies for end-user development , 2004, Commun. ACM.

[53]  Helen R. Tibbo,et al.  The Cystic Fibrosis Database: Content and Research Opportunities. , 1991 .

[54]  Eila Niemelä,et al.  Dependency-aware Service Oriented Architecture and Service Composition , 2007, IEEE International Conference on Web Services (ICWS 2007).

[55]  W. Bruce Croft,et al.  The INQUERY Retrieval System , 1992, DEXA.

[56]  Antonio Bucchiarone,et al.  Towards an architectural approach for the dynamic and automatic composition of software components , 2006, ROSATEA '06.

[57]  Ofer Melnik,et al.  Concave Learners for Rankboost , 2007, J. Mach. Learn. Res..

[58]  Wanda Pratt,et al.  Transparent Queries: investigation users' mental models of search engines , 2001, SIGIR '01.

[59]  Sriram Raghavan,et al.  Searching the Web , 2001, ACM Trans. Internet Techn..

[60]  Tagelsir Mohamed Gasmelseid,et al.  Sociomateriality Implications of Multi-Agent Supported Collaborative Work Systems , 2012, Int. J. Intell. Inf. Technol..

[61]  Zongyuan Yang,et al.  A basic model for components implementation of software architecture , 2004, SOEN.

[62]  Amit Singhal,et al.  AT&T at TREC-6: SDR Track , 1997, TREC.

[63]  Jian Pei,et al.  Search and browse log mining for web information retrieval: challenges, methods, and applications , 2010, SIGIR.

[64]  Brad A. Myers,et al.  What to do when search fails: finding information by association , 2008, CHI.

[65]  Xiaotao Huang,et al.  A Relation-Based Search Engine in Semantic Web , 2007, IEEE Transactions on Knowledge and Data Engineering.

[66]  Jose Santos,et al.  Online Remote Control of a Wireless Home Automation Network , 2009, Int. J. Ambient Comput. Intell..

[67]  Ian Ruthven,et al.  Re-examining the potential effectiveness of interactive query expansion , 2003, SIGIR.

[68]  Amanda Spink,et al.  Real life information retrieval: a study of user queries on the Web , 1998, SIGF.

[69]  Evgeniy Gabrilovich,et al.  The anatomy of an ad: structured indexing and retrieval for sponsored search , 2010, WWW '10.

[70]  David Lo Multimodal Human Localization Using Bayesian Network Sensor Fusion , 2007 .

[71]  James P. Callan,et al.  Training algorithms for linear text classifiers , 1996, SIGIR '96.

[72]  Mike P. Papazoglou,et al.  Service oriented architectures: approaches, technologies and research issues , 2007, The VLDB Journal.

[73]  Binhai Zhu,et al.  Some Formal Analysis of Roccio's Similarity-Based Relvance Feedback Algorithm , 2000, ISAAC.

[74]  Richard N. Taylor,et al.  A Classification and Comparison Framework for Software Architecture Description Languages , 2000, IEEE Trans. Software Eng..

[75]  Kenneth Wai-Ting Leung,et al.  Personalized Concept-Based Clustering of Search Engine Queries , 2008, IEEE Transactions on Knowledge and Data Engineering.

[76]  M. F. Porter,et al.  An algorithm for suffix stripping , 1997 .

[77]  Mike Thelwall,et al.  Blog search engines , 2007, Online Inf. Rev..

[78]  Sally Jo Cunningham,et al.  A user-centered design of a personal digital library for music exploration , 2010, JCDL '10.

[79]  Clement T. Yu,et al.  A highly scalable and effective method for metasearch , 2001, TOIS.

[80]  Vijay Kumar Mago,et al.  Cross-Disciplinary Applications of Artificial Intelligence and Pattern Recognition: Advancing Technologies , 2011 .

[81]  Richard McCreadie Leveraging user-generated content for news search , 2010, SIGIR '10.