PENGGUNAAN KAMUS SINONIM DAN HIPONIM SEBAGAI SUMBER EKSPANSI KUERI DALAM SISTEM TEMU KEMBALI INFORMASI BERBAHASA INDONESIA

In search for information, a user usually inputs a small number of words as a query. A problem of vocabulary mismatch is arisen when the system does not retrieve a document that contains relevant information but does not contain words in the query. Query expansion is used to increase the chance to retrieve potentially relevant documents by adding new words to the original query. In this work, we investigated the use of synonyms, hypernyms and holonyms as the source of query expansion in retrieving scientific papers in bahasa Indonesia. We collected papers in the field of communications science into a database and allowed it accessed through a web portal with a search facility. We used big dictionary of bahasa Indonesia (KBBI) and have appended scientific terms of communications science therein. During retrieval, the documents are sorted based on their similarity with the query words. We measured the number of retrieved documents and the relevance of documents in the top positions. Query expansion is proved to increase the number of retrieved documents by approximately 23 percent compared to retrieval without query expansion. However, the relevance of the documents on the top of the list does not change a lot.

[1]  W. Bruce Croft,et al.  Quary Expansion Using Local and Global Document Analysis , 1996, SIGIR Forum.

[2]  Dong Zhou,et al.  Improving search via personalized query expansion using social media , 2012, Information Retrieval.

[3]  Grace Hui Yang,et al.  Query change as relevance feedback in session search , 2013, SIGIR.

[4]  Michael E. Lesk,et al.  Relevance assessments and retrieval system evaluation , 1968, Inf. Storage Retr..

[5]  Pascal Poncelet,et al.  Enhancing passage retrieval in log files by query expansion based on explicit and pseudo relevance feedback , 2014, Comput. Ind..

[6]  M. Danesi Dictionary of media and communications , 2009 .

[7]  Chris Buckley,et al.  Improving automatic query expansion , 1998, SIGIR '98.

[8]  Chih-Ping Wei,et al.  Managing Word Mismatch Problems in Information Retrieval: A Topic-Based Query Expansion Approach , 2007, J. Manag. Inf. Syst..

[9]  Departemen Pendidikan Nasional,et al.  Tesaurus alfabetis bahasa Indonesia , 2009 .

[10]  Gerard Salton,et al.  Improving Retrieval Performance by Relevance Feedback , 1997 .

[11]  W. Bruce Croft,et al.  A Field Relevance Model for Structured Document Retrieval , 2012, ECIR.

[12]  Burkhard Schafer,et al.  Natural language processing and query expansion in legal information retrieval: Challenges and a response , 2010 .

[13]  Mandar Mitra,et al.  Query Expansion Using Term Distribution and Term Association , 2013, ArXiv.

[14]  Claudio Carpineto,et al.  A Survey of Automatic Query Expansion in Information Retrieval , 2012, CSUR.