CMedPort: Intelligent Searching for Chinese Medical Information

Most information retrieval techniques have been developed for English and other Western languages. As the second largest Internet language, Chinese provides a good setting for studying how search engine techniques developed for English can be generalized to other languages to facilitate Internet searching and browsing in a multilingual world. This paper reviews techniques used in search engines and proposes an integrated approach to developing a Chinese medical portal, CMedPort. The techniques integrated into CMedPort include meta-search, cross-regional search, summarization, and categorization. A user study was conducted to compare the effectiveness, efficiency, and user satisfaction of CMedPort with those of three major Chinese search engines. Preliminary results show that, compared with Openfind, a Taiwan search engine portal, CMedPort achieves similar accuracy in searching tasks and higher effectiveness and efficiency in browsing tasks. We believe that the proposed approach can be used to support Chinese information seeking in Web-based digital library applications.
