Web Information Retrieval for Health Professionals

This paper presents a Web Information Retrieval System (WebIRS), which is designed to assist the healthcare professionals to obtain up-to-date medical knowledge and information via the World Wide Web (WWW). The system leverages the document classification and text summarization techniques to deliver the highly correlated medical information to the physicians. The system architecture of the proposed WebIRS is first discussed, and then a case study on an application of the proposed system in a Hong Kong medical organization is presented to illustrate the adoption process and a questionnaire is administrated to collect feedback on the operation and performance of WebIRS in comparison with conventional information retrieval in the WWW. A prototype system has been constructed and implemented on a trial basis in a medical organization. It has proven to be of benefit to healthcare professionals through its automatic functions in classification and summarizing the medical information that the physicians needed and interested. The results of the case study show that with the use of the proposed WebIRS, significant reduction of searching time and effort, with retrieval of highly relevant materials can be attained.

[1]  Vivian Cothey,et al.  Web-crawling reliability , 2004, J. Assoc. Inf. Sci. Technol..

[2]  Javed Mostafa,et al.  Automatic classification using supervised learning in a medical document filtering application , 2000, Inf. Process. Manag..

[3]  E. Higginbotham,et al.  Minority primary care physicians' knowledge, attitudes, and practices on eye health and preferred sources of information. , 2009, Journal of the National Medical Association.

[4]  W. B. Lee,et al.  Data Mining in Biomedicine: Current Applications and Further Directions for Research , 2009, J. Softw. Eng. Appl..

[5]  Min Song,et al.  Handbook of Research on Text and Web Mining Technologies , 2008 .

[6]  Ted Pedersen,et al.  Using WordNet-based Context Vectors to Estimate the Semantic Relatedness of Concepts , 2006 .

[7]  Andrew McCallum,et al.  A comparison of event models for naive bayes text classification , 1998, AAAI 1998.

[8]  Yuefeng Li,et al.  A knowledge-based model using ontologies for personalized web information gathering , 2010, Web Intell. Agent Syst..

[9]  Shourya Roy,et al.  Fast and accurate text classification via multiple linear discriminant projections , 2003, The VLDB Journal.

[10]  Han Tong Loh,et al.  Gather customer concerns from online product reviews - A text summarization approach , 2009, Expert Syst. Appl..

[11]  Chaochang Chiu,et al.  Applying Text Mining to Assist People Who Inquire HIV/AIDS Information from Internet , 2008, ISI Workshops.

[12]  Michael W. Berry,et al.  Survey of Text Mining: Clustering, Classification, and Retrieval , 2007 .

[13]  Chew Lim Tan,et al.  A comprehensive comparative study on term weighting schemes for text categorization with support vector machines , 2005, WWW '05.

[14]  Roger G. Stone,et al.  Naive Bayes vs. Decision Trees vs. Neural Networks in the Classification of Training Web Pages , 2009 .

[15]  Houkuan Huang,et al.  Feature selection for text classification with Naïve Bayes , 2009, Expert Syst. Appl..

[16]  Michael Wooldridge,et al.  Agent technology: foundations, applications, and markets , 1998 .

[17]  Wenqian Shang,et al.  A novel feature selection algorithm for text categorization , 2007, Expert Syst. Appl..

[18]  Marc Najork,et al.  Web Crawling , 2010, Found. Trends Inf. Retr..

[19]  Man Lan,et al.  A comparative study on term weighting schemes for text categorization , 2005, Proceedings. 2005 IEEE International Joint Conference on Neural Networks, 2005..

[20]  V. Palade,et al.  Adaptive Web Sites - A Knowledge Extraction from Web Data Approach , 2008, Frontiers in Artificial Intelligence and Applications.

[21]  W. B. Lee,et al.  Experiences Sharing of Implementing Template-Based Electronic Medical Record System (TEMRS) in a Hong Kong Medical Organization , 2011, Journal of Medical Systems.

[22]  Steven Walczak A Multiagent Architecture for Developing Medical Information Retrieval Agents , 2004, Journal of Medical Systems.

[23]  Robert Kristofco,et al.  Physician internet medical information seeking and on‐line continuing education use patterns , 2002, The Journal of continuing education in the health professions.

[24]  Anil Sethi,et al.  Matching records in a national medical patient index , 2001, CACM.

[25]  Athar Sheikh,et al.  A Quantitative Assessment of Changing Trends in Internet Usage for Cancer Information , 2011, World Journal of Surgery.

[26]  Ivar Jacobson,et al.  The Unified Software Development Process , 1999 .

[27]  S. Kalichman,et al.  Internet access and Internet use for health information among people living with HIV-AIDS. , 2002, Patient education and counseling.

[28]  Wen-Hsiang Lu,et al.  Using Web resources to construct multilingual medical thesaurus for cross-language medical information retrieval , 2008, Decis. Support Syst..

[29]  Michael W. Berry,et al.  Survey of Text Mining , 2003, Springer New York.

[30]  Hércules Antonio do Prado,et al.  Emerging Technologies of Text Mining: Techniques and Applications , 2007 .

[31]  Akiko Aizawa,et al.  An information-theoretic perspective of tf-idf measures , 2003, Inf. Process. Manag..

[32]  Hinrich Schütze,et al.  Introduction to information retrieval , 2008 .

[33]  Gerard Salton,et al.  Automatic Text Processing: The Transformation, Analysis, and Retrieval of Information by Computer , 1989 .

[34]  Michele L. Ybarra,et al.  Help seeking behavior and the Internet: A national survey , 2006, Int. J. Medical Informatics.

[35]  Dino Isa,et al.  Using the self organizing map for clustering of text documents , 2009, Expert Syst. Appl..

[36]  Weiguo Fan,et al.  Tapping the power of text mining , 2006, CACM.

[37]  S. Rogers,et al.  Internet use among head and neck cancer survivors in the North West of England. , 2012, The British journal of oral & maxillofacial surgery.

[38]  Yiming Yang,et al.  An example-based mapping method for text categorization and retrieval , 1994, TOIS.

[39]  Elena Lloret,et al.  Text summarization contribution to semantic question answering: New approaches for finding answers on the web , 2011, Int. J. Intell. Syst..

[40]  Andreas Holzinger,et al.  Semantic Information in Medical Information Systems: Utilization of Text Mining Techniques to Analyze Medical Diagnoses , 2008, J. Univers. Comput. Sci..

[41]  Christopher S. G. Khoo,et al.  Automatic multidocument summarization of research abstracts: Design and user evaluation , 2007, J. Assoc. Inf. Sci. Technol..

[42]  Dunja Mladenic,et al.  Feature selection on hierarchy of web documents , 2003, Decis. Support Syst..

[43]  Marino Segnan Web Data Management Practices - Emerging Techniques and Technologies , 2007, Comput. J..

[44]  Susan D Scott,et al.  Nurses and Internet health information: a questionnaire survey. , 2008, Journal of advanced nursing.

[45]  Min-Yen Kan,et al.  Customization in a unified framework for summarizing medical literature , 2005, Artif. Intell. Medicine.

[46]  Thomas J. Housel,et al.  Measuring and Managing Knowledge , 2001 .

[47]  Dragomir R. Radev,et al.  Centroid-based summarization of multiple documents , 2004, Inf. Process. Manag..

[48]  Shui-Shun Lin A document classification and retrieval system for R&D in semiconductor industry - A hybrid approach , 2009, Expert Syst. Appl..

[49]  Jiawei Han,et al.  Data Mining: Concepts and Techniques , 2000 .

[50]  Kam-Fai Wong,et al.  Interpreting TF-IDF term weights as making relevance decisions , 2008, TOIS.

[51]  Zhenyu Liu,et al.  Knowledge-based query expansion to support scenario-specific retrieval of medical free text , 2005, SAC '05.

[52]  Denise M. Oleske,et al.  Medical Information and the Internet: Do You Know What You Are Getting? , 2002, Journal of Medical Systems.

[53]  David D. Lewis,et al.  Naive (Bayes) at Forty: The Independence Assumption in Information Retrieval , 1998, ECML.