Indonesian medical retrieval case based on knowledge association rule similarity

In Case Based Reasoning (CBR), retrieval phase is one of the important phases in view of the dependence of the overall effectiveness of the CBR system on the case retrieval stage. To run the process of finding a new case, CBR systems typically utilize knowledge similarity called Similarity-Based Reasoning (SBR) in which the knowledge encoded in the form of term measures is used to calculate the similarity between a new case with the old one. In this paper, we attempt to build a new concept of similarity text for retrieval case on Indonesian medical sentences. The method to be used is the knowledge base of similarity in which the stage included: (i) the utilization of the knowledge for decision-case association, and (ii) the association extraction of knowledge by creating a rule based on attribute data to generate a subset of cases and solutions in a number of cases. The test results then showed the highest values found in the case of the active sentence form at 88.18% for precision, 88.23% for recall and 89.12% for F1-Measure.

[1]  Isabelle Guyon,et al.  An Introduction to Variable and Feature Selection , 2003, J. Mach. Learn. Res..

[2]  Gina-Anne Levow,et al.  Term representation with Generalized Latent Semantic Analysis , 2007 .

[3]  William E. Winkler,et al.  String Comparator Metrics and Enhanced Decision Rules in the Fellegi-Sunter Model of Record Linkage. , 1990 .

[4]  Diana Inkpen,et al.  Semantic text similarity using corpus-based word similarity and string similarity , 2008, ACM Trans. Knowl. Discov. Data.

[5]  Paul M. B. Vitányi,et al.  The Google Similarity Distance , 2004, IEEE Transactions on Knowledge and Data Engineering.

[6]  Se-Hak Chun,et al.  New knowledge extraction technique using probability for case‐based reasoning: application to medical diagnosis , 2006, Expert Syst. J. Knowl. Eng..

[7]  T. Landauer,et al.  A Solution to Plato's Problem: The Latent Semantic Analysis Theory of Acquisition, Induction, and Representation of Knowledge. , 1997 .

[8]  Peter D. Turney Mining the Web for Synonyms: PMI-IR versus LSA on TOEFL , 2001, ECML.

[9]  Barry Smyth,et al.  Personalized information ordering: a case study in online recruitment , 2003, Knowl. Based Syst..

[10]  R. A. M. O N L O P E Z D E M A N T A R A S,et al.  Retrieval, reuse, revision and retention in case-based reasoning , 2006 .

[11]  Armin Stahl,et al.  Learning of knowledge-intensive similarity measures in case-based reasoning , 2004 .

[12]  Alberto Barrón-Cedeño,et al.  Plagiarism Detection across Distant Language Pairs , 2010, COLING.

[13]  Evgeniy Gabrilovich,et al.  Computing Semantic Relatedness Using Wikipedia-based Explicit Semantic Analysis , 2007, IJCAI.

[14]  Diana Inkpen,et al.  Second Order Co-occurrence PMI for Determining the Semantic Similarity of Words , 2006, LREC.

[15]  Agnar Aamodt,et al.  CASE-BASED REASONING: FOUNDATIONAL ISSUES, METHODOLOGICAL VARIATIONS, AND SYSTEM APPROACHES AICOM - ARTIFICIAL INTELLIGENCE COMMUNICATIONS , 1994 .

[16]  Curt Burgess,et al.  Producing high-dimensional semantic spaces from lexical co-occurrence , 1996 .

[17]  Roberto J. Bayardo,et al.  Mining the most interesting rules , 1999, KDD '99.

[18]  Siddharth Patwardhan,et al.  Incorporating Dictionary and Corpus Information into a Context Vector Measure of Semantic Relatednes , 2003 .

[19]  M S Waterman,et al.  Identification of common molecular subsequences. , 1981, Journal of molecular biology.

[20]  Ted Pedersen,et al.  An Adapted Lesk Algorithm for Word Sense Disambiguation Using WordNet , 2002, CICLing.

[21]  Barry Smyth,et al.  Adaptation-Guided Retrieval: Questioning the Similarity Assumption in Reasoning , 1998, Artif. Intell..

[22]  Benno Stein,et al.  A Wikipedia-Based Multilingual Retrieval Model , 2008, ECIR.

[23]  Juan Luis Castro,et al.  Loss and gain functions for CBR retrieval , 2009, Inf. Sci..

[24]  Kyoung-jae Kim,et al.  Global optimization of case-based reasoning for breast cytology diagnosis , 2009, Expert Syst. Appl..

[25]  Shonali Krishnaswamy,et al.  Retrieval in CBR Using a Combination of Similarity and Association Knowledge , 2011, ADMA.

[26]  Graham A Stephen,et al.  Approximate String Matching , 1994, Encyclopedia of Algorithms.

[27]  Matthew A. Jaro,et al.  Probabilistic linkage of large public health data files. , 1995, Statistics in medicine.

[28]  Matthew A. Jaro,et al.  Advances in Record-Linkage Methodology as Applied to Matching the 1985 Census of Tampa, Florida , 1989 .

[29]  Peter Kolb,et al.  Experiments on the difference between semantic similarity and relatedness , 2009, NODALIDA.

[30]  Li Ning,et al.  Using Information Content to Evaluate Semantic Similarity on HowNet , 2012, CIS 2012.

[31]  S. B. Needleman,et al.  A general method applicable to the search for similarities in the amino acid sequence of two proteins. , 1970, Journal of molecular biology.