Similar Cases Retrieval from the Database of Laboratory Test Results

We propose a method for retrieving similar cases from a database of laboratory test results, whose data are mainly numerical and ordinal. We transformed the raw data into ordinal ranks and then into new scores lying between 0 and 1, and calculated Mahalanobis distances on these scores as the similarity measure. We first evaluated the method on 3000 cases of blood count data. For 100 sample cases, 95% of the 20 most similar cases retrieved by our method were also among those retrieved by the reference criterion (Mahalanobis distances calculated from the raw data). Next, we applied our method to data relevant to thyroid diseases. For each of 96 sample cases, the 10 most similar cases were retrieved from 1655 cases. The retrieved diagnoses were consistent with those of the sample cases in 32.4% of instances. When Euclidean distance was used instead, the figure fell to 27.7%. These results suggest that our method is suitable for identifying similar cases in complicated laboratory test data.
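The pipeline described above (rank transform, rescaling to scores between 0 and 1, then Mahalanobis distance) can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the abstract does not specify how ranks are mapped to scores or how ties are handled, so the sketch assumes average ranks for ties and the mapping rank / (n + 1). The function names `rank_scores` and `most_similar` are hypothetical.

```python
import numpy as np


def rank_scores(data):
    """Convert each column to ordinal ranks, then to scores in (0, 1).

    Assumption (not stated in the source): tied values share their
    average rank, and the score is rank / (n + 1).
    """
    data = np.asarray(data, dtype=float)
    n, d = data.shape
    scores = np.empty_like(data)
    for j in range(d):
        col = data[:, j]
        order = np.argsort(col, kind="mergesort")
        ranks = np.empty(n)
        ranks[order] = np.arange(1, n + 1)
        # Replace the ranks of tied values by their average rank.
        for value in np.unique(col):
            mask = col == value
            ranks[mask] = ranks[mask].mean()
        scores[:, j] = ranks / (n + 1)
    return scores


def most_similar(scores, query_index, k):
    """Return indices of the k cases closest to the query case.

    Closeness is the squared Mahalanobis distance computed on the
    rank-based scores; squaring does not change the ordering.
    """
    cov = np.atleast_2d(np.cov(scores, rowvar=False))
    # Pseudo-inverse guards against a singular covariance matrix.
    inv_cov = np.linalg.pinv(cov)
    diff = scores - scores[query_index]
    d2 = np.einsum("ij,jk,ik->i", diff, inv_cov, diff)
    d2[query_index] = np.inf  # exclude the query case itself
    return np.argsort(d2)[:k]
```

With a case-by-test matrix `data`, calling `most_similar(rank_scores(data), query_index=0, k=10)` would return the indices of the 10 cases nearest to case 0. Computing the covariance on the scores rather than on the raw values is what distinguishes this from the raw-data criterion mentioned in the abstract.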
