Clinical Data-Driven Probabilistic Graph Processing

Electronic Medical Records (EMRs) encode an extraordinary amount of medical knowledge. Collecting and interpreting this knowledge, however, belies a significant level of clinical understanding. Automatically capturing the clinical information is crucial for performing comparative effectiveness research. In this paper, we present a data-driven approach to model semantic dependencies between medical concepts, qualified by the beliefs of physicians. The dependencies, captured in a patient cohort graph of clinical pictures and therapies is further refined into a probabilistic graphical model which enables efficient inference of patient-centered treatment or test recommendations (based on probabilities). To perform inference on the graphical model, we describe a technique of smoothing the conditional likelihood of medical concepts by their semantically-similar belief values. The experimental results, as compared against clinical guidelines are very promising.

[1]  R. Cicerone Initial National Priorities for Comparative Effectiveness Research , 2009 .

[2]  Shuying Shen,et al.  2010 i2b2/VA challenge on concepts, assertions, and relations in clinical text , 2011, J. Am. Medical Informatics Assoc..

[3]  Alan R. Aronson,et al.  Effective mapping of biomedical text to the UMLS Metathesaurus: the MetaMap program , 2001, AMIA.

[4]  Werner Ceusters,et al.  Toward an Ontological Treatment of Disease and Diagnosis , 2009, Summit on translational bioinformatics.

[5]  Sanda M. Harabagiu,et al.  A flexible framework for deriving assertions from electronic medical records , 2011, J. Am. Medical Informatics Assoc..

[6]  William E. Grieb The general inquirer: A computer approach to content analysis: Philip J. Stone, Dexter C. Dunphy, Marshall S. Smith, Daniel M. Ogilvie, with associates. The MIT Press, Cambridge, Massachusetts, 1966. 651 pp. plus xx , 1968 .

[7]  Marshall S. Smith,et al.  The general inquirer: A computer approach to content analysis. , 1967 .

[8]  C. W. Howard,et al.  Clinical practice guidelines for the management of orbital cellulitis. , 1998, Journal of pediatric ophthalmology and strabismus.

[9]  Philip J. Stone,et al.  Extracting Information. (Book Reviews: The General Inquirer. A Computer Approach to Content Analysis) , 1967 .

[10]  Olivier Bodenreider,et al.  The Unified Medical Language System (UMLS): integrating biomedical terminology , 2004, Nucleic Acids Res..

[11]  Patchen Dellinger,et al.  Practice guidelines for the diagnosis and management of skin and soft-tissue infections. , 2005, Clinical infectious diseases : an official publication of the Infectious Diseases Society of America.

[12]  J E Backus,et al.  MEDLINEplus: building and maintaining the National Library of Medicine's consumer health Web service. , 2000, Bulletin of the Medical Library Association.

[13]  Nir Friedman,et al.  Probabilistic Graphical Models - Principles and Techniques , 2009 .

[14]  Ellen M. Voorhees,et al.  Overview of the TREC 2012 Medical Records Track , 2012, TREC.

[15]  N. Jones,et al.  Guidelines for the management of periorbital cellulitis/abscess. , 2004, Clinical otolaryngology and allied sciences.

[16]  Sanda M. Harabagiu,et al.  The Impact of Belief Values on the Identification of Patient Cohorts , 2013, CLEF.

[17]  Sunghwan Sohn,et al.  Mayo clinical Text Analysis and Knowledge Extraction System (cTAKES): architecture, component evaluation and applications , 2010, J. Am. Medical Informatics Assoc..