Towards Developing an Intelligent Agent to Assist in Patient Diagnosis Using Neural Networks on Unstructured Patient Clinical Notes: Initial Analysis and Models

Abstract Advances in information and communication technologies across the health ecosystem have made it possible to record and use massive amounts of structured and unstructured health data. In developing countries, Electronic Medical Records (EMRs) are needed to support efficient service delivery and informed decision-making, especially at the local level, where health facilities and practitioners may be scarce. Text mining is a variant of data mining that seeks to extract non-trivial information and knowledge from unstructured text. This study examines the feasibility of integrating an intelligent agent into EMRs to predict diagnoses automatically from unstructured clinical notes. A multilayer feed-forward neural network trained with backpropagation was implemented for classification. The two neural network models predicted hypertension against similar diagnoses with percent errors of 11.52% and 10.53%, but with percent errors of 54.01% and 64.82% when applied to a group of similar diagnoses. Further development is needed to predict diagnoses that share common symptoms and related diagnoses. The results nonetheless indicate that unstructured data holds value for clinical decision support. Analyzing it together with structured data may lead to a more accurate intelligent agent.
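As a rough illustration of the classification approach named in the abstract, the sketch below trains a small feed-forward network with backpropagation on bag-of-words vectors and reports a percent error. It is not the study's implementation: the example notes, the labels, the binary bag-of-words encoding, the layer size, the learning rate, and all function and class names are assumptions made only for this example.

```python
import math
import random

# Hypothetical toy notes (NOT from the study): 1 = hypertension, 0 = other.
NOTES = [
    ("elevated blood pressure headache dizziness", 1),
    ("high blood pressure nape pain", 1),
    ("cough fever sore throat", 0),
    ("productive cough colds fever", 0),
]

def tokenize(note):
    return note.lower().split()

def build_vocab(notes):
    """Map each distinct word to a column index."""
    words = sorted({w for note in notes for w in tokenize(note)})
    return {w: i for i, w in enumerate(words)}

def vectorize(note, vocab):
    """Binary bag-of-words vector (an assumed encoding)."""
    vec = [0.0] * len(vocab)
    for w in tokenize(note):
        if w in vocab:
            vec[vocab[w]] = 1.0
    return vec

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def percent_error(labels, predictions):
    """Share of misclassified examples, in percent."""
    wrong = sum(1 for t, p in zip(labels, predictions) if t != p)
    return 100.0 * wrong / len(labels)

class FeedForwardNet:
    """One hidden layer, sigmoid units, single sigmoid output."""

    def __init__(self, n_in, n_hidden, seed=0):
        rng = random.Random(seed)
        self.w1 = [[rng.uniform(-0.5, 0.5) for _ in range(n_in)]
                   for _ in range(n_hidden)]
        self.b1 = [0.0] * n_hidden
        self.w2 = [rng.uniform(-0.5, 0.5) for _ in range(n_hidden)]
        self.b2 = 0.0

    def forward(self, x):
        h = [sigmoid(sum(w * xi for w, xi in zip(row, x)) + b)
             for row, b in zip(self.w1, self.b1)]
        y = sigmoid(sum(w * hi for w, hi in zip(self.w2, h)) + self.b2)
        return h, y

    def train_step(self, x, target, lr):
        """One backpropagation update; returns the cross-entropy loss."""
        h, y = self.forward(x)
        # Sigmoid output with cross-entropy loss: output error is y - target.
        delta_out = y - target
        # Propagate the error back through the (not yet updated) output weights.
        delta_h = [delta_out * self.w2[j] * h[j] * (1.0 - h[j])
                   for j in range(len(h))]
        for j in range(len(h)):
            self.w2[j] -= lr * delta_out * h[j]
            for i, xi in enumerate(x):
                self.w1[j][i] -= lr * delta_h[j] * xi
            self.b1[j] -= lr * delta_h[j]
        self.b2 -= lr * delta_out
        eps = 1e-12  # guard against log(0)
        return -(target * math.log(y + eps)
                 + (1 - target) * math.log(1.0 - y + eps))

vocab = build_vocab([text for text, _ in NOTES])
samples = [(vectorize(text, vocab), label) for text, label in NOTES]
net = FeedForwardNet(n_in=len(vocab), n_hidden=3)

losses = []  # mean loss per epoch
for epoch in range(300):
    epoch_loss = sum(net.train_step(x, t, lr=0.5) for x, t in samples)
    losses.append(epoch_loss / len(samples))

labels = [t for _, t in samples]
predictions = [1 if net.forward(x)[1] >= 0.5 else 0 for x, _ in samples]
print(f"training percent error: {percent_error(labels, predictions):.2f}%")
```

The percent error here is measured on the same four toy notes used for training, so it says nothing about the error rates reported in the abstract; a real evaluation would use held-out clinical notes.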
