AN INFECTIOUS DISEASE PREDICTION METHOD BASED ON K-NEAREST NEIGHBOR IMPROVED ALGORITHM

With the continuous development of medical information construction, the potential value of a large amount of medical information has not been exploited. Excavate a large number of medical records of outpatients, and train to generate disease prediction models to assist doctors in diagnosis and improve work efficiency.This paper proposes a disease prediction method based on k-nearest neighbor improvement algorithm from the perspective of patient similarity analysis. The method draws on the idea of clustering, extracts the samples near the center point generated by the clustering, applies these samples as a new training sample set in the K-nearest neighbor algorithm; based on the maximum entropy The K-nearest neighbor algorithm is improved to overcome the influence of the weight coefficient in the traditional algorithm and improve the accuracy of the algorithm. The real experimental data proves that the proposed k-nearest neighbor improvement algorithm has better accuracy and operational efficiency.

[1]  Agachai Sumalee,et al.  Short-Term Traffic State Prediction Based on Temporal–Spatial Correlation , 2013, IEEE Transactions on Intelligent Transportation Systems.

[2]  Jianying Hu,et al.  Towards Personalized Medicine: Leveraging Patient Similarity and Drug Similarity Analytics , 2014, AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science.

[3]  Saroja Kulkarni,et al.  Mining Social Media Data for Understanding Students’ Learning Experiences using Memetic algorithm , 2018 .

[4]  Allen Y. Yang,et al.  Robust Face Recognition via Sparse Representation , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5]  Georges Badr,et al.  Medical Data Mining for Heart Diseases and the Future of Sequential Mining in Medical Field , 2018, Machine Learning Paradigms.

[6]  Jimeng Sun,et al.  RETAIN: An Interpretable Predictive Model for Healthcare using Reverse Time Attention Mechanism , 2016, NIPS.

[7]  Igor Jurisica,et al.  Knowledge Discovery and Data Mining in Biomedical Informatics: The Future Is in Integrative, Interactive Machine Learning Solutions , 2014, Interactive Knowledge Discovery and Data Mining in Biomedical Informatics.

[8]  Nilmini Wickramasinghe,et al.  Deepr: A Convolutional Net for Medical Records , 2016, ArXiv.

[9]  Jiawei Han,et al.  Data Mining: Concepts and Techniques , 2000 .

[10]  Thaddeus Metz Medicine without Cure?: A Cluster Analysis of the Nature of Medicine. , 2018, The Journal of medicine and philosophy.

[11]  Yan Zhao,et al.  Improved KNN text classification algorithm with MapReduce implementation , 2017, 2017 4th International Conference on Systems and Informatics (ICSAI).

[12]  Fei Wang,et al.  Comprehensible Predictive Modeling Using Regularized Logistic Regression and Comorbidity Based Features , 2015, PloS one.

[13]  Anis Sharafoddini,et al.  Patient Similarity in Prediction Models Based on Health Data: A Scoping Review , 2017, JMIR medical informatics.

[14]  Francisco Herrera,et al.  kNN-IS: An Iterative Spark-based design of the k-Nearest Neighbors classifier for big data , 2017, Knowl. Based Syst..

[15]  Md. Kamrul Hasan,et al.  Prediction of breast cancer using support vector machine and K-Nearest neighbors , 2017, 2017 IEEE Region 10 Humanitarian Technology Conference (R10-HTC).

[16]  Hui Zhang,et al.  Spatiotemporal modeling of PM2.5 concentrations at the national scale combining land use regression and Bayesian maximum entropy in China. , 2018, Environment international.

[17]  He Qin,et al.  A Survey of Machine Learning Algorithms for Big Data , 2014 .

[18]  Noora Abdulrahman,et al.  KNN Classifier and Naive Bayse Classifier for Crime Prediction in San Francisco Context , 2017 .

[19]  Li Juan,et al.  TKNN: An Improved KNN Algorithm Based on Tree Structure , 2011, 2011 Seventh International Conference on Computational Intelligence and Security.

[20]  Mark Hoogendoorn,et al.  Prediction using patient comparison vs. modeling: A case study for mortality prediction , 2016, 2016 38th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC).

[21]  Sherry‐Ann Brown Patient Similarity: Emerging Concepts in Systems and Precision Medicine , 2016, Front. Physiol..

[22]  Tom Stonier,et al.  Towards a general theory of information II: information and entropy , 1989 .

[23]  Liangxiao Jiang,et al.  Bayesian Citation-KNN with distance weighting , 2014, Int. J. Mach. Learn. Cybern..

[24]  Fen Wang,et al.  The construction of undergraduate data mining course in the big data age , 2017, 2017 12th International Conference on Computer Science and Education (ICCSE).

[25]  Kenney Ng,et al.  Personalized Predictive Modeling and Risk Factor Identification using Patient Similarity , 2015, AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science.