An incremental learning classification algorithm based on forgetting factor for eHealth networks

The advances of network technology and mobile communication technology are making eHealth possible. In eHealth systems, physiological data and relevant context-aware data are acquired continuously and in real time. At the same time, such large-scale data results in huge challenges in the aspect of real-time big data processing since eHealth data appears in the form of data stream. Therefore, we propose a novel incremental learning algorithm, namely α-SVMSGD, which improves the SVMSGD (Support Vector Machine-Stochastic Gradient Descent) algorithm by updating the training data with the continuous data stream. Besides, this α-SVMSGD may handle the problem that original SVMSGD cannot further mine the useful information in unclassified data. In α-SVMSGD, the process of training data updating is completed by introducing the concept of forgetting mechanism, in which the forgetting factor α is introduced to weed out useless training data. α-SVMSGD is applied into ambient assisted living communications, and further incorporated into the data filtering layer of a local data processing architecture (LDPA) to reduce data redundancy. Simulation results confirm that the proposed algorithm is a promising data redundancy solution for classification without loss of accuracy in the case of real-time data stream.

[1]  Li Zhao,et al.  A new data stream classification algorithm , 2013, Proceedings of 2013 2nd International Conference on Measurement, Information and Control.

[2]  Guangjie Han,et al.  LDPA: a local data processing architecture in ambient assisted living communications , 2015, IEEE Communications Magazine.

[3]  Jinjun Chen,et al.  KASR: A Keyword-Aware Service Recommendation Method on MapReduce for Big Data Applications , 2014, IEEE Transactions on Parallel and Distributed Systems.

[4]  Chih-Jen Lin,et al.  LIBLINEAR: A Library for Large Linear Classification , 2008, J. Mach. Learn. Res..

[5]  Peter J. Hunter,et al.  Big Data, Big Knowledge: Big Data for Personalized Healthcare , 2015, IEEE Journal of Biomedical and Health Informatics.

[6]  Jinjun Chen,et al.  HireSome-II: Towards Privacy-Aware Cross-Cloud Service Composition for Big Data Applications , 2015, IEEE Transactions on Parallel and Distributed Systems.

[7]  Kun Wang,et al.  DAS: A dynamic assignment scheduling algorithm for stream computing in distributed applications , 2013, 2013 IEEE Global Communications Conference (GLOBECOM).

[8]  Tongyu Zhu,et al.  RTIC-C: A Big Data System for Massive Traffic Information Mining , 2013, 2013 International Conference on Cloud Computing and Big Data.

[9]  Lei Shu,et al.  A model-matching algorithm based on improved BP over out-of-order streams , 2014, 2014 IEEE International Conference on Communications (ICC).

[10]  Yiming Yang,et al.  RCV1: A New Benchmark Collection for Text Categorization Research , 2004, J. Mach. Learn. Res..

[11]  Yoram Singer,et al.  Pegasos: primal estimated sub-gradient solver for SVM , 2011, Math. Program..

[12]  Yoshua Bengio,et al.  Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[13]  Mohsen Guizani,et al.  Home M2M networks: Architectures, standards, and QoS improvement , 2011, IEEE Communications Magazine.

[14]  Lei Shu,et al.  Mobile big data fault-tolerant processing for ehealth networks , 2016, IEEE Network.

[15]  J.R. Casar,et al.  Context-aware services for ambient assisted living: A case-study , 2008, 2008 First International Symposium on Applied Sciences on Biomedical and Communication Technologies.

[16]  Daojing He,et al.  Lightweight and Confidential Data Discovery and Dissemination for Wireless Body Area Networks , 2014, IEEE Journal of Biomedical and Health Informatics.

[17]  Liu Yubin,et al.  Incremental Learning Method of Least Squares Support Vector Machine , 2010, 2010 International Conference on Intelligent Computation Technology and Automation.

[18]  Wei Xu,et al.  Towards Optimal One Pass Large Scale Learning with Averaged Stochastic Gradient Descent , 2011, ArXiv.