Information and knowing when to forget it

In this paper we propose several novel approaches for incorporating forgetting mechanisms into sequential prediction based machine learning algorithms. The broad premise of our work, supported and motivated in part by recent findings stemming from neurology research on the development of human brains, is that knowledge acquisition and forgetting are complementary processes, and that learning can (perhaps unintuitively) benefit from the latter too. We demonstrate that if forgetting is implemented in a purposeful and date driven manner, there are a number of benefits which can be gained from discarding information. The framework we introduce is a general one and can be used with any baseline predictor of choice. Hence in this sense it is best described as a meta-algorithm. The method we described was developed through a series of steps which increase the adaptability of the model, while being data driven. We first discussed a weakly adaptive forgetting process which we termed passive forgetting. A fully adaptive framework, which we termed active forgetting was developed by enveloping a passive forgetting process with a monitoring, self-aware module which detects contextual changes and makes a statistically informed choice when the model parameters should be abruptly rather than gradually updated. The effectiveness of the proposed meta-framework was demonstrated on two real world data sets concerned with challenges of major practical importance: those of predicting currency exchange rates and daily temperatures. On both tasks our approach was shown to be highly effective, reducing prediction errors by nearly 40%.

[1]  Aaas News,et al.  Book Reviews , 1893, Buffalo Medical and Surgical Journal.

[2]  Michael J. Kahana,et al.  Note on the power law of forgetting , 2017, bioRxiv.

[3]  T. Bollerslev,et al.  Intraday periodicity and volatility persistence in financial markets , 1997 .

[4]  Xavier Maldague,et al.  Infrared face recognition: A literature review , 2013, The 2013 International Joint Conference on Neural Networks (IJCNN).

[5]  Ognjen Arandjelovic,et al.  Discovering hospital admission patterns using models learnt from electronic hospital records , 2015, Bioinform..

[6]  Ognjen Arandjelovic,et al.  Complex temporal topic evolution modelling using the Kullback-Leibler divergence and the Bhattacharyya distance , 2016, EURASIP J. Bioinform. Syst. Biol..

[7]  Ognjen Arandjelovic,et al.  Prediction of future hospital admissions - what is the tradeoff between specificity and accuracy? , 2016, ArXiv.

[8]  Ognjen Arandjelovic,et al.  Contextually Learnt Detection of Unusual Motion-Based Behaviour in Crowded Public Spaces , 2013, ISCIS.

[9]  Ivan Koychev,et al.  Gradual Forgetting for Adaptation to Concept Drift , 2000 .

[10]  Svetha Venkatesh,et al.  Data-mining twitter and the autism spectrum disorder: A Pilot study , 2014, 2014 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2014).

[11]  Ognjen Arandjelovic,et al.  Identification of promising research directions using machine learning aided medical literature analysis , 2016, 2016 38th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC).

[12]  Ognjen Arandjelovic,et al.  Multiple-object Tracking in Cluttered and Crowded Public Spaces , 2010, ISVC.

[13]  Ognjen Arandjelovic,et al.  Towards sophisticated learning from EHRs: Increasing prediction specificity and accuracy using clinically meaningful risk criteria , 2016, 2016 38th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC).

[14]  Foster J. Provost,et al.  Predictive Modeling With Big Data: Is Bigger Really Better? , 2013, Big Data.

[15]  Ognjen Arandelovic,et al.  Automatic knowledge extraction from EHRs , 2016 .

[16]  Radford M. Neal Pattern Recognition and Machine Learning , 2007, Technometrics.

[17]  A. Newberg,et al.  Changes in the central nervous system and their clinical correlates during long-term spaceflight. , 1994, Aviation, space, and environmental medicine.

[18]  B. Barres,et al.  The complement system: an unexpected role in synaptic pruning during development and disease. , 2012, Annual review of neuroscience.

[19]  D. Rubin,et al.  One Hundred Years of Forgetting : A Quantitative Description of Retention , 1996 .

[20]  Bradley S. Peterson,et al.  Loss of mTOR-Dependent Macroautophagy Causes Autistic-like Synaptic Pruning Deficits , 2014, Neuron.

[21]  Svetha Venkatesh,et al.  Overcoming data scarcity of Twitter: Using tweets as bootstrap with application to autism-related topic content analysis , 2015, 2015 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM).