Towards Hardware-driven Design of Low-energy Algorithms for Data Analysis

In the era of "big" data, data analysis algorithms need to be efficient. Traditionally researchers would tackle this problem by considering "small" data algorithms, and investigating how to make them computationally more efficient for big data applications. The main means to achieve computational efficiency would be to revise the necessity and order of subroutines, or to approximate calculations. This paper presents a viewpoint that in order to be able to cope with the new challenges of the growing digital universe, research needs to take a combined view towards data analysis algorithm design and hardware design, and discusses a potential research direction in taking an intreated approach in terms of algorithm design and hardware design. Analyzing how data mining algorithms operate at the elementary operations level can help do design more specialized and dedicated hardware, that, for instance, would be more energy efficient. In turn, understanding hardware design can help to develop more effective algorithms.

[1]  Michael L. Overton,et al.  Numerical Computing with IEEE Floating Point Arithmetic , 2001 .

[2]  Qing Zhao,et al.  Resource-Constrained Signal Processing , Communications , and Networking , 2007 .

[3]  G.E. Moore,et al.  Cramming More Components Onto Integrated Circuits , 1998, Proceedings of the IEEE.

[4]  Qing Zhao,et al.  Resource-Constrained Signal Processing, Communications, and Networking [From the Guest Editors] , 2007, IEEE Signal Process. Mag..

[5]  Christian Wietfeld,et al.  MobiSIM: a simulation library for resource prediction of smartphones and wireless sensor networks , 2013, SpringSim.

[6]  Eric Hsueh-Chan Lu,et al.  Energy-efficient real-time object tracking in multi-level sensor networks by mining and predicting movement patterns , 2009, J. Syst. Softw..

[7]  Jaakko Hollmén,et al.  Feature Extraction and Selection from Vibration Measurements for Structural Health Monitoring , 2009, IDA.

[8]  Luís Torgo,et al.  OpenML: A Collaborative Science Platform , 2013, ECML/PKDD.

[9]  Susanne Albers,et al.  Energy-efficient algorithms , 2010, Commun. ACM.

[10]  Sebastian Tschiatschek,et al.  Bayesian Network Classifiers with Reduced Precision Parameters , 2012, ECML/PKDD.

[11]  James Demmel,et al.  Minimizing Communication in Numerical Linear Algebra , 2009, SIAM J. Matrix Anal. Appl..

[12]  Paul M. B. Vitányi Time, space, and energy in reversible computing , 2005, CF '05.

[13]  Philip S. Yu,et al.  Top 10 algorithms in data mining , 2007, Knowledge and Information Systems.

[14]  Indre Zliobaite,et al.  Mobile Sensing Data for Urban Mobility Analysis: A Case Study in Preprocessing , 2014, EDBT/ICDT Workshops.

[15]  Geoff Holmes,et al.  Experiment databases , 2012, Machine Learning.

[16]  Jaakko Hollmén,et al.  Collaborative Filtering for Coordinated Monitoring in Sensor Networks , 2011, 2011 IEEE 11th International Conference on Data Mining Workshops.