Power and performance characterization, analysis and tuning for energy-efficient edge detection on atom and ARM based platforms

The de facto standard for embedded platforms with medium to low computing demands are ARM with Thumb ISA and Intel Atom with the X86 ISA with multiple cores. Operating these architectures in the milliwatts range while running realtime computer vision corner detection algorithms is a challenging problem. We present the analysis of power, performance and energy-efficiency measurements of Harris corner detection across a wide range of voltage and frequency settings, multicore/multithreading strategies, and compiler and application optimization parameters to find how the interplay of these parameters affect the power, performance and energy-efficiency. Our measurement of results on state-of-the-art embedded platforms demonstrate that a systematic cross-layer optimization at the application level (Sobel filter type, aperture size, number of image tiles), compiler level (branch prediction, function inlining) and system level (voltage and frequency setting, single core vs multicore implementation) significantly improves the energy-efficiency of corner detection, while meeting its real-time performance constraints. This cross-layer optimization improves the energy-efficiency of Harris corner on Atom and ARM by 89.5% and 87.2%, respectively.

[1]  Fawnizu Azmadi Hussin,et al.  Optimization of Processor Architecture for Image Edge Detection Filter , 2010, 2010 12th International Conference on Computer Modelling and Simulation.

[2]  Christopher G. Harris,et al.  A Combined Corner and Edge Detector , 1988, Alvey Vision Conference.

[3]  B. Achiriloaie,et al.  VI REFERENCES , 1961 .

[4]  M. Fatih Demirci,et al.  Pipelining Harris corner detection with a tiny FPGA for a mobile robot , 2013, 2013 IEEE International Conference on Robotics and Biomimetics (ROBIO).

[5]  Cem Ersoy,et al.  Wireless sensor networks for healthcare: A survey , 2010, Comput. Networks.

[6]  Waldemar Celes Filho,et al.  Accelerated Corner-Detector Algorithms , 2008, BMVC.

[7]  Amir Fijany,et al.  Highly Parallel Implementation of Harris Corner Detector on CSX SIMD Architecture , 2010, Euro-Par Workshops.

[8]  Huosheng Hu,et al.  Human motion tracking for rehabilitation - A survey , 2008, Biomed. Signal Process. Control..

[9]  Jun Zhang,et al.  Accelerating Harris Algorithm with GPU for Corner Detection , 2013, 2013 Seventh International Conference on Image and Graphics.

[10]  Cheong-Ghil Kim,et al.  Optimizing image processing on multi-core CPUs with Intel parallel programming technologies , 2011, Multimedia Tools and Applications.