New Trends in Computer Technologies and Applications

Nowadays ILP processors can’t analyze the semantic information of instruction thread to change instruction series automatically for increasing ILP degree. High performance required programs such as image processing or machine learning contain a lot of loop structure. Loop structure will be bounded with the instruction number of one basic block. That cause processors are hard to enhance the computing efficiency. The characteristics of the loop structure in the program are as follows: (1) Instruction will be fetched from cache and be decoded repeatedly. (2) The issued instructions are bounded by the loop body. (3) There is data dependence between iterations. These factors will get worse the poor ILP in the loop codes. In this paper, we propose an architecture called semantic-based dynamic loop unrolling mechanism. The proposed architecture can buffer the instruction series of nested loop, unroll it automatically by analyzing the instruction flow to find the loop body with the semantic of loop instructions, store them to the instruction buffer, and dispatch them to target the processor cores. The proposed architecture consists of three units: loop detect unit (LDU), unrolling control unit (UCU) and loop unrolling unit (LUU). LDU will parse the semantic of instructions to find the closed interval of the loop body instructions. UCU will control LUU in the whole process. LUU will unroll the loop based on the information collected by LDU. Loop controller will handle the complementation overhead for branch miss prediction and the loop finish-up codes. The verifications use ARM instructions generated by Keil lVision5 compiler. The results show that eliminating iteration dependence can improve ILP by 140% to 180%.

[1]  Chia-Hung Yeh,et al.  Self-learning-based single image super-resolution of a highly compressed image , 2013, 2013 IEEE 15th International Workshop on Multimedia Signal Processing (MMSP).

[2]  Michael Klemm,et al.  OpenMP Programming on Intel Xeon Phi Coprocessors: An Early Performance Comparison , 2012, MARC@RWTH.

[3]  Chia-Hung Yeh,et al.  Self-learning-based signal decomposition for multimedia applications: A review and comparative study , 2014, Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific.

[4]  Cewu Lu,et al.  Two-Class Weather Classification , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[5]  Shu-Jhen Fan-Jiang,et al.  Self-learning-based post-processing for image/video deblocking via sparse representation , 2014, J. Vis. Commun. Image Represent..

[6]  Hsin-Hsin Peng,et al.  Serum-derived nanoparticles: de novo generation and growth in vitro, and internalization by mammalian cells in culture. , 2011, Nanomedicine.

[7]  Cheng Zhang,et al.  Scene-free multi-class weather classification on single images , 2016, Neurocomputing.

[8]  Li-Wei Kang,et al.  Deep Learning-Based Weather Image Recognition , 2018, 2018 International Symposium on Computer, Consumer and Control (IS3C).

[9]  Yinglong Wang,et al.  A Hierarchical Approach for Rain or Snow Removing in a Single Color Image. , 2017, IEEE transactions on image processing : a publication of the IEEE Signal Processing Society.

[10]  Ahmed M. Elgammal,et al.  Weather classification with deep convolutional neural networks , 2015, 2015 IEEE International Conference on Image Processing (ICIP).

[11]  Rustam Stolkin,et al.  Weather Classification: A new multi-class dataset, data augmentation approach and comprehensive evaluations of Convolutional Neural Networks , 2018, 2018 NASA/ESA Conference on Adaptive Hardware and Systems (AHS).

[12]  Lawrence D. Jackel,et al.  Backpropagation Applied to Handwritten Zip Code Recognition , 1989, Neural Computation.

[13]  Wei-Ta Chu,et al.  Camera as weather sensor: Estimating weather information from single images , 2017, J. Vis. Commun. Image Represent..

[14]  Christophe Dessimoz,et al.  SWPS3 – fast multi-threaded vectorized Smith-Waterman for IBM Cell/B.E. and ×86/SSE2 , 2008, BMC Research Notes.

[15]  H. Höhling,et al.  Analysis of a General Principle of Crystal Nucleation, Formation in the Different Hard Tissues , 1997, Advances in dental research.

[16]  Zheng Zhang,et al.  Multi-class weather classification on single images , 2015, 2015 IEEE International Conference on Image Processing (ICIP).

[17]  Dumitru Erhan,et al.  Going deeper with convolutions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[18]  Jyh-Da Wei,et al.  Particle-Cell Detecting and Tracking in Live-Cell Time-Lapse Images , 2017, 2017 14th International Symposium on Pervasive Systems, Algorithms and Networks & 2017 11th International Conference on Frontier of Computer Science and Technology & 2017 Third International Symposium of Creative Computing (ISPAN-FCST-ISCC).

[19]  Joachim Mayer,et al.  Ultrastructural analysis of vascular calcifications in uremia. , 2010, Journal of the American Society of Nephrology : JASN.

[20]  Shu-Jhen Fan-Jiang,et al.  Efficient image/video deblocking via sparse representation , 2012, 2012 Visual Communications and Image Processing.

[21]  Jenq-Neng Hwang,et al.  DesnowNet: Context-Aware Deep Network for Snow Removal , 2017, IEEE Transactions on Image Processing.

[22]  Jérémie Bossu,et al.  Rain or Snow Detection in Image Sequences Through Use of a Histogram of Orientation of Streaks , 2011, International Journal of Computer Vision.

[23]  Hsin-Hsin Peng,et al.  Of nanobacteria, nanoparticles, biofilms and their role in health and disease: facts, fancy and future. , 2014, Nanomedicine.

[24]  Alexander Heiss,et al.  Fetuin-A Regulation of Calcified Matrix Metabolism , 2011, Circulation research.

[25]  I. Ide,et al.  Rainy weather recognition from in-vehicle camera images for driver assistance , 2005, IEEE Proceedings. Intelligent Vehicles Symposium, 2005..

[26]  Erik Meijering,et al.  Methods for cell and particle tracking. , 2012, Methods in enzymology.

[27]  Chia-Hung Yeh,et al.  Image and video restoration and enhancement via sparse representation , 2017 .

[28]  Qiang Chen,et al.  Network In Network , 2013, ICLR.

[29]  K. Jaqaman,et al.  Robust single particle tracking in live cell time-lapse sequences , 2008, Nature Methods.

[30]  Xiaoming Zheng,et al.  Weather Recognition Based on Images Captured by Vision System in Vehicle , 2009, ISNN.

[31]  Ping Tan,et al.  Photometric stereo and weather estimation using internet images , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[32]  Alessandro Foi,et al.  Image Denoising by Sparse 3-D Transform-Domain Collaborative Filtering , 2007, IEEE Transactions on Image Processing.

[33]  Chia-Hung Yeh,et al.  Learning-Based Joint Super-Resolution and Deblocking for a Highly Compressed Image , 2015, IEEE Transactions on Multimedia.

[34]  F. Moosmann,et al.  Classification of weather situations on single color images , 2008, 2008 IEEE Intelligent Vehicles Symposium.

[35]  Jan Martel,et al.  Purported nanobacteria in human blood as calcium carbonate nanoparticles , 2008, Proceedings of the National Academy of Sciences.

[36]  陳敦裕,et al.  Visual depth guided color image rain streaks removal using sparse coding , 2014 .

[37]  Chih-Yang Lin,et al.  Rain Removal Using Single Image based on Non-negative Matrix Factorization , 2014, ICS.

[38]  Martin Vetterli,et al.  Howis the weather: Automatic inference from images , 2012, 2012 19th IEEE International Conference on Image Processing.

[39]  Delu Zeng,et al.  Removing Rain from Single Images via a Deep Detail Network , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[40]  Takayuki Hamano,et al.  Fetuin-mineral complex reflects extraosseous calcification stress in CKD. , 2010, Journal of the American Society of Nephrology : JASN.

[41]  Hsin-Hsin Peng,et al.  Physicochemical and biological properties of biomimetic mineralo-protein nanoparticles formed spontaneously in biological fluids. , 2013, Small.

[42]  Cewu Lu,et al.  RSCM: Region Selection and Concurrency Model for Multi-Class Weather Recognition , 2017, IEEE Transactions on Image Processing.

[43]  Xuelong Li,et al.  A Multi-Task Framework for Weather Recognition , 2017, ACM Multimedia.

[44]  Bertil Schmidt,et al.  Reconfigurable architectures for bio-sequence database scanning on FPGAs , 2005, IEEE Transactions on Circuits and Systems II: Express Briefs.

[45]  Xinghao Ding,et al.  Clearing the Skies: A Deep Network Architecture for Single-Image Rain Removal , 2016, IEEE Transactions on Image Processing.

[46]  Weiguo Liu,et al.  Bio-sequence database scanning on a GPU , 2006, Proceedings 20th IEEE International Parallel & Distributed Processing Symposium.

[47]  Jan Martel,et al.  Putative Nanobacteria Represent Physiological Remnants and Culture By-Products of Normal Calcium Homeostasis , 2009, PloS one.