Platform-aware dynamic configuration support for efficient text processing on heterogeneous system

Significant efforts have been made in accelerating computer vision and machine learning algorithms by utilizing parallel processors such as multi-core CPUs and GPUs. Although the suitability of GPU is well-known for computer graphics and image processing applications which require massively parallel floating-point computations, recent research movement towards general purpose computing on-GPU (GPGPU) makes it possible to take advantage of parallel processors to accelerate text processing applications as well. However, how to fully leverage different types of parallel processor architectures to obtain optimal performance (especially with text) without making specific efforts to each platform still remains a great challenge. We applied performance and accuracy enhancements to Naive Bayes algorithm to develop a practically sound implementation of text classification. A platform-aware dynamic configuration support automation flow is also proposed to support the seamless execution of our work across platforms. Experiments on various (integrated graphics, dedicated multiple GPUs) platforms demonstrate that our proposed approach improves both accuracy and performance of text classification.

[1]  Geoff Holmes,et al.  Multinomial Naive Bayes for Text Categorization Revisited , 2004, Australian Conference on Artificial Intelligence.

[2]  Clay S. Turner,et al.  A Fast Binary Logarithm Algorithm [DSP Tips & Tricks] , 2010, IEEE Signal Processing Magazine.

[3]  Ching-Lung Su,et al.  Overview and comparison of OpenCL and CUDA technology for GPGPU , 2012, 2012 IEEE Asia Pacific Conference on Circuits and Systems.

[4]  George Forman,et al.  An Extensive Empirical Study of Feature Selection Metrics for Text Classification , 2003, J. Mach. Learn. Res..

[5]  Frank Mueller,et al.  GPU-Accelerated Text Mining , 2009 .

[6]  Yogesh Kumar Mittal,et al.  Implementation of Fast Artificial Neural Network for Pattern Classification on Heterogeneous System , 2013 .

[7]  Helio J. C. Barbosa,et al.  Accelerated parallel genetic programming tree evaluation with OpenCL , 2013, J. Parallel Distributed Comput..

[8]  Chien-Mo James Li,et al.  GPU-based N-detect transition fault ATPG , 2013, 2013 50th ACM/EDAC/IEEE Design Automation Conference (DAC).

[9]  T. Bayes An essay towards solving a problem in the doctrine of chances , 2003 .

[10]  Cen Li,et al.  A comparative study of text classification approaches for personalized retrieval in PubMed , 2011, 2011 IEEE International Conference on Bioinformatics and Biomedicine Workshops (BIBMW).

[11]  Jianbin Fang,et al.  A Comprehensive Performance Comparison of CUDA and OpenCL , 2011, 2011 International Conference on Parallel Processing.

[12]  Cheng-Yen Lin,et al.  Design of vehicle detection methods with OpenCL programming on multi-core systems , 2013, The 11th IEEE Symposium on Embedded Systems for Real-time Multimedia.

[13]  Joseph R. Cavallaro,et al.  Accelerating computer vision algorithms using OpenCL framework on the mobile GPU - A case study , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[14]  Koen E. A. van de Sande,et al.  Empowering Visual Categorization With the GPU , 2011, IEEE Transactions on Multimedia.

[15]  Joseph R. Cavallaro,et al.  Accelerating Computer Vision Algorithms Using OpenCL on Mobile GPU – A Case Study , 2013 .