Comparative Analysis of Efficient Platforms: Scalable Algorithms and Parallel Paradigms for Large Scale Image Processing

With the advancement of technology, we are heading towards a paperless environment, yet a large number of documents still exist in paper format in our daily lives. Hence the need has arisen to digitize these paper documents, archive them, and make them viewable at all times. Even a small organization may hold thousands or millions of documents, or more. This chapter presents a comparative analysis of programming languages and libraries for processing, in parallel, a huge stream of images whose arrival is unpredictable and varies over time. Because parallelism can be implemented at different levels, different algorithms and techniques are also discussed. The chapter also reviews the state of the art and discusses existing technical solutions for implementing parallelization on a hybrid platform for real-time processing of the images in a stream. Experimental results obtained using Apache Hadoop in combination with OpenMP are also discussed.
