High-performance computing for vision

The main focus of the paper is on effectively using commercial-off-the-shelf (COTS) based general purpose parallel computing platforms to realize high speed implementations of vision tasks. Due to the successful use of the COTS-based systems in a variety of high performance applications, it is attractive to consider their use for vision applications as well. However, the irregular data dependencies in vision tasks lead to large communication overheads in the HPC systems. At the University of Southern California, our research efforts have been directed toward designing scalable parallel algorithms for vision tasks on the HPC systems. In our approach, we use the message passing programming model to develop portable code. Our algorithms are specified using C and MPI. In this paper, we summarize our efforts, and illustrate our approach using several example vision tasks.

[1]  Howard Jay Siegel,et al.  Mapping computer-vision-related tasks onto reconfigurable parallel-processing systems , 1992, Computer.

[2]  Seth Copen Goldstein,et al.  Active messages: a mechanism for integrating communication and computation , 1998, ISCA '98.

[3]  Ramesh Subramonian,et al.  LogP: towards a realistic model of parallel computation , 1993, PPOPP '93.

[4]  Charles L. Seitz,et al.  Myrinet: A Gigabit-per-Second Local Area Network , 1995, IEEE Micro.

[5]  Sanjay Ranka,et al.  A Practical Hierarchical Model of Parallel Computation. I. The Model , 1992, J. Parallel Distributed Comput..

[6]  Viktor K. Prasanna,et al.  Scalable Data Parallel Implementations of Object Recognition Using Geometric Hashing , 1994, J. Parallel Distributed Comput..

[7]  Dionysios I. Reisis,et al.  Parallel Computations on Reconfigurable Meshes , 1993, IEEE Trans. Computers.

[8]  G. Astfalk Parallel programming on the Convex MPP , 1994, Proceedings of OCEANS'94.

[9]  David A. Bader,et al.  Parallel Algorithms for Image Histogramming and Connected Components with an Experimental Study , 1996, J. Parallel Distributed Comput..

[10]  Monica S. Lam,et al.  Global optimizations for parallelism and locality on scalable parallel machines , 1993, PLDI '93.

[11]  Peter A. Dinda,et al.  Communication and memory requirements as the basis for mapping task and data parallel programs , 1994, Proceedings of Supercomputing '94.

[12]  Sanjay Ranka,et al.  Irregular Personalized Communication on Distributed Memory Machines , 1995, J. Parallel Distributed Comput..

[13]  Michael J. Quinn,et al.  Divide-and-conquer programming on MIMD computers , 1995, Proceedings of 9th International Parallel Processing Symposium.

[14]  Rice UniversityCORPORATE,et al.  High performance Fortran language specification , 1993 .

[15]  Amotz Bar-Noy,et al.  Designing broadcasting algorithms in the postal model for message-passing systems , 1992, SPAA '92.

[16]  Viktor K. Prasanna,et al.  Efficient Parallel Algorithms for Image Template Matching on Hypercube SIMD Machines , 1989, IEEE Trans. Pattern Anal. Mach. Intell..

[17]  T. T. Kwan,et al.  Communication and computation performance of the CM-5 , 1993, Supercomputing '93.

[18]  Jon A. Webb,et al.  High performance computing in image processing and computer vision , 1994, Proceedings of the 12th IAPR International Conference on Pattern Recognition, Vol. 2 - Conference B: Computer Vision & Image Processing. (Cat. No.94CH3440-5).

[19]  Sanjay Ranka,et al.  Parallel remapping algorithms for adaptive problems , 1995, Proceedings Frontiers '95. The Fifth Symposium on the Frontiers of Massively Parallel Computation.

[20]  Larry S. Davis,et al.  Effective use of SIMD parallelism in low- and intermediate-level vision , 1992, Computer.

[21]  C. A. R. Hoare,et al.  Monitors: an operating system structuring concept , 1974, CACM.

[22]  Viktor K. Prasanna,et al.  Parallel Architectures and Algorithms for Image Component Labeling , 1992, IEEE Trans. Pattern Anal. Mach. Intell..

[23]  Viktor K. Prasanna,et al.  Scalable parallel implementations of perceptual grouping on connection machine CM-5 , 1994, Proceedings of the 12th IAPR International Conference on Pattern Recognition, Vol. 2 - Conference B: Computer Vision & Image Processing. (Cat. No.94CH3440-5).

[24]  Jack Dongarra,et al.  A User''s Guide to PVM Parallel Virtual Machine , 1991 .

[25]  Stelios C. Orphanoudakis,et al.  Load Balancing Requirements in Parallel Implementations of Image Feature Extraction Tasks , 1993, IEEE Trans. Parallel Distributed Syst..

[26]  Concettina Guerra Survey of parallel algorithms for structural pattern matching , 1994, Proceedings of the 12th IAPR International Conference on Pattern Recognition, Vol. 2 - Conference B: Computer Vision & Image Processing. (Cat. No.94CH3440-5).

[27]  Andrew A. Chien,et al.  Software overhead in messaging layers: where does the time go? , 1994, ASPLOS VI.

[28]  Steven Fortune,et al.  Parallelism in random access machines , 1978, STOC.

[29]  Geoffrey C. Fox,et al.  Compiling Fortran 90D/HPF for Distributed Memory MIMD Computers , 1994, J. Parallel Distributed Comput..

[30]  Viktor K. Prasanna,et al.  Low level vision processing on connection machine CM-5 , 1993, 1993 Computer Architectures for Machine Perception.

[31]  Ram Nevatia,et al.  Specifying heterogeneous suites for vision tasks , 1993, 1993 Computer Architectures for Machine Perception.

[32]  Viktor K. Prasanna,et al.  Scalable data parallel algorithms and implementations for object recognition , 1993 .

[33]  Yongwha Chung,et al.  A fast asynchronous algorithm for linear feature extraction on IBM SP-2 , 1995, Proceedings of Conference on Computer Architectures for Machine Perception.

[34]  Leslie G. Valiant,et al.  A bridging model for parallel computation , 1990, CACM.

[35]  Sanjay Ranka,et al.  A practical hierarchical model of parallel computation , 1991, Proceedings of the Third IEEE Symposium on Parallel and Distributed Processing.

[36]  K. Ramesh Babu,et al.  Linear Feature Extraction and Description , 1979, IJCAI.

[37]  Kwan Woo Ryu,et al.  The block distributed memory model for shared memory multiprocessors , 1994, Proceedings of 8th International Parallel Processing Symposium.

[38]  Jake K. Aggarwal,et al.  Applying perceptual organization to the detection of man-made objects in non-urban scenes , 1992, Pattern Recognit..

[39]  Cho-Li Wang High performance computing for vision on distributed-memory machines , 1996 .

[40]  Peter Athanas,et al.  Addressing the Computational Requirements of Image Processing with a Custom Computing Machine: An Overview1 , 1997 .

[41]  Viktor K. Prasanna,et al.  Parallel algorithms and architectures for discrete relaxation technique , 1991, Proceedings. 1991 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[42]  Viktor K. Prasanna,et al.  Parallelization of perceptual grouping on distributed memory machines , 1995, Proceedings of Conference on Computer Architectures for Machine Perception.