Parallel Protein Information Analysis (PAPIA) System Running on a 64-Node PC Cluster.

Protein information analysis is widely regarded as a key technology in drug design, macromolecular engineering, and understanding genome sequences. Because vast amount of calculations are required, further speed-up for protein information analysis is very much in demand. We have implemented the PAPIA (PArallel Protein Information Analysis) system on the RWC PC cluster IIa (PAPIA cluster) which consists of 64 Pentium Pro 200MHz microprocessors. The PAPIA system performs fast parallel processing for typical calculations in protein analysis, such as structure similarity search, sequence homology search and multiple sequence alignment, nearly 60 times faster than a single processor. We have started a WWW service (http://www.rwcp.or.jp/papia/), allowing any biologist to easily submit jobs to the PAPIA system through a WWW browser. The user can experience the power of current parallel processing technology.

[1]  Tamotsu Noguchi,et al.  Parallel PDB Data Retriever "PDB Diving Booster" , 1997, ISHPC.

[2]  G J Williams,et al.  The Protein Data Bank: a computer-based archival file for macromolecular structures. , 1978, Archives of biochemistry and biophysics.

[3]  Tamotsu Noguchi,et al.  PDB-REPRDB: A Database of Representative Protein Chains in PDB (Protein Data Bank) , 1997, ISMB.

[4]  M. Kanehisa,et al.  DBGET/LinkDB: an integrated database retrieval system. , 1998, Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing.

[5]  Masato Ishikawa,et al.  Comprehensive study on iterative algorithms of multiple sequence alignment , 1995, Comput. Appl. Biosci..

[6]  T. Heinemeyer,et al.  Databases on transcriptional regulation : TRANSFAC , TRRD and COMPEL , 1997 .

[7]  G J Williams,et al.  The Protein Data Bank: a computer-based archival file for macromolecular structures. , 1977, Journal of molecular biology.

[8]  秋山 泰,et al.  Parallel Protein Information Analysis (PAPIA) system implemented on RWC PC cluster II , 1998 .

[9]  K Nishikawa,et al.  Predicting protein secondary structure based on amino acid sequence. , 1991, Methods in enzymology.

[10]  Tamotsu Noguchi,et al.  Employing A* Algorithm in Parallel Multiple Protein Sequence Alignment , 1997 .

[11]  Yutaka Ishikawa,et al.  Implementation of Gang-Scheduling on Workstation Cluster , 1996, JSSPP.

[12]  Masahiro Ito,et al.  Prediction of protein secondary structure using the 3D-1D compatibility algorithm , 1997, Comput. Appl. Biosci..

[13]  Rolf Apweiler,et al.  The SWISS-PROT protein sequence data bank and its supplement TrEMBL , 1997, Nucleic Acids Res..

[14]  Mitsuhisa Sato,et al.  PM: An Operating System Coordinated High Performance Communication Library , 1997, HPCN Europe.