BioPortal: A Portal for Deployment of Bioinformatics Applications on Cluster and Grid Environments

Over the last few years, interest in biotechnology has increased dramatically. With the completion of the sequencing of the human genome, this interest is likely to grow even more rapidly. The size of genetic information databases doubles every 14 months, an explosion of information that overwhelms related bioscience disciplines and, consequently, overtaxes existing computational tools for data analysis. There is a persistent and continuous search for new alternatives and new technologies, all with the common goal of improving overall computational performance. Grid infrastructures interconnect a number of heterogeneous hosts through the Internet, enabling large-scale aggregation and sharing of computational, data, and other resources across institutional boundaries. In this paper, we present BioPortal, a user-friendly, web-based GUI that eases the deployment of well-known bioinformatics applications on large-scale cluster and grid computing environments. The major motivation of this research is to enable biologists and geneticists, as well as biology students and investigators, to access high-performance computing without specific technical knowledge of how applications are handled by these computing environments and, no less important, without introducing any additional drawback, in order to accelerate their experimental and sequence data analysis. As a result, we demonstrate the viability of such a design and implementation using solely freely available software.
