This paper presents a parallel program for assessing the codetermination of gene transcriptional states from large- scale simultaneous gene expression measurements with cDNA microarrays. The parallel program is based on a nonlinear statistical framework recently proposed for the analysis of gene interaction via multivariate expression arrays. Parallel computing is key in the application of the statistical framework to a large set of genes because a prohibitive amount of computer time is required on a classical single-CPU machine. Our parallel program, named the Parallel Analysis of Gene Expression (PAGE) program, exploits inherent parallelism exhibited in the proposed codetermination prediction models. By running PAGE on 64 processors in Beowulf, a clustered parallel system, an analysis of melanoma cDNA microarray expression data has been completed within 12 days of computer time, an analysis that would have required about one and half years on a single-CPU computing system. A data visualization program, named the Visualization of Gene Expression (VOGE) program, has been developed to help interpret the massive amount of quantitative information produced by PAGE. VOGE provides graphical data visualization and analysis tools with filters, histograms, and accesses to other genetic databanks for further analyses of the quantitative information.
[1]
Edward R. Dougherty,et al.
Coefficient of determination in nonlinear signal processing
,
2000,
Signal Process..
[2]
Y. Chen,et al.
Ratio-based decisions and the quantitative analysis of cDNA microarray images.
,
1997,
Journal of biomedical optics.
[3]
Robert L. Martino,et al.
Parallel Algorithms in Molecular Biology
,
1997,
HPCN Europe.
[4]
Benjamin Ray Seyfarth,et al.
How to Build a Beowulf: A Guide to the Implementation and Application of PC Clusters
,
2000,
Scalable Comput. Pract. Exp..
[5]
Selim G. Akl,et al.
Design and analysis of parallel algorithms
,
1985
.
[6]
K Sivakumar,et al.
General nonlinear framework for the analysis of gene interaction via multivariate expression arrays.
,
2000,
Journal of biomedical optics.
[7]
E. Dougherty,et al.
Multivariate measurement of gene expression relationships.
,
2000,
Genomics.
[8]
Kai Hwang,et al.
Advanced computer architecture - parallelism, scalability, programmability
,
1992
.
[9]
Bhagirath Narahari,et al.
Dynamic load balancing schemes for computing accessible surface area of protein molecules
,
1998,
Proceedings. Fifth International Conference on High Performance Computing (Cat. No. 98EX238).