Parallel Rank Coherence in Networks for Inferring Disease Phenotype and Gene Set Associations

The RCNet (Rank Coherence in Networks) algorithm has been used to find out the associations between the gene sets and disease phenotypes. However, it suffers from high computational cost when the size of dataset is very large. In this paper, we design three mechanisms to solve the RCNet algorithm on heterogeneous CPU-GPU system based on CUDA and OpenMP programming model. The pipeline mechanism is much suitable for the collaborative computing on CPU and dual-GPUs, which can achieve more than 33 times performance gains. The work plays an important role in reconstructing the disease phoneme-genome association efficiently.

[1]  G. Vriend,et al.  A text-mining analysis of the human phenome , 2006, European Journal of Human Genetics.

[2]  E. Snitkin,et al.  Genome-wide prioritization of disease genes and identification of disease-disease associations from an integrated human functional linkage network , 2009, Genome Biology.

[3]  Brad T. Sherman,et al.  Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources , 2008, Nature Protocols.

[4]  Nicholas A. Hamilton,et al.  A GPU Implementation of Fast Parallel Markov Clustering in Bioinformatics Using EllPACK-R Sparse Data Format , 2010, 2010 Second International Conference on Advances in Computing, Control, and Telecommunication Technologies.

[5]  TaeHyun Hwang,et al.  Inferring disease and gene set associations with rank coherence in networks , 2011, Bioinform..

[6]  Antonino Tumeo,et al.  Accelerating DNA analysis applications on GPU clusters , 2010, 2010 IEEE 8th Symposium on Application Specific Processors (SASP).

[7]  T. Ideker,et al.  Network-based classification of breast cancer metastasis , 2007, Molecular systems biology.

[8]  Jens H. Krüger,et al.  A Survey of General‐Purpose Computation on Graphics Hardware , 2007, Eurographics.

[9]  Nuno Roma,et al.  Advantages and GPU implementation of high-performance indexed DNA search based on suffix arrays , 2011, 2011 International Conference on High Performance Computing & Simulation.

[10]  John D. Owens,et al.  Multi-GPU MapReduce on GPU Clusters , 2011, 2011 IEEE International Parallel & Distributed Processing Symposium.

[11]  TaeHyun Hwang,et al.  A Heterogeneous Label Propagation Algorithm for Disease Gene Discovery , 2010, SDM.

[12]  Pablo Tamayo,et al.  Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[13]  V. McKusick Mendelian Inheritance in Man and Its Online Version, OMIM , 2007, The American Journal of Human Genetics.

[14]  David Martin,et al.  GOToolBox: functional analysis of gene datasets based on Gene Ontology , 2004, Genome Biology.

[15]  P. Robinson,et al.  Walking the interactome for prioritization of candidate disease genes. , 2008, American journal of human genetics.

[16]  Michael Q. Zhang,et al.  Network-based global inference of human disease genes , 2008, Molecular systems biology.

[17]  C. Wijmenga,et al.  Reconstruction of a functional human gene network, with an application for prioritizing positional candidate genes. , 2006, American journal of human genetics.

[18]  Jürgen Teich,et al.  Generating Device-specific GPU Code for Local Operators in Medical Imaging , 2012, 2012 IEEE 26th International Parallel and Distributed Processing Symposium.

[19]  Bernhard Schölkopf,et al.  Learning with Local and Global Consistency , 2003, NIPS.

[20]  Wu-chun Feng,et al.  Accelerating Protein Sequence Search in a Heterogeneous Computing System , 2011, 2011 IEEE International Parallel & Distributed Processing Symposium.