Towards the Discovery of Diseases Related by Genes Using Vertex Similarity Measures

Discovering the relationships of gene to gene, gene to its related diseases, and diseases implicated in common genes is important. However, traditional biological methods can be expensive. Here, we show that the diseases implicated in common genes and the genes related to a multiple-gene disease can be inferred by the vertex similarity measures, a type of method to find the similar vertices in a network based on its structure. The relationship among diseases and the relationship among genes are modeled as two biological networks: human disease network and disease gene network. We apply the vertex similarity among the vertices in the human disease network to infer the diseases implicated in common genes. By similar manner, we utilize vertex similarity measures on the disease gene network to infer the genes related to a common multiple-gene disease. Experimental results demonstrate the potential of vertex similarity as an inexpensive approach to infer the possible links between genes and between diseases. We also develop a system to visualize and get a better understanding about the relationships among diseases and genes.

[1]  M. Newman,et al.  Vertex similarity in networks. , 2005, Physical review. E, Statistical, nonlinear, and soft matter physics.

[2]  A. Butte,et al.  Creation and implications of a phenome-genome network , 2006, Nature Biotechnology.

[3]  L. Tsui,et al.  Identification of the cystic fibrosis gene: genetic analysis. , 1989, Science.

[4]  A. Barabasi,et al.  The human disease network , 2007, Proceedings of the National Academy of Sciences.

[5]  M. Maffei,et al.  Positional cloning of the mouse obese gene and its human homologue , 1994, Nature.

[6]  Albert,et al.  Emergence of scaling in random networks , 1999, Science.

[7]  M. Newman Clustering and preferential attachment in growing networks. , 2001, Physical review. E, Statistical, nonlinear, and soft matter physics.

[8]  Jennifer Widom,et al.  SimRank: a measure of structural-context similarity , 2002, KDD.

[9]  Derek de Solla Price,et al.  A general theory of bibliometric and other cumulative advantage processes , 1976, J. Am. Soc. Inf. Sci..

[10]  Andrew B. Kahng,et al.  New spectral methods for ratio cut partitioning and clustering , 1991, IEEE Trans. Comput. Aided Des. Integr. Circuits Syst..

[11]  David Liben-Nowell,et al.  The link-prediction problem for social networks , 2007 .

[12]  M. Hodson,et al.  Identification of the cystic fibrosis gene. , 1990, BMJ.

[13]  D. R. Fulkerson,et al.  Maximal Flow Through a Network , 1956 .

[14]  Xiaolong Zhang,et al.  CollabSeer: a search engine for collaboration discovery , 2011, JCDL '11.

[15]  Hung-Hsuan Chen,et al.  Discovering missing links in networks using vertex similarity measures , 2012, SAC '12.

[16]  Jon Kleinberg,et al.  The link prediction problem for social networks , 2003, CIKM '03.

[17]  Albert-László Barabási,et al.  The Architecture of Biological Networks , 2006 .

[18]  Laurent Mottron,et al.  SYN1 loss-of-function mutations in autism and partial epilepsy cause impaired synaptic function. , 2011, Human molecular genetics.

[19]  M E J Newman,et al.  Community structure in social and biological networks , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[20]  Jitendra Malik,et al.  Normalized cuts and image segmentation , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[21]  L. Tsui,et al.  Erratum: Identification of the Cystic Fibrosis Gene: Genetic Analysis , 1989, Science.

[22]  Gerard Salton,et al.  Automatic Text Processing: The Transformation, Analysis, and Retrieval of Information by Computer , 1989 .

[23]  A. Barabasi,et al.  Evolution of the social network of scientific collaborations , 2001, cond-mat/0104162.

[24]  M. Maffei,et al.  Positional cloning of the mouse obese gene and its human homologue , 1995, Nature.

[25]  E. Levanon,et al.  Preferential attachment in the protein network evolution. , 2003, Physical review letters.

[26]  Srinivasan Parthasarathy,et al.  Markov clustering of protein interaction networks with improved balance and scalability , 2010, BCB '10.

[27]  Francis S. Collins,et al.  Positional cloning moves from perditional to traditional , 1995, Nature Genetics.