Discovery of functional protein groups by clustering community links and integration of ontological knowledge

In this paper we cluster data from protein networks and integrate the results with chemical databases and ontologies to investigate functional links between related disease states. It is well know that certain genes participate in more than one function and if they are defective are likely to be responsible for several health problems. Furthermore, genes tend to cooperate in associated networks or cascades often with ’crosstalk’ between networks which can subtly alter cellular functions. Understanding the complexity and role of the various cell functions and mechanisms requires the use of computational models to make inferences and link together the interplay between genes, proteins and chemical interactions. A deeper understanding of the mechanisms of diseases will eventually be of benefit for the development new and improved therapies. The particular disease state we investigate in this work is cystinosis which is characterized by the widespread deposition of the amino acid cystine in cells due to a defect in cystine transport. In cystinosis, cystine accumulates in the lysosomes and eventually forms crystals throughout the body causing problems in the kidneys and the eyes. The defect is caused by a mutation in the CTNS gene and this forms the starting point for our investigation.

[1]  Eric J. Deeds,et al.  A simple physical model for scaling in protein-protein interaction networks. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[2]  C. Antignac,et al.  Molecular pathogenesis of cystinosis: effect of CTNS mutations on the transport activity and subcellular localization of cystinosin. , 2004, Human molecular genetics.

[3]  Sheila Garfield,et al.  Recent trends in knowledge and data integration for the life sciences , 2006, Expert Syst. J. Knowl. Eng..

[4]  Damian Szklarczyk,et al.  STITCH 3: zooming in on protein–chemical interactions , 2011, Nucleic Acids Res..

[5]  Hiroyuki Ogata,et al.  KEGG: Kyoto Encyclopedia of Genes and Genomes , 1999, Nucleic Acids Res..

[6]  M. DePamphilis,et al.  HUMAN DISEASE , 1957, The Ulster Medical Journal.

[7]  Uri Alon,et al.  Efficient sampling algorithm for estimating subgraph concentrations and detecting network motifs , 2004, Bioinform..

[8]  Shigehiko Kanaya,et al.  Development and implementation of an algorithm for detection of protein complexes in large interaction networks , 2006, BMC Bioinformatics.

[9]  Xiaohua Hu,et al.  Mining and analysing scale-free proteinprotein interaction network , 2005, Int. J. Bioinform. Res. Appl..

[10]  A. Barabasi,et al.  The human disease network , 2007, Proceedings of the National Academy of Sciences.

[11]  Pavel Tomancak,et al.  linkcomm: an R package for the generation, visualization, and analysis of link communities in networks of arbitrary size and type , 2011, Bioinform..

[12]  Krzysztof J. Cios,et al.  Protein annotation from protein interaction networks and Gene Ontology , 2011, J. Biomed. Informatics.

[13]  Sune Lehmann,et al.  Link communities reveal multiscale complexity in networks , 2009, Nature.

[14]  Duncan J. Watts,et al.  Collective dynamics of ‘small-world’ networks , 1998, Nature.

[15]  C. Antignac,et al.  Cystinosis: from gene to disease. , 2002, Nephrology, dialysis, transplantation : official publication of the European Dialysis and Transplant Association - European Renal Association.

[16]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[17]  P. Aloy,et al.  Unveiling the role of network and systems biology in drug discovery. , 2010, Trends in pharmacological sciences.

[18]  Yongsheng Ding,et al.  Dynamic and collective analysis of membrane protein interaction network based on gene regulatory network model , 2012, Neurocomputing.

[19]  J. Feingold,et al.  Infantile cystinosis in France: genetics, incidence, geographic distribution. , 1976, Journal of medical genetics.

[20]  S. Urien,et al.  Population pharmacokinetics and pharmacodynamics of cysteamine in nephropathic cystinosis patients , 2011, Orphanet journal of rare diseases.

[21]  F. Emma,et al.  Modulation of CTNS gene expression by intracellular thiols. , 2010, Free radical biology & medicine.

[22]  Gang Feng,et al.  Disease Ontology: a backbone for disease semantic integration , 2011, Nucleic Acids Res..

[23]  Damian Szklarczyk,et al.  The STRING database in 2011: functional interaction networks of proteins, globally integrated and scored , 2010, Nucleic Acids Res..

[24]  Anthony J. T. Lee,et al.  Mining Dense Overlapping Subgraphs in weighted protein-protein interaction networks , 2011, Biosyst..

[25]  Xiang Li,et al.  DOSim: An R package for similarity between diseases based on Disease Ontology , 2011, BMC Bioinformatics.

[26]  S. Shen-Orr,et al.  Network motifs: simple building blocks of complex networks. , 2002, Science.

[27]  M. Ashburner,et al.  Gene Ontology: tool for the unification of biology , 2000, Nature Genetics.

[28]  C. Antignac,et al.  Late-onset nephropathic cystinosis: clinical presentation, outcome, and genotyping. , 2008, Clinical journal of the American Society of Nephrology : CJASN.

[29]  Susumu Goto,et al.  KEGG: Kyoto Encyclopedia of Genes and Genomes , 2000, Nucleic Acids Res..

[30]  Giles Oatley,et al.  A multi-layered approach to protein data integration for diabetes research , 2007, Artif. Intell. Medicine.

[31]  Bruce A Barshop,et al.  Twice-daily cysteamine bitartrate therapy for children with cystinosis. , 2010, The Journal of pediatrics.

[32]  Jianzhi Zhang,et al.  Why Do Hubs Tend to Be Essential in Protein Networks? , 2006, PLoS genetics.

[33]  Ting Chen,et al.  Mapping gene ontology to proteins based on protein-protein interaction data , 2004, Bioinform..

[34]  S. Shen-Orr,et al.  Networks Network Motifs : Simple Building Blocks of Complex , 2002 .

[35]  Judith A. Blake,et al.  Beyond the data deluge: Data integration and bio-ontologies , 2006, J. Biomed. Informatics.

[36]  F. Emma,et al.  Pathogenesis of cell dysfunction in nephropathic cystinosis , 2008 .

[37]  Chris Wiggins,et al.  ARACNE: An Algorithm for the Reconstruction of Gene Regulatory Networks in a Mammalian Cellular Context , 2004, BMC Bioinformatics.