Integrated network analysis of symptom clusters across disease conditions

Identifying the symptom clusters (two or more related symptoms) with shared underlying molecular mechanisms has been a vital analysis task to promote the symptom science and precision health. Related studies have applied the clustering algorithms (e.g. k-means, latent class model) to detect the symptom clusters mostly from various kinds of clinical data. In addition, they focused on identifying the symptom clusters (SCs) for a specific disease, which also mainly concerned with the clinical regularities for symptom management. Here, we utilized a network-based clustering algorithm (i.e., BigCLAM) to obtain 208 typical SCs across disease conditions on a large-scale symptom network derived from integrated high-quality disease-symptom associations. Furthermore, we evaluated the underlying shared molecular mechanisms for SCs, i.e., shared genes, protein-protein interaction (PPI) and gene functional annotations using integrated networks and similarity measures. We found that the symptoms in the same SCs tend to share a higher degree of genes, PPIs and have higher functional homogeneities. In addition, we found that most SCs have related symptoms with shared underlying molecular mechanisms (e.g. enriched pathways) across different disease conditions. Our work demonstrated that the integrated network analysis method could be used for identifying robust SCs and investigate the molecular mechanisms of these SCs, which would be valuable for symptom science and precision health.

[1]  T. Spector,et al.  A mutation in COL9A1 causes multiple epiphyseal dysplasia: further evidence for locus heterogeneity. , 2001, American journal of human genetics.

[2]  Takuji Nishimura,et al.  Mersenne twister: a 623-dimensionally equidistributed uniform pseudo-random number generator , 1998, TOMC.

[3]  Hee-Ju Kim,et al.  Common Biological Pathways Underlying the Psychoneurological Symptom Cluster in Cancer Patients , 2012, Cancer nursing.

[4]  Shuhui Liu,et al.  Improving the measurement of semantic similarity by combining gene ontology and co-functional network: a random walk based approach , 2018, BMC Systems Biology.

[5]  Reinhard Guthke,et al.  Identification of intra-group, inter-individual, and gene-specific variances in mRNA expression profiles in the rheumatoid arthritis synovial membrane , 2008, Arthritis research & therapy.

[6]  D. Luo,et al.  Erythema associated with pain and warmth on face and ears: a variant of erythermalgia or red ear syndrome? , 2014, The Journal of Headache and Pain.

[7]  Jure Leskovec,et al.  Community Detection in Networks with Node Attributes , 2013, 2013 IEEE 13th International Conference on Data Mining.

[8]  L. Tulman,et al.  Symptom Clusters: Concept Analysis and Clinical Implications for Cancer Nursing , 2005, Cancer nursing.

[9]  D. Walsh,et al.  The Effect of the Pain Symptom Cluster on Performance in Women Diagnosed with Advanced Breast Cancer: The Mediating Role of the Psychoneurological Symptom Cluster , 2018, Pain management nursing : official journal of the American Society of Pain Management Nurses.

[10]  J. Saklatvala,et al.  Joint immobilization prevents murine osteoarthritis and reveals the highly mechanosensitive nature of protease expression in vivo. , 2012, Arthritis and rheumatism.

[11]  Kent A. Spackman,et al.  Review: Representing Thoughts, Words, and Things in the UMLS , 1998, J. Am. Medical Informatics Assoc..

[12]  Rong Fei,et al.  Critical microRNAs and regulatory motifs in cleft palate identified by a conserved miRNA-TF-gene network approach in humans and mice , 2019, Briefings Bioinform..

[13]  A. Barabasi,et al.  The human disease network , 2007, Proceedings of the National Academy of Sciences.

[14]  Zhisong Pan,et al.  Identifying influential nodes based on network representation learning in complex networks , 2018, PloS one.

[15]  T. Yamawaki,et al.  Immunopositivity for ESCRT-III subunit CHMP2B in granulovacuolar degeneration of neurons in the Alzheimer's disease hippocampus , 2010, Neuroscience Letters.

[16]  D. Stryer,et al.  Practical clinical trials: increasing the value of clinical research for decision making in clinical and health policy. , 2003, JAMA.

[17]  Tudor Groza,et al.  The Human Phenotype Ontology in 2017 , 2016, Nucleic Acids Res..

[18]  A. Reeve,et al.  Characterisation of the cannabinoid receptor system in synovial tissue and fluid in patients with osteoarthritis and rheumatoid arthritis , 2008, Arthritis research & therapy.

[19]  D. Walsh,et al.  Review: Symptom clusters: myth or reality? , 2010, Palliative medicine.

[20]  Sanghamitra Bandyopadhyay,et al.  WeCoMXP: Weighted Connectivity Measure Integrating Co-Methylation, Co-Expression and Protein-Protein Interactions for Gene-Module Detection , 2020, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[21]  Wei Chen,et al.  Overlapping Community Detection in Directed Heterogeneous Social Network , 2015, WAIM.

[22]  Ponrathi Athilingam,et al.  Symptom clusters in chronic obstructive pulmonary disease: A systematic review. , 2019, Applied nursing research : ANR.

[23]  Tyrone D. Cannon,et al.  Latent class cluster analysis of symptom ratings identifies distinct subgroups within the clinical high risk for psychosis syndrome , 2017, Schizophrenia Research.

[24]  Jure Leskovec,et al.  Overlapping community detection at scale: a nonnegative matrix factorization approach , 2013, WSDM.

[25]  M. Dodd,et al.  Symptom clusters: the new frontier in symptom management research. , 2004, Journal of the National Cancer Institute. Monographs.

[26]  Lei Lei,et al.  A Systems Approach to Refine Disease Taxonomy by Integrating Phenotypic and Molecular Networks , 2017, bioRxiv.

[27]  D. Heinegård,et al.  Changes in cartilage and bone metabolism identified by serum markers in early osteoarthritis of the knee joint. , 1998, British journal of rheumatology.

[28]  Y. Conley,et al.  Symptom Science: Omics Supports Common Biological Underpinnings Across Symptoms , 2018, Biological research for nursing.

[29]  T. Siddique,et al.  UBQLN2/P62 cellular recycling pathways in amyotrophic lateral sclerosis and frontotemporal dementia , 2012, Muscle & nerve.

[30]  J. Levine,et al.  Cytokine Gene Polymorphisms Associated With Symptom Clusters in Oncology Patients Undergoing Radiation Therapy. , 2017, Journal of pain and symptom management.

[31]  Manfred Stommel,et al.  Symptom clusters in elderly patients with lung cancer. , 2004, Oncology nursing forum.

[32]  R. Rescorla Pavlovian conditioning and its proper control procedures. , 1967, Psychological review.

[33]  N. Uysal,et al.  Identification of Symptom Clusters in Cancer Patients at Palliative Care Clinic , 2017, Asia-Pacific journal of oncology nursing.

[34]  M. McHugh,et al.  The Chi-square test of independence , 2013, Biochemia medica.

[35]  Yi Pan,et al.  Biological network motif detection and evaluation , 2011, BMC Systems Biology.

[36]  Albert-László Barabási,et al.  Network-based prediction of drug combinations , 2019, Nature Communications.

[37]  L. Bolund,et al.  Patient iPSC-Derived Neurons for Disease Modeling of Frontotemporal Dementia with Mutation in CHMP2B , 2022 .

[38]  A. Barabasi,et al.  Human symptoms–disease network , 2014, Nature Communications.

[39]  T. Elston,et al.  Stochasticity in gene expression: from theories to phenotypes , 2005, Nature Reviews Genetics.

[40]  E. Choy,et al.  The role of Interleukin 6 in the pathophysiology of rheumatoid arthritis , 2010, Therapeutic advances in musculoskeletal disease.

[41]  Younglim Lee,et al.  Monosodium iodoacetate-induced joint pain is associated with increased phosphorylation of mitogen activated protein kinases in the rat spinal cord , 2011, Molecular pain.

[42]  R. Sandler,et al.  Symptom clusters in adults with inflammatory bowel disease , 2017, Research in nursing & health.

[43]  Doron Lancet,et al.  MalaCards: an amalgamated human disease compendium with diverse clinical and genetic annotation and structured search , 2016, Nucleic Acids Res..

[44]  Damian Szklarczyk,et al.  STRING v11: protein–protein association networks with increased coverage, supporting functional discovery in genome-wide experimental datasets , 2018, Nucleic Acids Res..

[45]  S. Weisbord,et al.  Comparison of Fatigue, Pain, and Depression in Patients With Advanced Kidney Disease and Cancer-Symptom Burden and Clusters. , 2019, Journal of pain and symptom management.

[46]  Ke Wang,et al.  Mining Disease-Symptom Relation from Massive Biomedical Literature and Its Application in Severe Disease Diagnosis , 2018, AMIA.

[47]  J. Scheller,et al.  Interleukin‐6 biology is coordinated by membrane‐bound and soluble receptors: role in inflammation and cancer , 2006, Journal of leukocyte biology.

[48]  Philip S. Yu,et al.  A new method to measure the semantic similarity of GO terms , 2007, Bioinform..

[49]  Phillip W. Lord,et al.  Semantic Similarity in Biomedical Ontologies , 2009, PLoS Comput. Biol..

[50]  Andrey Rzhetsky,et al.  DiseaseConnect: a comprehensive web server for mechanism-based disease–disease connections , 2014, Nucleic Acids Res..

[51]  Núria Queralt-Rosinach,et al.  DisGeNET: a comprehensive platform integrating information on human disease-associated genes and variants , 2016, Nucleic Acids Res..

[52]  Pingping Wang,et al.  Computational Methods for Identifying Similar Diseases , 2019, Molecular therapy. Nucleic acids.

[53]  Christine Miaskowski,et al.  Occurrence of symptom clusters. , 2004, Journal of the National Cancer Institute. Monographs.

[54]  Ana Cernea,et al.  Genomic risk prediction of aromatase inhibitor‐related arthralgia in patients with breast cancer using a novel machine‐learning algorithm , 2017, Cancer medicine.

[55]  Brad T. Sherman,et al.  DAVID: Database for Annotation, Visualization, and Integrated Discovery , 2003, Genome Biology.

[56]  Olivier Bodenreider,et al.  The Unified Medical Language System (UMLS): integrating biomedical terminology , 2004, Nucleic Acids Res..

[57]  Anirban Mukhopadhyay,et al.  A Survey and Comparative Study of Statistical Tests for Identifying Differential Expression from Microarray Data , 2014, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[58]  Michelle Giglio,et al.  Human Disease Ontology 2018 update: classification, content and workflow expansion , 2018, Nucleic Acids Res..

[59]  S. Barrett,et al.  Central role of the MEK/ERK MAP kinase pathway in a mouse model of rheumatoid arthritis: potential proinflammatory mechanisms. , 2007, Arthritis and rheumatism.

[60]  D. Kingsley,et al.  Mutations in ANKH cause chondrocalcinosis. , 2002, American journal of human genetics.

[61]  A. Thathiah,et al.  The role of GPCRs in neurodegenerative diseases: avenues for therapeutic intervention , 2017, Current opinion in pharmacology.

[62]  E. Bose,et al.  Diabetes Changes Symptoms Cluster Patterns in Persons Living With HIV , 2017, The Journal of the Association of Nurses in AIDS Care : JANAC.

[63]  Junmin Zhao,et al.  MultiSourcDSim: an integrated approach for exploring disease similarity , 2019, BMC Medical Informatics Decis. Mak..

[64]  H. Lowe,et al.  Understanding and using the medical subject headings (MeSH) vocabulary to perform literature searches. , 1994, JAMA.

[65]  C. Fan,et al.  Effects and relationship of ERK1 and ERK2 in interleukin-1β-induced alterations in MMP3, MMP13, type II collagen and aggrecan expression in human chondrocytes. , 2011, International journal of molecular medicine.

[66]  Ujjwal Maulik,et al.  MiRNA-TF-gene network analysis through ranking of biomolecules for multi-informative uterine leiomyoma dataset , 2015, J. Biomed. Informatics.

[67]  R. Nussbaum,et al.  Vascular pathology of medial arterial calcifications in NT5E deficiency: implications for the role of adenosine in pseudoxanthoma elasticum. , 2011, Molecular genetics and metabolism.

[68]  R. Faull,et al.  Gamma-aminobutyric acid A receptors in Alzheimer's disease: highly localized remodeling of a complex and diverse signaling pathway , 2018, Neural regeneration research.

[69]  Ying Li,et al.  Constructing and analyzing a disease network based on proteins , 2019, E3S Web of Conferences.

[70]  Ning Wang,et al.  Heterogeneous network embedding for identifying symptom candidate genes , 2018, J. Am. Medical Informatics Assoc..

[71]  F. Dhombres,et al.  Representation of rare diseases in health information systems: The orphanet approach to serve a wide range of end users , 2012, Human mutation.

[72]  Dekang Lin,et al.  An Information-Theoretic Definition of Similarity , 1998, ICML.

[73]  Chun-Hsi Huang,et al.  A survey of motif finding Web tools for detecting binding site motifs in ChIP-Seq data , 2014, Biology Direct.

[74]  A Smith,et al.  Genetic overlap between Alzheimer’s disease and Parkinson’s disease at the MAPT locus , 2015, Molecular Psychiatry.

[75]  P. Sedgwick Spearman’s rank correlation coefficient , 2018, British Medical Journal.

[76]  K. Reymann,et al.  K‐Lysine acetyltransferase 2a regulates a hippocampal gene expression network linked to memory formation , 2014, The EMBO journal.

[77]  Ujjwal Maulik,et al.  Detecting TF-miRNA-gene network based modules for 5hmC and 5mC brain samples: a intra- and inter-species case-study between human and rhesus , 2018, BMC Genetics.

[78]  Yibo Wu,et al.  GOSemSim: an R package for measuring semantic similarity among GO terms and gene products , 2010, Bioinform..