Mining Functional Gene Modules Linked with Rheumatoid Arthritis Using a SNP-SNP Network

The identification of functional gene modules that are derived from integration of information from different types of networks is a powerful strategy for interpreting the etiology of complex diseases such as rheumatoid arthritis (RA). Genetic variants are known to increase the risk of developing RA. Here, a novel method, the construction of a genetic network, was used to mine functional gene modules linked with RA. A polymorphism interaction analysis (PIA) algorithm was used to obtain cooperating single nucleotide polymorphisms (SNPs) that contribute to RA disease. The acquired SNP pairs were used to construct a SNP-SNP network. Sub-networks defined by hub SNPs were then extracted and turned into gene modules by mapping SNPs to genes using dbSNP database. We performed Gene Ontology (GO) analysis on each gene module, and some GO terms enriched in the gene modules can be used to investigate clustered gene function for better understanding RA pathogenesis. This method was applied to the Genetic Analysis Workshop 15 (GAW 15) RA dataset. The results show that genes involved in functional gene modules, such as CD160 (rs744877) and RUNX1 (rs2051179), are especially relevant to RA, which is supported by previous reports. Furthermore, the 43 SNPs involved in the identified gene modules were found to be the best classifiers when used as variables for sample classification.

[1]  P. Shannon,et al.  Cytoscape: a software environment for integrated models of biomolecular interaction networks. , 2003, Genome research.

[2]  P. Le Bouteiller,et al.  Cutting Edge: Engagement of CD160 by its HLA-C Physiological Ligand Triggers a Unique Cytokine Profile Secretion in the Cytotoxic Peripheral Blood NK Cell Subset1 , 2004, The Journal of Immunology.

[3]  Falk Schreiber,et al.  MAVisto: a tool for the exploration of network motifs , 2005, Bioinform..

[4]  Gregory Gutin,et al.  Traveling salesman should not be greedy: domination analysis of greedy-type heuristics for the TSP , 2001, Discret. Appl. Math..

[5]  S. Peng,et al.  Mechanisms of Disease: transcription factors in inflammatory arthritis , 2006, Nature Clinical Practice Rheumatology.

[6]  Marit Holden,et al.  GSEA-SNP: applying gene set enrichment analysis to SNP data from genome-wide association studies , 2008, Bioinform..

[7]  Ying Xu,et al.  Prediction of functional modules based on comparative genome analysis and Gene Ontology application , 2005, Nucleic acids research.

[8]  N. Schork,et al.  Pathway analysis of seven common diseases assessed by genome-wide association. , 2008, Genomics.

[9]  Purvesh Khatri,et al.  Onto-Tools: an ensemble of web-accessible, ontology-based tools for the functional design and interpretation of high-throughput gene expression experiments , 2004, Nucleic Acids Res..

[10]  Pat Langley,et al.  Estimating Continuous Distributions in Bayesian Classifiers , 1995, UAI.

[11]  Masahide Horiba,et al.  OTK18, a zinc-finger protein, regulates human immunodeficiency virus type 1 long terminal repeat through two distinct regulatory regions. , 2007, Journal of General Virology.

[12]  Albert Y. Zomaya,et al.  A genetic ensemble approach for gene-gene interaction identification , 2010, BMC Bioinformatics.

[13]  C. Wijmenga,et al.  Reconstruction of a functional human gene network, with an application for prioritizing positional candidate genes. , 2006, American journal of human genetics.

[14]  Nello Cristianini,et al.  Support vector machine classification and validation of cancer tissue samples using microarray expression data , 2000, Bioinform..

[15]  J. Hopfield,et al.  From molecular to modular cell biology , 1999, Nature.

[16]  R. Elston,et al.  The investigation of linkage between a quantitative trait and a marker locus , 1972, Behavior genetics.

[17]  Yadong Wang,et al.  Constructing disease-specific gene networks using pair-wise relevance metric: Application to colon cancer identifies interleukin 8, desmin and enolase 1 as the central elements , 2008, BMC Systems Biology.

[18]  Leah E. Mechanic,et al.  Exploring SNP‐SNP interactions and colon cancer risk using polymorphism interaction analysis , 2006, International journal of cancer.

[19]  Michael Egmont-Petersen,et al.  Image processing with neural networks - a review , 2002, Pattern Recognit..

[20]  E. Domany,et al.  Potts ferromagnets on coexpressed gene networks: identifying maximally stable partitions. , 2003, Physical review letters.

[21]  Iuliana Ionita-Laza,et al.  Constructing gene association networks for rheumatoid arthritis using the backward genotype-trait association (BGTA) algorithm , 2007, BMC proceedings.

[22]  C. B. Kristensen Plasma protein binding of imipramine in patients with rheumatoid arthritis , 2004, European Journal of Clinical Pharmacology.

[23]  A. Barabasi,et al.  Network biology: understanding the cell's functional organization , 2004, Nature Reviews Genetics.

[24]  Stephen J. Chanock,et al.  Polymorphism Interaction Analysis (PIA): a method for investigating complex gene-gene interactions , 2008, BMC Bioinformatics.

[25]  H. Mewes,et al.  Functional modules by relating protein interaction networks and gene expression. , 2003, Nucleic acids research.

[26]  R. Drouin,et al.  ABC50, a novel human ATP-binding cassette protein found in tumor necrosis factor-alpha-stimulated synoviocytes. , 1998, Genomics.

[27]  T. Gilliam,et al.  Genetic-linkage mapping of complex hereditary disorders to a whole-genome molecular-interaction network. , 2008, Genome research.

[28]  M. Gröger,et al.  Overexpression of transcription factor Ets-1 in rheumatoid arthritis synovial membrane: regulation of expression and activation by interleukin-1 and tumor necrosis factor alpha. , 2001, Arthritis and rheumatism.

[29]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[30]  J. Savige,et al.  ANTI‐NEUTROPHIL CYTOPLASM ANTIBODIES (ANCA) IN A PATIENT WITH THE VASCULITIS OF MYELODYSPLASIA , 1991, British journal of haematology.

[31]  P. Anderson,et al.  Geldanamycin inhibits the production of inflammatory cytokines in activated macrophages by reducing the stability and translation of cytokine transcripts. , 2003, Arthritis and rheumatism.

[32]  J. H. Moore,et al.  Multifactor-dimensionality reduction reveals high-order interactions among estrogen-metabolism genes in sporadic breast cancer. , 2001, American journal of human genetics.

[33]  Yun Xiao,et al.  A systematic method for mapping multiple loci: an application to construct a genetic network for rheumatoid arthritis. , 2008, Gene.

[34]  Céline Rouveirol,et al.  Identification of functional modules based on transcriptional regulation structure , 2008, BMC proceedings.

[35]  G. Schmitz,et al.  ATP-Binding Cassette Transporter A1 (ABCA1) in Macrophages: A Dual Function in Inflammation and Lipid Metabolism? , 2000, Pathobiology.

[36]  Daniel Hanisch,et al.  Co-clustering of biological networks and gene expression data , 2002, ISMB.

[37]  M. Daly,et al.  Identifying Relationships among Genomic Disease Regions: Predicting Genes at Pathogenic SNP Associations and Rare Deletions , 2009, PLoS genetics.

[38]  Lisa J. Martin,et al.  Genetic Analysis Workshop 15: gene expression analysis and approaches to detecting multiple functional loci , 2007, BMC proceedings.

[39]  Kai Wang,et al.  Pathway-based approaches for analysis of genomewide association studies. , 2007, American journal of human genetics.

[40]  Xia Li,et al.  Gene mining: a novel and powerful ensemble decision approach to hunting for disease genes using microarray expression profiling. , 2004, Nucleic acids research.