Pathway-based Analysis Tools for Complex Diseases: A Review

Genetic studies are traditionally based on single-gene analysis. The use of these analyses can pose tremendous challenges for elucidating complicated genetic interplays involved in complex human diseases. Modern pathway-based analysis provides a technique, which allows a comprehensive understanding of the molecular mechanisms underlying complex diseases. Extensive studies utilizing the methods and applications for pathway-based analysis have significantly advanced our capacity to explore large-scale omics data, which has rapidly accumulated in biomedical fields. This article is a comprehensive review of the pathway-based analysis methods—the powerful methods with the potential to uncover the biological depths of the complex diseases. The general concepts and procedures for the pathway-based analysis methods are introduced and then, a comprehensive review of the major approaches for this analysis is presented. In addition, a list of available pathway-based analysis software and databases is provided. Finally, future directions and challenges for the methodological development and applications of pathway-based analysis techniques are discussed. This review will provide a useful guide to dissect complex diseases.

[1]  Peter D. Karp,et al.  The MetaCyc Database , 2002, Nucleic Acids Res..

[2]  Mario Medvedovic,et al.  LRpath: a logistic regression approach for identifying enriched biological groups in gene expression data , 2009, Bioinform..

[3]  Jason H. Moore,et al.  BIOINFORMATICS REVIEW , 2005 .

[4]  Y. Pawitan,et al.  Strategies and issues in the detection of pathway enrichment in genome-wide association studies , 2009, Human Genetics.

[5]  Deanne M. Taylor,et al.  Powerful SNP-set analysis for case-control genome-wide association studies. , 2010, American journal of human genetics.

[6]  D. Maraganore,et al.  A Genomic Pathway Approach to a Complex Disease: Axon Guidance and Parkinson Disease , 2007, PLoS genetics.

[7]  C Charles Gu,et al.  Variable set enrichment analysis in genome-wide association studies , 2011, European Journal of Human Genetics.

[8]  Peilin Jia,et al.  Gene set analysis of genome-wide association studies: methodological issues and perspectives. , 2011, Genomics.

[9]  Andrew B. Nobel,et al.  Significance analysis of functional categories in gene expression studies: a structured permutation approach , 2005, Bioinform..

[10]  Momiao Xiong,et al.  Gene and pathway-based second-wave analysis of genome-wide association studies , 2010, European Journal of Human Genetics.

[11]  Judy H. Cho,et al.  Pathway analysis comparison using Crohn's disease genome wide association studies , 2010, BMC Medical Genomics.

[12]  P. Khatri,et al.  A systems biology approach for pathway level analysis. , 2007, Genome research.

[13]  Denis Thieffry,et al.  RegulonDB: a database on transcriptional regulation in Escherichia coli , 1998, Nucleic Acids Res..

[14]  Sangsoo Kim,et al.  GSA-SNP: a general approach for gene set analysis of polymorphisms , 2010, Nucleic Acids Res..

[15]  Pooja Mittal,et al.  A novel signaling pathway impact analysis , 2009, Bioinform..

[16]  Susumu Goto,et al.  The KEGG resource for deciphering the genome , 2004, Nucleic Acids Res..

[17]  Monica Chiogna,et al.  Gene set analysis exploiting the topology of a pathway , 2010, BMC Systems Biology.

[18]  D. Thomas,et al.  Gene–environment-wide association studies: emerging approaches , 2010, Nature Reviews Genetics.

[19]  Youping Deng,et al.  Exploring the pathogenetic association between schizophrenia and type 2 diabetes mellitus diseases based on pathway analysis , 2013, BMC Medical Genomics.

[20]  C. Wijmenga,et al.  Using genome‐wide pathway analysis to unravel the etiology of complex diseases , 2009, Genetic epidemiology.

[21]  Michael C Wu,et al.  Prior biological knowledge-based approaches for the analysis of genome-wide expression profiles using gene sets and pathways , 2009, Statistical methods in medical research.

[22]  Peilin Jia,et al.  Common variants conferring risk of schizophrenia: A pathway analysis of GWAS data , 2010, Schizophrenia Research.

[23]  Hong Wang,et al.  Prioritizing risk pathways: a novel association approach to searching for disease pathways fusing SNPs and pathways , 2009, Bioinform..

[24]  H. Hakonarson,et al.  Analysing biological pathways in genome-wide association studies , 2010, Nature Reviews Genetics.

[25]  J. Dopazo,et al.  Functional genomics and networks: new approaches in the extraction of complex gene modules , 2010, Expert review of proteomics.

[26]  Monica Chiogna,et al.  Along signal paths: an empirical gene set approach exploiting pathway topology , 2012, Nucleic acids research.

[27]  M. Xiong,et al.  Genome-wide gene and pathway analysis , 2010, European Journal of Human Genetics.

[28]  Atul J. Butte,et al.  Ten Years of Pathway Analysis: Current Approaches and Outstanding Challenges , 2012, PLoS Comput. Biol..

[29]  H. Cordell Detecting gene–gene interactions that underlie human diseases , 2009, Nature Reviews Genetics.

[30]  Hiroyuki Ogata,et al.  KEGG: Kyoto Encyclopedia of Genes and Genomes , 1999, Nucleic Acids Res..

[31]  Eleftheria Zeggini,et al.  Finding common susceptibility variants for complex disease: past, present and future , 2009, Briefings in functional genomics & proteomics.

[32]  P. Khatri,et al.  Global functional profiling of gene expression. , 2003, Genomics.

[33]  Charles A Tilford,et al.  Gene set enrichment analysis. , 2009, Methods in molecular biology.

[34]  Dawei Liu,et al.  Estimation and testing for the effect of a genetic pathway on a disease outcome using logistic kernel machine regression via logistic mixed models , 2008, BMC Bioinformatics.

[35]  Hanlee P. Ji,et al.  Next-generation DNA sequencing , 2008, Nature Biotechnology.

[36]  Lenore K. Beitel,et al.  Selection and mutation in the “new” genetics: an emerging hypothesis , 2010, Human Genetics.

[37]  J H Moore,et al.  The Pathway Less Traveled: Moving from Candidate Genes to Candidate Pathways in the Analysis of Genome-Wide Data from Large Scale Pharmacogenetic Association Studies. , 2008, Current pharmacogenomics and personalized medicine.

[38]  Chiara Sabatti,et al.  Human genetics: Variants in common diseases , 2007, Nature.

[39]  Gabriele Sales,et al.  graphite - a Bioconductor package to convert pathway topology to gene network , 2012, BMC Bioinformatics.

[40]  Paolo G. V. Martini,et al.  Graphite Web: web tool for gene set analysis exploiting pathway topology , 2013, Nucleic Acids Res..

[41]  E Birney,et al.  The Genome Knowledgebase: a resource for biologists and bioinformaticists. , 2003, Cold Spring Harbor symposia on quantitative biology.

[42]  C. Carlson,et al.  Mapping complex disease loci in whole-genome association studies , 2004, Nature.

[43]  Jason H. Moore,et al.  Pathway analysis of genomic data: concepts, methods, and prospects for future development. , 2012, Trends in genetics : TIG.

[44]  M. Khoury,et al.  Tracking the epidemiology of human genes in the literature: the HuGE Published Literature database. , 2006, American journal of epidemiology.

[45]  X. Chen,et al.  Pathway‐based analysis for genome‐wide association studies using supervised principal components , 2010, Genetic epidemiology.

[46]  X. Wen,et al.  Gene, region and pathway level analyses in whole‐genome studies , 2009, Genetic epidemiology.

[47]  Joaquín Dopazo,et al.  Gene set-based analysis of polymorphisms: finding pathways or biological processes associated to traits in genome-wide association studies , 2009, Nucleic Acids Res..

[48]  Ugo Covani,et al.  Bioinformatics and Data Mining Studies in Oral Genomics and Proteomics: New Trends and Challenges , 2010, The open dentistry journal.

[49]  P. Rosenberg,et al.  Pathway analysis by adaptive combination of P‐values , 2009, Genetic epidemiology.

[50]  M. Orešič,et al.  Pathways to the analysis of microarray data. , 2005, Trends in biotechnology.

[51]  Aaron A. Klammer,et al.  Nonhybrid, finished microbial genome assemblies from long-read SMRT sequencing data , 2013, Nature Methods.

[52]  Kai Wang,et al.  Pathway-based approaches for analysis of genomewide association studies. , 2007, American journal of human genetics.

[53]  Y. Benjamini,et al.  Controlling the false discovery rate in behavior genetics research , 2001, Behavioural Brain Research.

[54]  Zhaohui S. Qin,et al.  A second generation human haplotype map of over 3.1 million SNPs , 2007, Nature.

[55]  Scott M. Williams,et al.  Epistasis and its implications for personal genetics. , 2009, American journal of human genetics.

[56]  Pablo Tamayo,et al.  Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[57]  Sung Jae Choi,et al.  Genome-wide pathway analysis of genome-wide association studies on systemic lupus erythematosus and rheumatoid arthritis , 2012, Molecular Biology Reports.

[58]  M. Campbell,et al.  PANTHER: a library of protein families and subfamilies indexed by function. , 2003, Genome research.

[59]  Momiao Xiong,et al.  Pathway analysis with next-generation sequencing data , 2014, European Journal of Human Genetics.

[60]  Elizabeth A. Heron,et al.  The SNP ratio test: pathway analysis of genome-wide association datasets , 2009, Bioinform..

[61]  Manuel A. R. Ferreira,et al.  Gene ontology analysis of GWA study data sets provides insights into the biology of bipolar disorder. , 2009, American journal of human genetics.

[62]  J. Stephenson 1000 Genomes Project , 2008 .

[63]  Yuan Chen,et al.  A new permutation strategy of pathway-based approach for genome-wide association study , 2009, BMC Bioinformatics.