A Review of Pathway-Based Analysis Tools That Visualize Genetic Variants

Pathway analysis is a powerful method for data analysis in genomics, most often applied to gene expression analysis. It is also promising for single-nucleotide polymorphism (SNP) data analysis, such as genome-wide association study data, because it allows the interpretation of variants with respect to the biological processes in which the affected genes and proteins are involved. Such analyses support an interactive evaluation of the possible effects of variations on function, regulation or interaction of gene products. Current pathway analysis software often does not support data visualization of variants in pathways as an alternate method to interpret genetic association results, and specific statistical methods for pathway analysis of SNP data are not combined with these visualization features. In this review, we first describe the visualization options of the tools that were identified by a literature review, in order to provide insight for improvements in this developing field. Tool evaluation was performed using a computational epistatic dataset of gene–gene interactions for obesity risk. Next, we report the necessity to include in these tools statistical methods designed for the pathway-based analysis with SNP data, expressly aiming to define features for more comprehensive pathway-based analysis tools. We conclude by recognizing that pathway analysis of genetic variations data requires a sophisticated combination of the most useful and informative visual aspects of the various tools evaluated.

[1]  Dieter Schmalstieg,et al.  StratomeX: Visual Analysis of Large‐Scale Heterogeneous Genomics Data for Cancer Subtype Characterization , 2012, Comput. Graph. Forum.

[2]  John P A Ioannidis,et al.  What Should the Genome-wide Significance Threshold Be? Empirical Replication of Borderline Genetic Associations Yfor a Full List of Investigators Offering Data and Clarifications See Acknowledgments , 2022 .

[3]  Juan C. Vivar,et al.  Integrative pathway analysis of a genome-wide association study of (V)O(2max) response to exercise training. , 2013, Journal of applied physiology.

[4]  Ting Hu,et al.  Characterizing genetic interactions in human disease association studies using statistical epistasis networks , 2011, BMC Bioinformatics.

[5]  Jesús Espinal-Enríquez,et al.  Pathway Analysis: State of the Art , 2015, Front. Physiol..

[6]  E. Berns,et al.  Pathway analysis of gene lists associated with platinum-based chemotherapy resistance in ovarian cancer: the big picture. , 2010, Gynecologic oncology.

[7]  Dieter Schmalstieg,et al.  Entourage: Visualizing Relationships between Biological Pathways using Contextual Subsets , 2013, IEEE Transactions on Visualization and Computer Graphics.

[8]  Roded Sharan,et al.  Simultaneous Identification of Multiple Driver Pathways in Cancer , 2013, PLoS Comput. Biol..

[9]  Xia Yang,et al.  Integrating pathway analysis and genetics of gene expression for genome-wide association studies. , 2010, American journal of human genetics.

[10]  Mathieu Lemire,et al.  Analyses of associations with asthma in four asthma population samples from Canada and Australia , 2009, Human Genetics.

[11]  Brad T. Sherman,et al.  Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists , 2008, Nucleic acids research.

[12]  Teri A. Manolio,et al.  Bringing genome-wide association findings into clinical use , 2013, Nature Reviews Genetics.

[13]  Hongsheng Gui,et al.  Comparisons of seven algorithms for pathway analysis using the WTCCC Crohn's Disease dataset , 2011, BMC Research Notes.

[14]  K. Kasai,et al.  Epigenome-wide association study of DNA methylation in panic disorder , 2017, Clinical Epigenetics.

[15]  M. Koga,et al.  Pathway-based association analysis of genome-wide screening data suggest that genes associated with the &ggr;-aminobutyric acid receptor signaling pathway are involved in neuroleptic-induced, treatment-resistant tardive dyskinesia , 2008, Pharmacogenetics and genomics.

[16]  Kai Wang,et al.  Pathway-based approaches for analysis of genomewide association studies. , 2007, American journal of human genetics.

[17]  Chris North,et al.  Visualizing Biological Pathways: Requirements Analysis, Systems Evaluation and Research Agenda , 2005, Inf. Vis..

[18]  H. Hakonarson,et al.  Analysing biological pathways in genome-wide association studies , 2010, Nature Reviews Genetics.

[19]  Guodong Wu,et al.  Pathway‐Based Approaches for Sequencing‐Based Genome‐Wide Association Studies , 2013, Genetic epidemiology.

[20]  S. Yip,et al.  Pathway analysis of complex diseases for GWAS, extending to consider rare variants, multi-omics and interactions. , 2017, Biochimica et biophysica acta. General subjects.

[21]  P. Rosenberg,et al.  Pathway analysis by adaptive combination of P‐values , 2009, Genetic epidemiology.

[22]  J. Shendure,et al.  A general framework for estimating the relative pathogenicity of human genetic variants , 2014, Nature Genetics.

[23]  Pablo Tamayo,et al.  Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[24]  Brad T. Sherman,et al.  Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources , 2008, Nature Protocols.

[25]  Dieter Schmalstieg,et al.  enRoute: dynamic path extraction from biological pathway maps for exploring heterogeneous experimental datasets , 2013, BMC Bioinformatics.

[26]  Chris S. Haley,et al.  Detecting epistasis in human complex traits , 2014, Nature Reviews Genetics.

[27]  Yu-Chi Lee,et al.  CardioGxE, a catalog of gene-environment interactions for cardiometabolic traits , 2014, BioData Mining.

[28]  D. Moher,et al.  Preferred reporting items for systematic reviews and meta-analyses: the PRISMA statement. , 2010, International journal of surgery.

[29]  Hedi Peterson,et al.  g:Profiler—a web server for functional interpretation of gene lists (2016 update) , 2016, Nucleic Acids Res..

[30]  Henning Hermjakob,et al.  The Reactome pathway knowledgebase , 2013, Nucleic Acids Res..

[31]  Léon Personnaz,et al.  Enrichment or depletion of a GO category within a class of genes: which test? , 2007, Bioinform..

[32]  Jose M Villaveces,et al.  Tools for visualization and analysis of molecular networks, pathways, and -omics data , 2015, Advances and applications in bioinformatics and chemistry : AABC.

[33]  Steven C. Lawlor,et al.  MAPPFinder: using Gene Ontology and GenMAPP to create a global gene-expression profile from microarray data , 2003, Genome Biology.

[34]  Anita L DeStefano,et al.  Pathway analysis following association study , 2011, BMC proceedings.

[35]  M. Vidal,et al.  Edgetic perturbation models of human inherited disorders , 2009, Molecular systems biology.

[36]  M. Daly,et al.  PGC-1α-responsive genes involved in oxidative phosphorylation are coordinately downregulated in human diabetes , 2003, Nature Genetics.

[37]  Yang Liu,et al.  Integrative pathway analysis of genome-wide association studies and gene expression data in prostate cancer , 2012, BMC Systems Biology.

[38]  Florian Markowetz,et al.  How to Understand the Cell by Breaking It: Network Analysis of Gene Perturbation Screens , 2009, PLoS Comput. Biol..

[39]  Ryan Miller,et al.  WikiPathways: capturing the full diversity of pathway knowledge , 2015, Nucleic Acids Res..

[40]  David Gomez-Cabrero,et al.  Data integration in the era of omics: current and future challenges , 2014, BMC Systems Biology.

[41]  George Ellis,et al.  Path: a tool to facilitate pathway-based genetic association analysis , 2009, Bioinform..

[42]  P. Bork,et al.  A method and server for predicting damaging missense mutations , 2010, Nature Methods.

[43]  Jordi Villà-Freixa,et al.  Knowledge management for systems biology a general and visually driven framework applied to translational medicine , 2011, BMC Systems Biology.

[44]  Y. Lee,et al.  Genome-wide pathway analysis of breast cancer , 2014, Tumor Biology.

[45]  N. H. Cho,et al.  Pathway analysis of genome‐wide association datasets of personality traits , 2015, Genes, brain, and behavior.

[46]  Marina Evangelou,et al.  Comparison of Methods for Competitive Tests of Pathway Analysis , 2012, PloS one.

[47]  Nuno Nunes,et al.  PathVisio 3: An Extendable Pathway Analysis Toolbox , 2015, PLoS Comput. Biol..

[48]  S. Henikoff,et al.  Predicting the effects of coding non-synonymous variants on protein function using the SIFT algorithm , 2009, Nature Protocols.

[49]  Paul Brennan,et al.  Comparison of Pathway Analysis Approaches Using Lung Cancer GWAS Data Sets , 2012, PloS one.

[50]  Ting Hu,et al.  Characterizing gene-gene interactions in a statistical epistasis network of twelve candidate genes for obesity , 2015, BioData Mining.

[51]  M. Mooney,et al.  Gene set analysis: A step‐by‐step guide , 2015, American journal of medical genetics. Part B, Neuropsychiatric genetics : the official publication of the International Society of Psychiatric Genetics.

[52]  Matthew A. Hibbs,et al.  Visualization of omics data for systems biology , 2010, Nature Methods.

[53]  Calculating and Interpreting the p-values for Functions , Pathways and Lists in IPA Overview , 2010 .

[54]  Young Ho Lee,et al.  Pathway analysis of genome-wide association studies for Parkinson’s disease , 2012, Molecular Biology Reports.

[55]  Susumu Goto,et al.  KEGG for integration and interpretation of large-scale molecular data sets , 2011, Nucleic Acids Res..

[56]  I. Kullo,et al.  Molecular population genetics of PCSK9: a signature of recent positive selection , 2008, Pharmacogenetics and genomics.