Current status of pathway analysis in genome-wide association study.

Since the first publication in 2005, the genome-wide association study (GWAS) strategy has contributed significantly to the understanding of the mechanisms of human genetic diseases. Integrations of statistical methods and systematic biology are important means to explore the GWAS data. Pathway analysis establishes the importance of genetic variants from GWAS and provides insights into their biological significance. It is conducive in correlating the genetic variants, which have only small but interactive changes, to their importance in the biological pathways. At present, pathway analysis has been widely applied to studies of GWAS data, with relatively good results. In the meantime, various analytical methods are being developed and adapted for research on more types of complex data. In this review, we summarize the statistical methods of pathway analysis on GWAS data, and divide them into non-kernel methods and kernel methods. The non-kernel methods include gene set enrichment analysis (GSEA) and hierarchical Bayes prioritization (HBP) analysis, while kernel methods include linear kernel (LIN), identity-by-status kernel (IBS) and powered exponential kernel. We have summarized the calculation principles and features of these statistical methods to provide insights for further developments of new algorithms in GWAS research.