Summary: A polygenic risk score (PRS) is a sum of trait-associated alleles across many genetic loci, typically weighted by effect sizes estimated from a genome-wide association study. The application of PRS has grown in recent years as their utility for detecting shared genetic aetiology among traits has become appreciated; PRS can also be used to establish the presence of a genetic signal in underpowered studies, to infer the genetic architecture of a trait, for screening in clinical trials, and can act as a biomarker for a phenotype. Here we present the first dedicated PRS software, PRSice (‘precise'), for calculating, applying, evaluating and plotting the results of PRS. PRSice can calculate PRS at a large number of thresholds (“high resolution”) to provide the best-fit PRS, as well as provide results calculated at broad P-value thresholds, can thin Single Nucleotide Polymorphisms (SNPs) according to linkage disequilibrium and P-value or use all SNPs, handles genotyped and imputed data, can calculate and incorporate ancestry-informative variables, and can apply PRS across multiple traits in a single run. We exemplify the use of PRSice via application to data on schizophrenia, major depressive disorder and smoking, illustrate the importance of identifying the best-fit PRS and estimate a P-value significance threshold for high-resolution PRS studies. Availability and implementation: PRSice is written in R, including wrappers for bash data management scripts and PLINK-1.9 to minimize computational time. PRSice runs as a command-line program with a variety of user-options, and is freely available for download from http://PRSice.info Contact: jack.euesden@kcl.ac.uk or paul.oreilly@kcl.ac.uk Supplementary information: Supplementary data are available at Bioinformatics online.
[1]
P. Visscher,et al.
Common polygenic variation contributes to risk of schizophrenia and bipolar disorder
,
2009,
Nature.
[2]
M. Rietschel,et al.
Relationship between obesity and the risk of clinically significant depression: Mendelian randomisation study
,
2014,
The British journal of psychiatry : the journal of mental science.
[3]
Carson C Chow,et al.
Second-generation PLINK: rising to the challenge of larger and richer datasets
,
2014,
GigaScience.
[4]
Li Li,et al.
The benefits of using genetic information to design prevention trials.
,
2013,
American journal of human genetics.
[5]
Tom R. Gaunt,et al.
Genetic Variants in Novel Pathways Influence Blood Pressure and Cardiovascular Disease Risk
,
2011,
Nature.
[6]
P. Muglia,et al.
Genome-wide association study of major recurrent depression in the U.K. population.
,
2010,
The American journal of psychiatry.
[7]
F. Dudbridge.
Power and Predictive Accuracy of Polygenic Risk Scores
,
2013,
PLoS genetics.
[8]
G. Coop,et al.
A Population Genetic Signal of Polygenic Adaptation
,
2013,
PLoS genetics.
[9]
M. Daly,et al.
Identification of risk loci with shared effects on five major psychiatric disorders: a genome-wide analysis
,
2013,
The Lancet.