PhenStat: A Tool Kit for Standardized Analysis of High Throughput Phenotypic Data

The lack of reproducibility in animal phenotyping experiments is a growing concern in the biomedical community. One contributing factor is the inadequate description of statistical analysis methods, which prevents researchers from replicating results even when the original data are provided. Here we present PhenStat, a freely available R package that provides a variety of statistical methods for identifying phenotypic associations. The methods were developed for high-throughput phenotyping pipelines implemented across various experimental designs, with an emphasis on managing temporal variation. PhenStat targets two user groups: small-scale users who wish to interact with and test data from large resources, and large-scale users who require an automated statistical analysis pipeline. The software guides the user in selecting appropriate analysis methods based on the dataset and is designed to allow additions and modifications as needed. The package was tested on mouse and rat data and is used by the International Mouse Phenotyping Consortium (IMPC). By providing raw data and the version of PhenStat used, resources like the IMPC give users the ability to replicate and explore results within their own computing environment.
