DAtest: a framework for choosing differential abundance or expression method

DAtest is an R package for directly comparing different statistical methods for differential abundance and expression analysis on a dataset of interest; be it data from RNA-seq, proteomics, metabolomics or a microbial marker-gene survey. A myriad of statistical methods exists for conducting these analyses, and with this tool we give the analyst an empirical foundation for choosing a method suitable for a specific dataset. The package supports categorical and quantitative variables, paired/block experimental designs, and the inclusion of covariates. It is freely available at GitHub: https://github.com/Russel88/DAtest along with detailed instructions.

[1]  R. Knight,et al.  The Human Microbiome Project , 2007, Nature.

[2]  W. Huber,et al.  Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2 , 2014, Genome Biology.

[3]  Mark D. Robinson,et al.  edgeR: a Bioconductor package for differential expression analysis of digital gene expression data , 2009, Bioinform..

[4]  R. Knight,et al.  The human microbiome project: exploring the microbial part of ourselves in a changing world , 2022 .

[5]  M. Pop,et al.  Robust methods for differential abundance analysis in marker gene surveys , 2013, Nature Methods.

[6]  C. Mason,et al.  Comprehensive evaluation of differential gene expression analysis methods for RNA-seq data , 2013, Genome Biology.

[7]  Jesse R. Zaneveld,et al.  Normalization and microbial differential abundance strategies depend upon data characteristics , 2017, Microbiome.

[8]  Matthew E. Ritchie,et al.  limma powers differential expression analyses for RNA-sequencing and microarray studies , 2015, Nucleic acids research.

[9]  Rob Knight,et al.  Analysis of composition of microbiomes: a novel method for studying microbial composition , 2015, Microbial ecology in health and disease.

[10]  Hans Bisgaard,et al.  Large-scale benchmarking reveals false discoveries and count transformation sensitivity in 16S rRNA gene amplicon data analysis methods used in microbiome studies , 2016, Microbiome.

[11]  Robert Tibshirani,et al.  Finding consistent patterns: A nonparametric approach for identifying differential expression in RNA-Seq data , 2013, Statistical methods in medical research.