Experimental design for large scale omic studies

Molecular phenotyping has expanded from small sample sizes to larger complex studies and are now a common element in genetic studies. When large scale studies add a molecular phenotyping component, balancing omics batches for the factors of interest (e.g. treatment), regardless of the initial sample collection strategy always improves power. Where possible, confounding sources of experimental error that are not of interest (sample collection blocks and data collection plates) improves power as does planning batches for molecular phenotyping based on constraints during initial sample collection. Power for testing differences in molecular phenotypes is always higher when accounting for the entire experimental design during modeling. The inclusion of metadata that tracks sources variation is critical to our shared goals of enabling reproducible research.

[1]  Genetic Basis of Transcriptome Diversity in Drosophila melanogaster , 2015 .

[2]  Hao Wu,et al.  R/qtl: QTL Mapping in Experimental Crosses , 2003, Bioinform..

[3]  M. Nordborg,et al.  Genetic Architecture of Natural Variation of Telomere Length in Arabidopsis thaliana , 2014, Genetics.

[4]  Jeremy R. B. Newman,et al.  Disease-specific biases in alternative splicing and tissue-specific dysregulation revealed by multitissue profiling of lymphocyte gene expression in type 1 diabetes , 2017, Genome research.

[5]  J. Holland,et al.  Direct mapping of density response in a population of B73 × Mo17 recombinant inbred lines of maize (Zea Mays L.) , 2010, Heredity.

[6]  Joshua D. Knowles,et al.  Procedures for large-scale metabolic profiling of serum and plasma using gas chromatography and liquid chromatography coupled to mass spectrometry , 2011, Nature Protocols.

[7]  E. Ng’oma,et al.  Genetic Dissection of Nutrition-Induced Plasticity in Insulin/Insulin-Like Growth Factor Signaling and Median Life Span in a Drosophila Multiparent Population , 2017, Genetics.

[8]  K. Cusi,et al.  Improved experimental data processing for UHPLC–HRMS/MS lipidomics applied to nonalcoholic fatty liver disease , 2017, Metabolomics.

[9]  A. Long,et al.  Genetic Dissection of the Drosophila melanogaster Female Head Transcriptome Reveals Widespread Allelic Heterogeneity , 2014, PLoS genetics.

[10]  Katie E. Marshall,et al.  Cold acclimation wholly reorganizes the Drosophila melanogaster transcriptome and metabolome , 2016, Scientific Reports.

[11]  David L. Aylor,et al.  Male Infertility Is Responsible for Nearly Half of the Extinction Observed in the Mouse Collaborative Cross , 2017, Genetics.

[12]  M. McMullen,et al.  Genetic Properties of the Maize Nested Association Mapping Population , 2009, Science.

[13]  J. J. Jansen,et al.  Plants Know Where It Hurts: Root and Shoot Jasmonic Acid Induction Elicit Differential Responses in Brassica oleracea , 2013, PloS one.

[14]  S. Griffiths-Jones,et al.  Novel Intronic RNA Structures Contribute to Maintenance of Phenotype in Saccharomyces cerevisiae , 2016, Genetics.

[15]  E. Hovig,et al.  Methods that remove batch effects while retaining group differences may lead to exaggerated confidence in downstream analyses , 2015, Biostatistics.

[16]  Adel Dayarian,et al.  Polygenicity and Epistasis Underlie Fitness-Proximal Traits in the Caenorhabditis elegans Multiparental Experimental Evolution (CeMEE) Panel , 2017, Genetics.

[17]  S. Nuzhdin,et al.  Simpler mode of inheritance of transcriptional variation in male Drosophila melanogaster , 2007, Proceedings of the National Academy of Sciences.

[18]  William G. Cochran,et al.  Experimental Designs, 2nd Edition , 1950 .

[19]  B. Oliver,et al.  Comparison of normalization and differential expression analyses using RNA-Seq data from 726 individual Drosophila melanogaster , 2016, BMC Genomics.

[20]  R. Doerge,et al.  Statistical Design and Analysis of RNA Sequencing Data , 2010, Genetics.