Meta‐analytic framework for liquid association

Motivation: Although coexpression analysis via pair‐wise expression correlation is popularly used to elucidate gene‐gene interactions at the whole‐genome scale, many complicated multi‐gene regulations require more advanced detection methods. Liquid association (LA) is a powerful tool to detect the dynamic correlation of two gene variables depending on the expression level of a third variable (LA scouting gene). LA detection from single transcriptomic study, however, is often unstable and not generalizable due to cohort bias, biological variation and limited sample size. With the rapid development of microarray and NGS technology, LA analysis combining multiple gene expression studies can provide more accurate and stable results. Results: In this article, we proposed two meta‐analytic approaches for LA analysis (MetaLA and MetaMLA) to combine multiple transcriptomic studies. To compensate demanding computing, we also proposed a two‐step fast screening algorithm for more efficient genome‐wide screening: bootstrap filtering and sign filtering. We applied the methods to five Saccharomyces cerevisiae datasets related to environmental changes. The fast screening algorithm reduced 98% of running time. When compared with single study analysis, MetaLA and MetaMLA provided stronger detection signal and more consistent and stable results. The top triplets are highly enriched in fundamental biological processes related to environmental changes. Our method can help biologists understand underlying regulatory mechanisms under different environmental exposure or disease states. Availability and Implementation: A MetaLA R package, data and code for this article are available at http://tsenglab.biostat.pitt.edu/software.htm Contact: ctseng@pitt.edu Supplementary information: Supplementary data are available at Bioinformatics online.

[1]  G. Upton Fisher's Exact Test , 1992 .

[2]  Eulàlia de Nadal,et al.  Osmostress‐induced transcription by Hot1 depends on a Hog1‐mediated recruitment of the RNA Pol II , 2003, The EMBO journal.

[3]  Edith D. Wong,et al.  Saccharomyces Genome Database: the genomics resource of budding yeast , 2011, Nucleic Acids Res..

[4]  Marcel JT Reinders,et al.  Combinatorial effects of environmental parameters on transcriptional regulation in Saccharomyces cerevisiae: A quantitative analysis of a compendium of chemostat-based transcriptome data , 2009, BMC Genomics.

[5]  Albert Sickmann,et al.  Profiling Phosphoproteins of Yeast Mitochondria Reveals a Role of Phosphorylation in Assembly of the ATP Synthase*S , 2007, Molecular & Cellular Proteomics.

[6]  Atul J. Butte,et al.  Systematic survey reveals general applicability of "guilt-by-association" within gene coexpression networks , 2005, BMC Bioinformatics.

[7]  M. Hewlins,et al.  The Catabolism of Amino Acids to Long Chain and Complex Alcohols in Saccharomyces cerevisiae * , 2003, The Journal of Biological Chemistry.

[8]  Renée X. de Menezes,et al.  Filtering, FDR and power , 2010, BMC Bioinformatics.

[9]  Yudong D. He,et al.  Functional Discovery via a Compendium of Expression Profiles , 2000, Cell.

[10]  Thomas A Louis,et al.  Modeling Liquid Association , 2011, Biometrics.

[11]  Kevin P. Byrne,et al.  The Yeast Gene Order Browser: combining curated homology and syntenic context reveals gene fate in polyploid species. , 2005, Genome research.

[12]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .

[13]  Lynne M Connelly,et al.  Fisher's Exact Test. , 2016, Medsurg nursing : official journal of the Academy of Medical-Surgical Nurses.

[14]  Ker-Chau Li,et al.  A system for enhancing genome-wide coexpression dynamics study. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[15]  M. Holland,et al.  Targeted deletion of a yeast enolase structural gene. Identification and isolation of yeast enolase isozymes. , 1982, The Journal of biological chemistry.

[16]  E. Lander,et al.  Remodeling of yeast genome expression in response to environmental changes. , 2001, Molecular biology of the cell.

[17]  Javier Cabrera,et al.  Analysis of Data From Viral DNA Microchips , 2001 .

[18]  R. Gentleman,et al.  Independent filtering increases detection power for high-throughput experiments , 2010, Proceedings of the National Academy of Sciences.

[19]  Lin Song,et al.  Comparison of co-expression measures: mutual information, correlation, and model based indices , 2012, BMC Bioinformatics.

[20]  S. Horvath,et al.  A General Framework for Weighted Gene Co-Expression Network Analysis , 2005, Statistical applications in genetics and molecular biology.

[21]  Catarina Costa,et al.  The YEASTRACT database: an upgraded information system for the analysis of gene and genomic transcription regulation in Saccharomyces cerevisiae , 2013, Nucleic Acids Res..

[22]  Tina Gunderson,et al.  An efficient algorithm to explore liquid association on a genome-wide scale , 2014, BMC Bioinformatics.

[23]  Minoru Kanehisa,et al.  KEGG as a reference resource for gene and protein annotation , 2015, Nucleic Acids Res..

[24]  Ker-Chau Li,et al.  Genome-wide coexpression dynamics: Theory and application , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[25]  Yuan Ji,et al.  Extracting three-way gene interactions from microarray data , 2007, Bioinform..

[26]  D. Botstein,et al.  Genomic expression programs in the response of yeast cells to environmental changes. , 2000, Molecular biology of the cell.

[27]  N. Altman An Introduction to Kernel and Nearest-Neighbor Nonparametric Regression , 1992 .

[28]  Robert Tibshirani,et al.  Bootstrap Methods for Standard Errors, Confidence Intervals, and Other Measures of Statistical Accuracy , 1986 .

[29]  P. Bork,et al.  Proteome survey reveals modularity of the yeast cell machinery , 2006, Nature.

[30]  David H Perlman,et al.  Regulation of yeast pyruvate kinase by ultrasensitive allostery independent of phosphorylation. , 2012, Molecular cell.

[31]  David B. Berry,et al.  Pathway connectivity and signaling coordination in the yeast stress-activated signaling network , 2014, Molecular systems biology.

[32]  I S Kohane,et al.  Mutual information relevance networks: functional genomic clustering using pairwise entropy measurements. , 1999, Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing.