Towards a Bayesian Decision Theoretic Analysis of Contextual Effect Modifiers

Relevance measures based on parametric and structural properties of Bayesian networks can be utilized to characterize predictors and their interactions. The advantage of the Bayesian framework is that it allows a detailed view of parametric and structural aspects of relevance for domain experts. We discuss two particularly challenging scenarios from psycho-genetic studies, (1) the analysis of weak effects, and (2) the analysis of contextual relevance, where a factor has a negligible main effect and it modifies an effect of another factor only in a given subpopulation. To cope with this challenge, we investigate the formalization of expert intuitions and preferences from the exploratory data analysis phase. We propose formal losses for these two scenarios. We introduce and evaluate a Bayesian effect size measure using an artificial data set related to a genetic association study, and real data from a psycho-genetic study.

[1]  A. Falus,et al.  Evaluation of a Partial Genome Screening of Two Asthma Susceptibility Regions Using Bayesian Network Based Bayesian Multilevel Analysis of Relevance , 2012, PloS one.

[2]  B. Maher Personal genomes: The case of the missing heritability , 2008, Nature.

[3]  Constantin F. Aliferis,et al.  Towards Principled Feature Selection: Relevancy, Filters and Wrappers , 2003 .

[4]  T. Reich,et al.  A perspective on epistasis: limits of models displaying no main effect. , 2002, American journal of human genetics.

[5]  Ron Kohavi,et al.  Wrappers for Feature Subset Selection , 1997, Artif. Intell..

[6]  John D. Storey The positive false discovery rate: a Bayesian interpretation and the q-value , 2003 .

[7]  P. Antal,et al.  Estimation of eect size posterior using model averaging over Bayesian network structures and parameters , 2012 .

[8]  D. Balding A tutorial on statistical methods for population association studies , 2006, Nature Reviews Genetics.

[9]  D. Madigan,et al.  Bayesian model averaging and model selection for markov equivalence classes of acyclic digraphs , 1996 .

[10]  P Antal,et al.  Probabilistic Graphical Models for Computational Biomedicine , 2003, Methods of Information in Medicine.

[11]  Peter Antal,et al.  Evaluation of a Bayesian model-based approach in GA studies , 2009, MLSB.

[12]  Peter Antal,et al.  The effect of parameter priors on Bayesian relevance and effect size measures , 2013 .

[13]  Jesper Tegnér,et al.  Towards scalable and data efficient learning of Markov boundaries , 2007, Int. J. Approx. Reason..

[14]  Nir Friedman,et al.  Being Bayesian about Network Structure , 2000, UAI.

[15]  Nir Friedman,et al.  On the Sample Complexity of Learning Bayesian Networks , 1996, UAI.

[16]  Gregory F. Cooper,et al.  A Bayesian method for the induction of probabilistic networks from data , 1992, Machine Learning.

[17]  Peter Antal,et al.  Brain galanin system genes interact with life stresses in depression-related phenotypes , 2014, Proceedings of the National Academy of Sciences.

[18]  Mikko Koivisto,et al.  Exact Bayesian Structure Discovery in Bayesian Networks , 2004, J. Mach. Learn. Res..

[19]  Peter Antal,et al.  Learning Complex Bayesian Network Features for Classification , 2006, Probabilistic Graphical Models.

[20]  Philippe Leray,et al.  Probabilistic graphical models for genetic association studies , 2012, Briefings Bioinform..

[21]  Nir Friedman,et al.  Being Bayesian About Network Structure. A Bayesian Approach to Structure Discovery in Bayesian Networks , 2004, Machine Learning.

[22]  Christine Sinoquet,et al.  Probabilistic graphical models for genetics, genomics and postgenomics , 2014 .

[23]  Andrei S. Rodin,et al.  Mining genetic epidemiology data with Bayesian networks I: Bayesian networks and example application (plasma apoE levels) , 2005, Bioinform..

[24]  Claudio J. Verzilli,et al.  Bayesian graphical models for genomewide association studies. , 2006, American journal of human genetics.

[25]  David B. Dunson,et al.  Bayesian Data Analysis , 2010 .

[26]  Paolo Giudici,et al.  Improving Markov Chain Monte Carlo Model Search for Data Mining , 2004, Machine Learning.