Controlling the Precision-Recall Tradeoff in Differential Dependency Network Analysis

Graphical models have gained a lot of attention recently as a tool for learning and representing dependencies among variables in multivariate data. Often, domain scientists are looking specifically for differences among the dependency networks of different conditions or populations (e.g. differences between regulatory networks of different species, or differences between dependency networks of diseased versus healthy populations). The standard method for finding these differences is to learn the dependency networks for each condition independently and compare them. We show that this approach is prone to high false discovery rates (low precision) that can render the analysis useless. We then show that by imposing a bias towards learning similar dependency networks for each condition the false discovery rates can be reduced to acceptable levels, at the cost of finding a reduced number of differences. Algorithms developed in the transfer learning literature can be used to vary the strength of the imposed similarity bias and provide a natural mechanism to smoothly adjust this differential precision-recall tradeoff to cater to the requirements of the analysis conducted. We present real case studies (oncological and neurological) where domain experts use the proposed technique to extract useful differential networks that shed light on the biological processes involved in cancer and brain function.

[1]  Christophe Ambroise,et al.  Inferring multiple graphical structures , 2009, Stat. Comput..

[2]  S. Hirohashi,et al.  Cell adhesion system and human cancer morphogenesis , 2003, Cancer science.

[3]  Patrick Danaher,et al.  The joint graphical lasso for inverse covariance estimation across multiple classes , 2011, Journal of the Royal Statistical Society. Series B, Statistical methodology.

[4]  Robert Clarke,et al.  Differential dependency network analysis to identify condition-specific topological changes in biological networks , 2009, Bioinform..

[5]  M. Bosner,et al.  Cholesterol transport function of pancreatic cholesterol esterase: directed sterol uptake and esterification in enterocytes. , 1993, Biochemistry.

[6]  John F Smyth,et al.  Insulin-like Growth Factor Binding Proteins IGFBP3, IGFBP4, and IGFBP5 Predict Endocrine Responsiveness in Patients with Ovarian Cancer , 2007, Clinical Cancer Research.

[7]  N. Tzourio-Mazoyer,et al.  Automated Anatomical Labeling of Activations in SPM Using a Macroscopic Anatomical Parcellation of the MNI MRI Single-Subject Brain , 2002, NeuroImage.

[8]  Brad T. Sherman,et al.  Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources , 2008, Nature Protocols.

[9]  Vince D. Calhoun,et al.  TDCS guided using fMRI significantly accelerates learning to identify concealed objects , 2012, NeuroImage.

[10]  Guanlin Wang,et al.  Secretory leukocyte protease inhibitor is associated with MMP-2 and MMP-9 to promote migration and invasion in SNU638 gastric cancer cells. , 2011, International journal of molecular medicine.

[11]  Seungyeop Han,et al.  Structured Learning of Gaussian Graphical Models , 2012, NIPS.

[12]  Ralph S Freedman,et al.  Ovarian cancer, the coagulation pathway, and inflammation , 2005, Journal of Translational Medicine.

[13]  Pedro M. Domingos,et al.  Learning Bayesian network classifiers by maximizing conditional likelihood , 2004, ICML.

[14]  Fang Han,et al.  Transelliptical Graphical Models , 2012, NIPS.

[15]  N. Meinshausen,et al.  High-dimensional graphs and variable selection with the Lasso , 2006, math/0608017.

[16]  E. Levina,et al.  Joint estimation of multiple graphical models. , 2011, Biometrika.

[17]  Tracy R. Keeney,et al.  Aptamer-based multiplexed proteomic technology for biomarker discovery , 2010, Nature Precedings.

[18]  P. Shannon,et al.  Cytoscape: a software environment for integrated models of biomolecular interaction networks. , 2003, Genome research.

[19]  Leslie G. Ungerleider,et al.  Object vision and spatial vision: two cortical pathways , 1983, Trends in Neurosciences.