Genome-Wide Scale-Free Network Inference for Candida albicans

Discovering essential genes in pathogenic organisms is an important step in the development of new drugs. Despite the growing amount of genome data available, little is known about C. albicans, a major fungal pathogen. Most of the human population carries C. albicans as a commensal, but it can cause systemic infections that may be fatal when the host's immune system is compromised. In many organisms, central nodes of the interaction network (hubs) play a crucial role in information and energy transport. Knock-outs of such hubs often lead to lethal phenotypes, which makes them interesting drug targets. To identify these central genes via topological analysis, we inferred gene regulatory networks that are sparse and scale-free. We collected information from various sources to complement the limited expression data available, and we used a linear regression algorithm to infer genome-wide gene regulatory interaction networks. To evaluate the predictive power of our approach, we used an automated text-mining system that scanned full-text research papers for known interactions. Using this compendium of known interactions, we also optimized the influence of the prior knowledge and the sparseness of the model. We compared the results of our approach with those of other state-of-the-art network inference methods and found that our approach outperforms them. Finally, we identified a number of hubs in the genome of the fungus and investigated their biological relevance.
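To make the inference step more concrete, the following is a minimal sketch of one way a sparse linear regression with prior knowledge could be set up. It is not the authors' implementation: the abstract only states that a linear regression algorithm was used, so the Lasso-type penalty, the coordinate-descent solver, the way the prior lowers the penalty, and all names (`weighted_lasso`, `infer_network`, `hubs`, `lam`, `prior_weight`) are assumptions made for illustration.

```python
import numpy as np


def soft_threshold(z, t):
    """Shrink z towards zero by t (the proximal step of the L1 penalty)."""
    return np.sign(z) * max(abs(z) - t, 0.0)


def weighted_lasso(X, y, penalties, n_iter=200):
    """Minimise (1/2n)||y - Xb||^2 + sum_j penalties[j] * |b_j|
    by cyclic coordinate descent. Illustrative solver, not tuned."""
    n, p = X.shape
    b = np.zeros(p)
    col_sq = (X ** 2).sum(axis=0) / n
    r = y.astype(float)  # residual y - Xb (b starts at zero)
    for _ in range(n_iter):
        for j in range(p):
            if col_sq[j] == 0:
                continue
            r += X[:, j] * b[j]            # remove gene j's contribution
            rho = X[:, j] @ r / n          # correlation with partial residual
            b[j] = soft_threshold(rho, penalties[j]) / col_sq[j]
            r -= X[:, j] * b[j]            # add the updated contribution back
    return b


def infer_network(E, prior, lam=0.1, prior_weight=0.5):
    """E: samples x genes expression matrix (assumed to have no constant columns).
    prior[i, j] in [0, 1]: hypothetical prior support for regulator j -> target i.
    Edges with prior support receive a smaller penalty (assumed scheme)."""
    n, g = E.shape
    E = (E - E.mean(axis=0)) / E.std(axis=0)   # standardise each gene
    W = np.zeros((g, g))
    for i in range(g):
        regs = [j for j in range(g) if j != i]  # no self-regulation
        pen = lam * (1.0 - prior_weight * prior[i, regs])
        W[i, regs] = weighted_lasso(E[:, regs], E[:, i], pen)
    return W


def hubs(W, top=3):
    """Rank genes by out-degree, i.e. the number of targets they regulate."""
    out_degree = (W != 0).sum(axis=0)
    order = np.argsort(-out_degree, kind="stable")
    return order[:top], out_degree
```

In this sketch, `lam` controls the sparseness of the network and `prior_weight` controls how strongly prior knowledge relaxes the penalty; these correspond loosely to the two quantities the abstract says were optimized against the compendium of known interactions. Ranking genes by out-degree is only one simple topological criterion for hubs.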
