A computational procedure for identifying master regulator candidates: a case study on diabetes progression in Goto-Kakizaki rats

BackgroundWe have recently identified a number of active regulatory networks involved in diabetes progression in Goto-Kakizaki (GK) rats by network screening. The networks were quite consistent with the previous knowledge of the regulatory relationships between transcription factors (TFs) and their regulated genes. To study the underlying molecular mechanisms directly related to phenotype changes, such as diseases, we also previously developed a computational procedure for identifying transcriptional master regulators (MRs) in conjunction with network screening and network inference, by effectively perturbing the phenotype states.ResultsIn this work, we further improved our previous method for identifying MR candidates, by listing them in a more reliable manner, and applied the method to reveal the MR candidates for diabetes progression in GK rats from the active networks. Specifically, the active TF-gene pairs for different time periods in GK rats were first extracted from the networks by network screening. Another set of active TF-gene pairs was selected by network inference, by considering the gene expression signatures for those periods between GK and Wistar-Kyoto (WKY) rats. The TF-gene pairs extracted by the two methods were then further selected, from the viewpoints of the emergence specificity of TF in GK rats and the regulated-gene coverage of TF in the expression signature. Finally, we narrowed all of the genes down to only 5 TFs (Etv4, Fus, Nr2f1, Sp2, and Tcfap2b) as the candidates of MRs, with 54 regulated genes, by merging the selected TF-gene pairs.ConclusionsThe present method has successfully identified biologically plausible MR candidates, including the TFs related to diabetes in previous reports. Although the experimental verifications of the candidates and the present procedure are beyond the scope of this study, we narrowed down the candidates to 5 TFs, which can be used to perform the verification experiments relatively easily. The numerical results showed that our computational method is an efficient way to detect the key molecules responsible for biological phenomena.

[1]  F. E. Grubbs Sample Criteria for Testing Outlying Observations , 1950 .

[2]  J L Haines,et al.  Supporting Online Material Materials and Methods Figs. S1 to S7 Tables S1 to S4 References Mutations in the Fus/tls Gene on Chromosome 16 Cause Familial Amyotrophic Lateral Sclerosis , 2022 .

[3]  M. Gerstein,et al.  Genomic analysis of the hierarchical structure of regulatory networks , 2006, Proceedings of the National Academy of Sciences.

[4]  Y. Benjamini,et al.  THE CONTROL OF THE FALSE DISCOVERY RATE IN MULTIPLE TESTING UNDER DEPENDENCY , 2001 .

[5]  Peter Tontonoz,et al.  The small molecule phenamil is a modulator of adipocyte differentiation and PPARγ expression[S] , 2010, Journal of Lipid Research.

[6]  Adam A. Margolin,et al.  Reverse engineering cellular networks , 2006, Nature Protocols.

[7]  Luonan Chen,et al.  Biomolecular Networks: Methods and Applications in Systems Biology , 2009 .

[8]  J. Uhm,et al.  The transcriptional network for mesenchymal transformation of brain tumours , 2010 .

[9]  Katsuhisa Horimoto,et al.  Potential Linkages Between the Inner and Outer Cellular States of Human Induced Pluripotent Stem Cells , 2010 .

[10]  Richard L Robertson,et al.  NR2F1 deletion in a patient with a de novo paracentric inversion, inv(5)(q15q33.2), and syndromic deafness , 2009, American journal of medical genetics. Part A.

[11]  J. Bertoglio,et al.  Sp2 regulates interferon-gamma-mediated socs1 gene expression. , 2009, Molecular immunology.

[12]  Zhi-Ping Liu,et al.  Network screening of Goto-Kakizaki rat liver microarray data during diabetic progression , 2011, BMC Systems Biology.

[13]  Katsuhisa Horimoto,et al.  BMC Systems Biology BioMed Central Methodology article , 2008 .

[14]  Sunghoon Kim,et al.  Identification of master regulator candidates in conjunction with network screening and inference , 2013, Int. J. Data Min. Bioinform..

[15]  Pablo Tamayo,et al.  Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[16]  Edgar Wingender,et al.  The TRANSFAC project as an example of framework technology that supports the analysis of genomic regulation , 2008, Briefings Bioinform..

[17]  Hiroshi Maegawa,et al.  The transcription factor AP-2beta causes cell enlargement and insulin resistance in 3T3-L1 adipocytes. , 2006, Endocrinology.

[18]  P. Spirtes,et al.  Causation, prediction, and search , 1993 .

[19]  Masashi Toyoda,et al.  Possible linkages between the inner and outer cellular states of human induced pluripotent stem cells , 2011, BMC Systems Biology.

[20]  Katsuhisa Horimoto,et al.  Co-expressed gene assessment based on the path consistency algorithm: Operon detention in Escherichia coli , 2009, 2009 IEEE International Conference on Systems, Man and Cybernetics.

[21]  Joshua A. Grochow,et al.  Genomic analysis reveals a tight link between transcription factor dynamics and regulatory network architecture , 2009, Molecular systems biology.

[22]  K. Basso,et al.  A systems biology approach to prediction of oncogenes and molecular perturbation targets in B-cell lymphomas , 2008, Molecular systems biology.

[23]  Debra C DuBois,et al.  Gene expression analysis of hepatic roles in cause and development of diabetes in Goto-Kakizaki rats. , 2009, The Journal of endocrinology.

[24]  Katsuhisa Horimoto,et al.  Discovery of Chemical Compound Groups with Common Structures by a Network Analysis Approach (Affinity Prediction Method) , 2011, J. Chem. Inf. Model..