Discovery of microRNAs and Transcription Factors Co-Regulatory Modules by Integrating Multiple Types of Genomic Data

It is well known that regulators known as microRNA (miRNA) and transcription factor (TF) have been found to play an important role in gene regulation. However, there are few researches of collaborative regulatory (co-regulatory) mechanism between miRNA and TF on system level (function level). Meanwhile, recent advances in high-throughput genomic technologies have enabled researchers to collect diverse large-scale genomic data, which can be used to study the co-regulatory mechanism between miRNA and TF. In this paper, we propose a novel method called Sparse Network regularized non-negative matrix factorization for co-regulatory modules identification which adopts multiple non-negative matrix factorization framework to identify co-regulatory modules including miRNAs, TFs and genes. This method jointly integrates miRNA, TF and gene expression profiles, and additional priori networks were added in a regularized manner. In addition, to avoid the sparsity of these networks, we employ the sparsity penalties to the variables to achieve modular solutions. The mathematical formulation can be effectively solved by an iterative multiplicative updating algorithm. We apply this method to multiple genomic data including the expression profiles of miRNAs, TFs and genes on breast cancer obtained from TCGA, priori miRNA-gene regulations, TF-gene regulations and gene-gene interactions. The results show that the miRNAs, TFs and genes of the co-regulatory modules are significantly associated and modules have a reasonable size distribution. Furthermore, the co-regulatory modules are significantly enriched in Gene Ontology biological processes and Kyoto Encyclopedia of Genes and Genomes pathways, respectively.

[1]  Yuval Kluger,et al.  Inter- and intra-combinatorial regulation by transcription factors and microRNAs , 2007, BMC Genomics.

[2]  C. Burge,et al.  Conserved Seed Pairing, Often Flanked by Adenosines, Indicates that Thousands of Human Genes are MicroRNA Targets , 2005, Cell.

[3]  A. Muscella,et al.  CCL20 induces migration and proliferation on breast epithelial cells , 2013, Journal of cellular physiology.

[4]  S. Chin,et al.  BCL11A is a triple-negative breast cancer gene with critical functions in stem and progenitor cells , 2015, Nature Communications.

[5]  Karthik Devarajan,et al.  Nonnegative Matrix Factorization: An Analytical and Interpretive Tool in Computational Biology , 2008, PLoS Comput. Biol..

[6]  Juan M. Vaquerizas,et al.  A census of human transcription factors: function, expression and evolution , 2009, Nature Reviews Genetics.

[7]  P. Laird,et al.  Discovery of multi-dimensional modules by integrative analysis of cancer genomic data , 2012, Nucleic acids research.

[8]  A. Luo,et al.  MicroRNA-34 suppresses breast cancer invasion and metastasis by directly targeting Fra-1 , 2013, Oncogene.

[9]  Mike Tyers,et al.  BioGRID: a general repository for interaction datasets , 2005, Nucleic Acids Res..

[10]  Shuai Li,et al.  MicroRNA and histopathological characterization of pure mucinous breast carcinoma , 2013, Cancer biology & medicine.

[11]  Robert Gentleman,et al.  Using GOstats to test gene lists for GO term association , 2007, Bioinform..

[12]  Hsuan-Cheng Huang,et al.  Coregulation of transcription factors and microRNAs in human transcriptional regulatory network , 2011, BMC Bioinformatics.

[13]  Minghua Deng,et al.  A Lasso regression model for the construction of microRNA-target regulatory networks , 2011, Bioinform..

[14]  Qiang Yu,et al.  miR-449a and miR-449b are direct transcriptional targets of E2F1 and negatively regulate pRb-E2F1 activity through a feedback loop by targeting CDK6 and CDC25A. , 2009, Genes & development.

[15]  Giovanni De Micheli,et al.  Prediction of regulatory modules comprising microRNAs and target genes , 2005, ECCB/JBI.

[16]  Chao Cheng,et al.  Inferring MicroRNA Activities by Combining Gene Expression with MicroRNA Target Prediction , 2008, PloS one.

[17]  Pablo Tamayo,et al.  Metagenes and molecular pattern discovery using matrix factorization , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[18]  J. Uhm Comprehensive genomic characterization defines human glioblastoma genes and core pathways , 2009 .

[19]  D. Tran,et al.  Computational discovery of miR-TF regulatory modules in human genome , 2010, Bioinformation.

[20]  Q. Cui,et al.  Principles of microRNA regulation of a human cellular signaling network , 2006, Molecular systems biology.

[21]  Yitzhak Pilpel,et al.  Global and Local Architecture of the Mammalian microRNA–Transcription Factor Regulatory Network , 2007, PLoS Comput. Biol..

[22]  Darragh G. McArt,et al.  TBX2 represses CST6 resulting in uncontrolled legumain activity to sustain breast cancer proliferation: a novel cancer-selective target pathway with therapeutic opportunities. , 2014, Oncotarget.

[23]  Byoung-Tak Zhang,et al.  BIOINFORMATICS ORIGINAL PAPER doi:10.1093/bioinformatics/btm045 Data and text mining Discovery of microRNA–mRNA modules via population-based probabilistic learning , 2007 .

[24]  Kara Dolinski,et al.  The BioGRID Interaction Database: 2011 update , 2010, Nucleic Acids Res..

[25]  Michael W. Berry,et al.  Document clustering using nonnegative matrix factorization , 2006, Inf. Process. Manag..

[26]  Vipin Kumar,et al.  Large-scale integrative network-based analysis identifies common pathways disrupted by copy number alterations across cancers , 2013, BMC Genomics.

[27]  Philip M. Kim,et al.  Subsystem identification through dimensionality reduction of large-scale gene expression data. , 2003, Genome research.

[28]  Lin He,et al.  MicroRNAs: small RNAs with a big role in gene regulation , 2004, Nature reviews genetics.

[29]  E. Leygue,et al.  Claudin 1 in Breast Tumorigenesis: Revelation of a Possible Novel “Claudin High” Subset of Breast Cancers , 2010, Journal of biomedicine & biotechnology.

[30]  O. Hobert Gene Regulation by Transcription Factors and MicroRNAs , 2008, Science.

[31]  Juan Liu,et al.  A novel computational framework for simultaneous integration of multiple types of genomic data to identify microRNA-gene regulatory modules , 2011, Bioinform..

[32]  Xinxia Peng,et al.  Computational identification of hepatitis C virus associated microRNA-mRNA regulatory modules in human livers , 2009, BMC Genomics.

[33]  Hyun Cheol Chung,et al.  A Densely Interconnected Genome-Wide Network of MicroRNAs and Oncogenic Pathways Revealed Using Gene Expression Signatures , 2011, PLoS genetics.

[34]  George Michailidis,et al.  A non-negative matrix factorization method for detecting modules in heterogeneous omics multi-modal data , 2015, Bioinform..

[35]  Yang Li,et al.  HMDD v2.0: a database for experimentally supported human microRNA and disease associations , 2013, Nucleic Acids Res..

[36]  Yuan Qi,et al.  Modularity and Dynamics of Cellular Networks , 2006, PLoS Comput. Biol..

[37]  Nicola J. Rinaldi,et al.  Transcriptional Regulatory Networks in Saccharomyces cerevisiae , 2002, Science.

[38]  Matthew E. Ritchie,et al.  limma powers differential expression analyses for RNA-sequencing and microarray studies , 2015, Nucleic acids research.

[39]  H. Sebastian Seung,et al.  Algorithms for Non-negative Matrix Factorization , 2000, NIPS.

[40]  Jiajie Peng,et al.  SemFunSim: A New Method for Measuring Disease Similarity by Integrating Semantic and Gene Functional Association , 2014, PloS one.

[41]  Bo Chen,et al.  POU5F1/Oct-4 expression in breast cancer tissue is significantly associated with non-sentinel lymph node metastasis , 2016, BMC Cancer.

[42]  Hui Chen,et al.  PRAME is critical for breast cancer growth and metastasis. , 2016, Gene.

[43]  Paul T. Groth,et al.  The ENCODE (ENCyclopedia Of DNA Elements) Project , 2004, Science.

[44]  Chengcheng Hu,et al.  Differentially Expressed MicroRNAs in Postpartum Breast Cancer in Hispanic Women , 2015, PloS one.

[45]  Xin Chen,et al.  TRANSFAC: an integrated system for gene expression regulation , 2000, Nucleic Acids Res..

[46]  H. Sebastian Seung,et al.  Learning the parts of objects by non-negative matrix factorization , 1999, Nature.

[47]  J. Benítez,et al.  VGLL1 expression is associated with a triple-negative basal-like phenotype in breast cancer. , 2014, Endocrine-related cancer.

[48]  H. Horvitz,et al.  MicroRNA expression profiles classify human cancers , 2005, Nature.

[49]  Daming Zhang,et al.  miR‐577 inhibits glioblastoma tumor growth via the Wnt signaling pathway , 2016, Molecular carcinogenesis.

[50]  B. Frey,et al.  Using expression profiling data to identify human microRNA targets , 2007, Nature Methods.