miRLAB: An R Based Dry Lab for Exploring miRNA-mRNA Regulatory Relationships

microRNAs (miRNAs) are important gene regulators at post-transcriptional level, and inferring miRNA-mRNA regulatory relationships is a crucial problem. Consequently, several computational methods of predicting miRNA targets have been proposed using expression data with or without sequence based miRNA target information. A typical procedure for applying and evaluating such a method is i) collecting matched miRNA and mRNA expression profiles in a specific condition, e.g. a cancer dataset from The Cancer Genome Atlas (TCGA), ii) applying the new computational method to the selected dataset, iii) validating the predictions against knowledge from literature and third-party databases, and comparing the performance of the method with some existing methods. This procedure is time consuming given the time elapsed when collecting and processing data, repeating the work from existing methods, searching for knowledge from literature and third-party databases to validate the results, and comparing the results from different methods. The time consuming procedure prevents researchers from quickly testing new computational models, analysing new datasets, and selecting suitable methods for assisting with the experiment design. Here, we present an R package, miRLAB, for automating the procedure of inferring and validating miRNA-mRNA regulatory relationships. The package provides a complete set of pipelines for testing new methods and analysing new datasets. miRLAB includes a pipeline to obtain matched miRNA and mRNA expression datasets directly from TCGA, 12 benchmark computational methods for inferring miRNA-mRNA regulatory relationships, the functions for validating the predictions using experimentally validated miRNA target data and miRNA perturbation data, and the tools for comparing the results from different computational methods.

[1]  Wm. R. Wright General Intelligence, Objectively Determined and Measured. , 1905 .

[2]  K. Pearson NOTES ON THE HISTORY OF CORRELATION , 1920 .

[3]  M. Kendall A NEW MEASURE OF RANK CORRELATION , 1938 .

[4]  Moon,et al.  Estimation of mutual information using kernel density estimators. , 1995, Physical review. E, Statistical physics, plasmas, fluids, and related interdisciplinary topics.

[5]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[6]  Hiroyuki Ogata,et al.  KEGG: Kyoto Encyclopedia of Genes and Genomes , 1999, Nucleic Acids Res..

[7]  M. Ashburner,et al.  Gene Ontology: tool for the unification of biology , 2000, Nature Genetics.

[8]  T. Poggio,et al.  Multiclass cancer diagnosis using tumor gene expression signatures , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[9]  C. Burge,et al.  Prediction of Mammalian MicroRNA Targets , 2003, Cell.

[10]  Gordon K. Smyth,et al.  limma: Linear Models for Microarray Data , 2005 .

[11]  H. Horvitz,et al.  MicroRNA expression profiles classify human cancers , 2005, Nature.

[12]  H. Zou,et al.  Regularization and variable selection via the elastic net , 2005 .

[13]  Rafael A. Irizarry,et al.  Bioinformatics and Computational Biology Solutions using R and Bioconductor , 2005 .

[14]  N. Rajewsky microRNA target predictions in animals , 2006, Nature Genetics.

[15]  Maria L. Rizzo,et al.  Measuring and testing dependence by correlation of distances , 2007, 0803.4101.

[16]  B. Frey,et al.  Using expression profiling data to identify human microRNA targets , 2007, Nature Methods.

[17]  Huiqing Liu,et al.  Identifying mRNA targets of microRNA dysregulated in cancer: with application to clear cell Renal Cell Carcinoma , 2010, BMC Systems Biology.

[18]  Tongbin Li,et al.  miRecords: an integrated resource for microRNA–target interactions , 2008, Nucleic Acids Res..

[19]  Jiuyong Li,et al.  Exploring complex miRNA-mRNA interactions with Bayesian networks by splitting-averaging strategy , 2009, BMC Bioinformatics.

[20]  M. Maathuis,et al.  Estimating high-dimensional intervention effects from observational data , 2008, 0810.4214.

[21]  N. D. Clarke,et al.  Towards a Rigorous Assessment of Systems Biology Models: The DREAM3 Challenges , 2010, PloS one.

[22]  Trevor Hastie,et al.  Regularization Paths for Generalized Linear Models via Coordinate Descent. , 2010, Journal of statistical software.

[23]  Scott B. Dewell,et al.  Transcriptome-wide Identification of RNA-Binding Protein and MicroRNA Target Sites by PAR-CLIP , 2010, Cell.

[24]  N. D. Clarke,et al.  Correction: Towards a Rigorous Assessment of Systems Biology Models: The DREAM3 Challenges , 2010, PLoS ONE.

[25]  Robert B Darnell,et al.  HITS‐CLIP: panoramic views of protein–RNA regulation in living cells , 2010, Wiley interdisciplinary reviews. RNA.

[26]  I. Van der Auwera,et al.  Integrated miRNA and mRNA expression profiling of the inflammatory breast cancer subtype , 2010, British Journal of Cancer.

[27]  Peter Bühlmann,et al.  Predicting causal effects in large-scale systems from observational data , 2010, Nature Methods.

[28]  J. Gorodkin,et al.  Global microRNA Analysis of the NCI-60 Cancer Cell Panel , 2011, Molecular Cancer Therapeutics.

[29]  T. Speed A Correlation for the 21st Century , 2011, Science.

[30]  R. Tibshirani,et al.  Regression shrinkage and selection via the lasso: a retrospective , 2011 .

[31]  J. E. García,et al.  A non-parametric test of independence ∗ , 2011 .

[32]  Minghua Deng,et al.  A Lasso regression model for the construction of microRNA-target regulatory networks , 2011, Bioinform..

[33]  Norbert Gretz,et al.  miRWalk - Database: Prediction of possible miRNA binding sites by "walking" the genes of three genomes , 2011, J. Biomed. Informatics.

[34]  A. Luttun,et al.  Quantification of miRNA-mRNA Interactions , 2012, PloS one.

[35]  Nectarios Koziris,et al.  TarBase 6.0: capturing the exponential growth of miRNA targets with experimental support , 2011, Nucleic Acids Res..

[36]  Jiuyong Li,et al.  Inferring microRNA and transcription factor regulatory networks in heterogeneous data , 2013, BMC Bioinformatics.

[37]  Diogo M. Camacho,et al.  Wisdom of crowds for robust gene network inference , 2012, Nature Methods.

[38]  J. Ule,et al.  Protein–RNA interactions: new genomic technologies and perspectives , 2012, Nature Reviews Genetics.

[39]  Martin Reczko,et al.  DIANA-microT web server v5.0: service integration into miRNA functional analysis workflows , 2013, Nucleic Acids Res..

[40]  Bernhard Schölkopf,et al.  The Randomized Dependence Coefficient , 2013, NIPS.

[41]  The Cancer Genome Atlas Research Network,et al.  Multi-cancer molecular signatures and their interrelationships , 2013, 1306.2584.

[42]  Jiuyong Li,et al.  Inferring microRNA-mRNA causal regulatory relationships from expression data , 2013, Bioinform..

[43]  Jiuyong Li,et al.  Identifying miRNAs, targets and functions , 2012, Briefings Bioinform..

[44]  Zhaolei Zhang,et al.  A probabilistic approach to explore human miRNA targetome by integrating miRNA-overexpression data and sequence information , 2014, Bioinform..

[45]  Peng Qiu,et al.  TCGA-Assembler: open-source software for retrieving and processing TCGA data , 2014, Nature Methods.

[46]  Junpeng Zhang,et al.  Inferring condition-specific miRNA activity from matched miRNA and mRNA expression data , 2014, Bioinform..

[47]  Junpeng Zhang,et al.  Identifying direct miRNA-mRNA causal regulatory relationships in heterogeneous data , 2014, J. Biomed. Informatics.

[48]  Hsien-Da Huang,et al.  miRTarBase update 2014: an information resource for experimentally validated miRNA-target interactions , 2013, Nucleic Acids Res..

[49]  Cheng Liang,et al.  Inferring probabilistic miRNA–mRNA interaction signatures in cancers: a role-switch approach , 2014, Nucleic acids research.

[50]  D. Bartel,et al.  Predicting effective microRNA target sites in mammalian mRNAs , 2015, eLife.

[51]  Junpeng Zhang,et al.  From miRNA regulation to miRNA-TF co-regulation: computational approaches and challenges , 2015, Briefings Bioinform..

[52]  Jiuyong Li,et al.  Ensemble Methods for MiRNA Target Prediction from Expression Data , 2015, PloS one.