ExprTarget: An Integrative Approach to Predicting Human MicroRNA Targets

Variation in gene expression has been observed in natural populations and is associated with complex traits or phenotypes such as disease susceptibility and drug response. Gene expression itself is controlled by various genetic and non-genetic factors. The binding of a class of small RNA molecules, microRNAs (miRNAs), to mRNA transcript targets has recently been demonstrated to be an important mechanism of gene regulation. Because an individual miRNA may regulate the expression of multiple gene targets, a comprehensive and reliable catalogue of miRNA-regulated targets is critical to understanding gene regulatory networks. Although experimental approaches have identified many miRNA targets, their cost and limited efficiency mean that miRNA target identification still relies largely on computational algorithms that exploit different biochemical and thermodynamic properties of the sequences of miRNAs and their gene targets. We therefore propose a novel approach, ExprTarget, that integrates some of the most frequently used methods (miRanda, PicTar, and TargetScan) with the genome-wide HapMap miRNA and mRNA expression datasets generated in our laboratory. To our knowledge, this dataset constitutes the first miRNA expression profile of the HapMap lymphoblastoid cell lines. We conducted diagnostic tests of the existing computational methods using the experimentally supported targets in TarBase as the gold standard. To gain insight into the biases that arise from such an analysis, we investigated how the choice of gold standard affects the evaluation of the various computational tools. We analyzed the performance of ExprTarget using both ROC curve analysis and cross-validation. We show that ExprTarget greatly improves miRNA target prediction relative to the individual prediction algorithms in terms of sensitivity and specificity.
We also developed an online database, ExprTargetDB, of human miRNA targets predicted by our approach, which integrates gene expression profiling into a broader framework that incorporates important features of miRNA target site prediction.
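
To make the evaluation idea concrete, the following is a minimal sketch of how the area under an ROC curve (AUC) can be computed for a single predictor and for a combined score against a gold standard. It is not the ExprTarget model: the `combine` function (an unweighted mean), the variable names, and all numeric values are assumptions invented for illustration, and the actual integration scheme is not specified in this abstract.

```python
from itertools import product


def auc(scores, labels):
    """Area under the ROC curve via pairwise comparison.

    Equals the probability that a randomly chosen positive (label 1)
    receives a higher score than a randomly chosen negative (label 0);
    ties count as 0.5.
    """
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0
               for p, n in product(pos, neg))
    return wins / (len(pos) * len(neg))


def combine(*score_lists):
    """Hypothetical integration step: unweighted mean of per-source scores."""
    return [sum(vals) / len(vals) for vals in zip(*score_lists)]


# Toy values, invented for illustration only (not data from the study).
labels = [1, 1, 1, 0, 0, 0]                   # 1 = target in the gold standard
seq_score = [0.9, 0.4, 0.7, 0.6, 0.2, 0.3]    # a sequence-based predictor, scaled to 0..1
expr_score = [0.8, 0.9, 0.3, 0.2, 0.4, 0.1]   # an expression-based evidence score, scaled to 0..1

print("sequence only:  ", auc(seq_score, labels))
print("expression only:", auc(expr_score, labels))
print("combined:       ", auc(combine(seq_score, expr_score), labels))
```

In this toy example each source misranks one positive-negative pair, while the averaged score ranks every positive above every negative. That illustrates why combining complementary evidence can raise AUC, but it says nothing about the size of the improvement on real data.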

[19]  A. Kohlmann,et al.  Perspectives of gene expression profiling for diagnosis and therapy in haematological malignancies. , 2009, Briefings in functional genomics & proteomics.

[20]  P. Zamore,et al.  MicroRNAs: Regulating a Change of Heart , 2009, Circulation.

[21]  C. Burge,et al.  Conserved Seed Pairing, Often Flanked by Adenosines, Indicates that Thousands of Human Genes are MicroRNA Targets , 2005, Cell.

[22]  Gordon K. Smyth,et al.  A comparison of background correction methods for two-colour microarrays , 2007, Bioinform..

[23]  Tyson A. Clark,et al.  Evaluation of genetic variation contributing to differences in gene expression between populations. , 2008, American journal of human genetics.

[24]  M. Eileen Dolan,et al.  Chemotherapeutic drug susceptibility associated SNPs are enriched in expression quantitative trait loci , 2010, Proceedings of the National Academy of Sciences.

[25]  Joshua T. Burdick,et al.  Common genetic variants account for differences in gene expression among ethnic groups , 2007, Nature Genetics.

[26]  L. Lim,et al.  MicroRNA targeting specificity in mammals: determinants beyond seed pairing. , 2007, Molecular cell.

[27]  R. Redon,et al.  Relative Impact of Nucleotide and Copy Number Variation on Gene Expression Phenotypes , 2007, Science.

[28]  Isaac Bentwich Prediction and validation of microRNAs and their targets , 2005, FEBS letters.

[29]  M. Olivier A haplotype map of the human genome , 2003, Nature.

[30]  Anton J. Enright,et al.  MicroRNA targets in Drosophila , 2003, Genome Biology.

[31]  J. Hanley,et al.  The meaning and use of the area under a receiver operating characteristic (ROC) curve. , 1982, Radiology.

[32]  John D. Storey,et al.  Gene-expression variation within and among human populations. , 2007, American journal of human genetics.

[33]  Frank J. Slack,et al.  MicroRNAs and cancer: An overview , 2008, Cell cycle.

[34]  M. W. Foster,et al.  Integrating ethics and science in the International HapMap Project , 2004, Nature Reviews Genetics.

[35]  Stijn van Dongen,et al.  miRBase: tools for microRNA genomics , 2007, Nucleic Acids Res..

[36]  D. Koller,et al.  Population genomics of human gene expression , 2007, Nature Genetics.

[37]  J. Haerting,et al.  Gene-expression signatures in breast cancer. , 2003, The New England journal of medicine.

[38]  Thomas Lengauer,et al.  ROCR: visualizing classifier performance in R , 2005, Bioinform..

[39]  M. Olivier A haplotype map of the human genome. , 2003, Nature.

[40]  Lin He,et al.  MicroRNAs: small RNAs with a big role in gene regulation , 2004, Nature Reviews Genetics.

[41]  N. Cox,et al.  Trait-Associated SNPs Are More Likely to Be eQTLs: Annotation to Enhance Discovery from GWAS , 2010, PLoS genetics.