Efficient large-scale machine learning algorithms for genomic sequences
暂无分享,去创建一个
[1] V. Iyer,et al. FAIRE (Formaldehyde-Assisted Isolation of Regulatory Elements) isolates active regulatory elements from human chromatin. , 2007, Genome research.
[2] Aravind Subramanian,et al. Gene expression inference with deep learning , 2015 .
[3] Howard Y. Chang,et al. ATAC‐seq: A Method for Assaying Chromatin Accessibility Genome‐Wide , 2015, Current protocols in molecular biology.
[4] Tom H. Pringle,et al. The human genome browser at UCSC. , 2002, Genome research.
[5] Manolis Kellis,et al. Large-scale epigenome imputation improves data quality and disease variant enrichment , 2015, Nature Biotechnology.
[6] Peggy Hall,et al. The NHGRI GWAS Catalog, a curated resource of SNP-trait associations , 2013, Nucleic Acids Res..
[7] Yi Li,et al. Deconvolving tumor purity and ploidy by integrating copy number alterations and loss of heterozygosity , 2014, Bioinform..
[8] O. Cappé,et al. On‐line expectation–maximization algorithm for latent data models , 2009 .
[9] Morteza Mohammad Noori,et al. Enhanced Regulatory Sequence Prediction Using Gapped k-mer Features , 2014, PLoS Comput. Biol..
[10] Benjamin J. Strober,et al. A method to predict the impact of regulatory variants from DNA sequence , 2015, Nature Genetics.
[11] Ellen T. Gelfand,et al. The Genotype-Tissue Expression (GTEx) project , 2013, Nature Genetics.
[12] Galt P. Barber,et al. BigWig and BigBed: enabling browsing of large distributed datasets , 2010, Bioinform..
[13] William Stafford Noble,et al. Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project , 2007, Nature.
[14] Xiaohui Xie,et al. Co-Occurrence Feature Learning for Skeleton Based Action Recognition Using Regularized Deep LSTM Networks , 2016, AAAI.
[15] H. Hakonarson,et al. ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data , 2010, Nucleic acids research.
[16] David D. Cox,et al. Making a Science of Model Search: Hyperparameter Optimization in Hundreds of Dimensions for Vision Architectures , 2013, ICML.
[17] Martin Renqiang Min,et al. An integrated encyclopedia of DNA elements in the human genome , 2012 .
[18] Jürgen Schmidhuber,et al. Framewise phoneme classification with bidirectional LSTM and other neural network architectures , 2005, Neural Networks.
[19] Jens Keilwagen,et al. Varying levels of complexity in transcription factor binding motifs , 2015, Nucleic acids research.
[20] M. Daly,et al. Genome-wide mapping of DNase hypersensitive sites using massively parallel signature sequencing (MPSS). , 2005, Genome research.
[21] Michael A. Beer,et al. Predicting Gene Expression from Sequence , 2004, Cell.
[22] S. Gabriel,et al. Analysis of 6,515 exomes reveals a recent origin of most human protein-coding variants , 2012, Nature.
[23] Jianxing Feng,et al. Imputation for transcription factor binding predictions based on deep learning , 2017, PLoS Comput. Biol..
[24] Avanti Shrikumar,et al. Learning Important Features Through Propagating Activation Differences , 2017, ICML.
[25] Fidel Ramírez,et al. deepTools: a flexible platform for exploring deep-sequencing data , 2014, Nucleic Acids Res..
[26] B. Frey,et al. Predicting the sequence specificities of DNA- and RNA-binding proteins by deep learning , 2015, Nature Biotechnology.
[27] E. Birney,et al. High-resolution genome-wide in vivo footprinting of diverse transcription factors in human cells. , 2011, Genome research.
[28] F. Collins,et al. Potential etiologic and functional implications of genome-wide association loci for human diseases and traits , 2009, Proceedings of the National Academy of Sciences.
[29] M. Frommer,et al. CpG islands in vertebrate genomes. , 1987, Journal of molecular biology.
[30] William Stafford Noble,et al. Global mapping of protein-DNA interactions in vivo by digital genomic footprinting , 2009, Nature Methods.
[31] Navdeep Jaitly,et al. Hybrid speech recognition with Deep Bidirectional LSTM , 2013, 2013 IEEE Workshop on Automatic Speech Recognition and Understanding.
[32] David J. Arenillas,et al. JASPAR 2016: a major expansion and update of the open-access database of transcription factor binding profiles , 2015, Nucleic Acids Res..
[33] Jacques van Helden,et al. Regulatory Sequence Analysis Tools , 2003, Nucleic Acids Res..
[34] John E. Reid,et al. STEME: efficient EM to find motifs in large data sets , 2011, Nucleic acids research.
[35] Geoffrey E. Hinton,et al. On the importance of initialization and momentum in deep learning , 2013, ICML.
[36] Xiaohui Xie,et al. EXTREME: an online EM algorithm for motif discovery , 2014, Bioinform..
[37] J. Shendure,et al. A general framework for estimating the relative pathogenicity of human genetic variants , 2014, Nature Genetics.
[38] Matthew Stephens,et al. msCentipede: Modeling Heterogeneity across Genomic Sites and Replicates Improves Accuracy in the Inference of Transcription Factor Binding , 2015, PloS one.
[39] Alex P. Reynolds,et al. Genome-scale mapping of DNase I hypersensitivity. , 2013, Current protocols in molecular biology.
[40] Charles Elkan,et al. Fitting a Mixture Model By Expectation Maximization To Discover Motifs In Biopolymer , 1994, ISMB.
[41] Andrew Zisserman,et al. Deep Inside Convolutional Networks: Visualising Image Classification Models and Saliency Maps , 2013, ICLR.
[42] Charles Elkan,et al. Unsupervised learning of multiple motifs in biopolymers using expectation maximization , 1995, Mach. Learn..
[43] Daniel Quang,et al. DanQ: a hybrid convolutional and recurrent deep neural network for quantifying the function of DNA sequences , 2015 .
[44] Bronwen L. Aken,et al. GENCODE: The reference human genome annotation for The ENCODE Project , 2012, Genome research.
[45] Helga Thorvaldsdóttir,et al. Integrative Genomics Viewer (IGV): high-performance genomics data visualization and exploration , 2012, Briefings Bioinform..
[47] Timothy L. Bailey,et al. Gene expression Advance Access publication May 4, 2011 DREME: motif discovery in transcription factor ChIP-seq data , 2011 .
[48] Yann LeCun,et al. Signature Verification Using A "Siamese" Time Delay Neural Network , 1993, Int. J. Pattern Recognit. Artif. Intell..
[49] David R. Kelley,et al. Basset: Learning the regulatory code of the accessible genome with deep convolutional neural networks , 2015 .
[50] Hermann Ney,et al. Translation Modeling with Bidirectional Recurrent Neural Networks , 2014, EMNLP.
[51] Weiguo Liu,et al. GPU-MEME: Using Graphics Hardware to Accelerate Motif Finding in DNA Sequences , 2008, PRIB.
[52] P. Park,et al. Design and analysis of ChIP-seq experiments for DNA-binding proteins , 2008, Nature Biotechnology.
[53] Xiaohui Xie,et al. DANN: a deep learning approach for annotating the pathogenicity of genetic variants , 2015, Bioinform..
[54] Richard Leslie,et al. GRASP: analysis of genotype-phenotype results from 1390 genome-wide association studies and corresponding open access database , 2014, Bioinform..
[55] Kevin Y. Yip,et al. FunSeq2: a framework for prioritizing noncoding regulatory variants in cancer , 2014, Genome Biology.
[56] K. Lindblad-Toh,et al. Systematic discovery of regulatory motifs in human promoters and 3′ UTRs by comparison of several mammals , 2005, Nature.
[57] Jonas Mueller,et al. Siamese Recurrent Architectures for Learning Sentence Similarity , 2016, AAAI.
[58] Shane J. Neph,et al. An expansive human regulatory lexicon encoded in transcription factor footprints , 2012, Nature.
[59] Razvan Pascanu,et al. Theano: new features and speed improvements , 2012, ArXiv.
[60] Takaya Saito,et al. The Precision-Recall Plot Is More Informative than the ROC Plot When Evaluating Binary Classifiers on Imbalanced Datasets , 2015, PloS one.
[61] D. Rubin,et al. Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .
[62] Howard Y. Chang,et al. Transposition of native chromatin for fast and sensitive epigenomic profiling of open chromatin, DNA-binding proteins and nucleosome position , 2013, Nature Methods.
[63] Michael Q. Zhang,et al. Integrative analysis of 111 reference human epigenomes , 2015, Nature.
[64] Sören Sonnenburg,et al. Optimized Cutting Plane Algorithm for Large-Scale Risk Minimization , 2009, J. Mach. Learn. Res..
[65] Stephen C. J. Parker,et al. Motif signatures in stretch enhancers are enriched for disease-associated genetic variants , 2015, Epigenetics & Chromatin.
[66] Sieu Phan,et al. Threshold for Positional Weight Matrix , 2008, Eng. Lett..
[67] Gaël Varoquaux,et al. Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..
[68] B. Pugh,et al. Comprehensive Genome-wide Protein-DNA Interactions Detected at Single-Nucleotide Resolution , 2011, Cell.
[69] T. D. Schneider,et al. Use of the 'Perceptron' algorithm to distinguish translational initiation sites in E. coli. , 1982, Nucleic acids research.
[70] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.
[71] William Stafford Noble,et al. Quantifying similarity between motifs , 2007, Genome Biology.
[72] Corinna Cortes,et al. Support-Vector Networks , 1995, Machine Learning.
[73] Monya Baker,et al. One-stop shop for disease genes , 2012, Nature.
[74] Wyeth W. Wasserman,et al. JASPAR: an open-access database for eukaryotic transcription factor binding profiles , 2004, Nucleic Acids Res..
[75] O. Stegle,et al. DeepCpG: accurate prediction of single-cell DNA methylation states using deep learning , 2016, Genome Biology.
[76] G. Benson,et al. Tandem repeats finder: a program to analyze DNA sequences. , 1999, Nucleic acids research.
[77] Philip Machanick,et al. The value of position-specific priors in motif discovery using MEME , 2010, BMC Bioinformatics.
[78] Philip Machanick,et al. MEME-ChIP: motif analysis of large DNA datasets , 2011, Bioinform..
[79] Denis Thieffry,et al. RSAT 2015: Regulatory Sequence Analysis Tools , 2015, Nucleic Acids Res..
[80] Tatsunori B. Hashimoto,et al. Discovery of non-directional and directional pioneer transcription factors by modeling DNase profile magnitude and shape , 2014, Nature Biotechnology.
[81] Charles Elkan,et al. The Value of Prior Knowledge in Discovering Motifs with MEME , 1995, ISMB.
[82] Brendan J. Frey,et al. Deep learning of the tissue-regulated splicing code , 2014, Bioinform..
[83] Manolis Kellis,et al. ChromHMM: automating chromatin-state discovery and characterization , 2012, Nature Methods.
[84] Nitish Srivastava,et al. Improving Neural Networks with Dropout , 2013 .
[85] J. Uhm. Comprehensive genomic characterization defines human glioblastoma genes and core pathways , 2009 .
[86] E. Zeggini,et al. Functional annotation of non-coding sequence variants , 2014, Nature Methods.
[87] Wei Wang,et al. Predicting the Human Epigenome from DNA Motifs , 2014, Nature Methods.
[88] Jürgen Schmidhuber,et al. Long Short-Term Memory , 1997, Neural Computation.
[89] O. Troyanskaya,et al. Predicting effects of noncoding variants with deep learning–based sequence model , 2015, Nature Methods.
[90] William Stafford Noble,et al. Unsupervised pattern discovery in human chromatin structure through genomic segmentation , 2012, Nature Methods.
[91] L. Pachter,et al. Streaming fragment assignment for real-time analysis of sequencing experiments , 2012, Nature Methods.
[92] B. Williams,et al. Mapping and quantifying mammalian transcriptomes by RNA-Seq , 2008, Nature Methods.
[93] A. Mortazavi,et al. Genome-Wide Mapping of in Vivo Protein-DNA Interactions , 2007, Science.
[94] Jacob F. Degner,et al. Sequence and Chromatin Accessibility Data Accurate Inference of Transcription Factor Binding from Dna Material Supplemental Open Access , 2022 .
[95] A. Siepel,et al. Probabilities of Fitness Consequences for Point Mutations Across the Human Genome , 2014, Nature Genetics.
[96] Finn Drabløs,et al. Accelerating Motif Discovery: Motif Matching on Parallel Hardware , 2006, WABI.
[97] Geoffrey E. Hinton,et al. Learning representations by back-propagating errors , 1986, Nature.