Scaling Machine Learning for Target Prediction in Drug Discovery using Apache Spark
暂无分享,去创建一个
Jörg K. Wegner | Wolfgang De Meuter | Roel Wuyts | Dries Harnie | Marvin N. Steijaert | Alexander E. Vapirev | Andrey Gedich | W. Meuter | J. Wegner | M. Steijaert | A. Vapirev | Roel Wuyts | Dries Harnie | Andrey Gedich
[1] Douglas M. Hawkins,et al. The Problem of Overfitting , 2004, J. Chem. Inf. Model..
[2] Daniel S. Katz,et al. Swift: A language for distributed parallel scripting , 2011, Parallel Comput..
[3] Reynold Xin,et al. GraphX: a resilient distributed graph system on Spark , 2013, GRADES.
[4] J. Arrowsmith,et al. Trial Watch: Phase II and Phase III attrition rates 2011–2012 , 2013, Nature Reviews Drug Discovery.
[5] Sanjay Ghemawat,et al. MapReduce: Simplified Data Processing on Large Clusters , 2004, OSDI.
[6] Hairong Kuang,et al. The Hadoop Distributed File System , 2010, 2010 IEEE 26th Symposium on Mass Storage Systems and Technologies (MSST).
[7] Jens Palsberg,et al. Concurrent Collections , 2010 .
[8] Nci Dream Community. A community effort to assess and improve drug sensitivity prediction algorithms , 2014 .
[9] Jean-Philippe Vert,et al. Protein-ligand interaction prediction: an improved chemogenomics approach , 2008, Bioinform..
[10] George Karypis,et al. Multi-Assay-Based Structure-Activity Relationship Models: Improving Structure-Activity Relationship Models by Incorporating Activity Information from Related Targets , 2009, J. Chem. Inf. Model..
[11] Ulf Leser,et al. Parallelization in Scientific Workflow Management Systems , 2013, ArXiv.
[12] Ole Tange,et al. GNU Parallel: The Command-Line Power Tool , 2011, login Usenix Mag..
[13] Scott Shenker,et al. Fast and Interactive Analytics over Hadoop Data with Spark , 2012, login Usenix Mag..
[14] Woody Sherman,et al. Boosting Virtual Screening Enrichments with Data Fusion: Coalescing Hits from Two-Dimensional Fingerprints, Shape, and Docking , 2013, J. Chem. Inf. Model..
[15] Michael J. Franklin,et al. Resilient Distributed Datasets: A Fault-Tolerant Abstraction for In-Memory Cluster Computing , 2012, NSDI.
[16] Cheng-Hao Tsai,et al. Large-scale logistic regression and linear support vector machines using spark , 2014, 2014 IEEE International Conference on Big Data (Big Data).
[17] Laura M. Heiser,et al. A community effort to assess and improve drug sensitivity prediction algorithms , 2014, Nature Biotechnology.
[18] James Reinders,et al. Intel threading building blocks - outfitting C++ for multi-core processor parallelism , 2007 .
[19] George Karypis,et al. Target Fishing for Chemical Compounds Using Target-Ligand Activity Data and Ranking Based Methods , 2009, J. Chem. Inf. Model..
[20] Elisa Michelini,et al. Protein ligand interaction prediction , 2012 .
[21] Marek S. Wiewiórka,et al. SparkSeq: fast, scalable and cloud-ready tool for the interactive genomic data analysis with nucleotide precision , 2014, Bioinform..
[22] Lirong Wang,et al. TargetHunter: An In Silico Target Identification Tool for Predicting Therapeutic Potential of Small Organic Molecules Based on Chemogenomic Database , 2013, The AAPS Journal.
[23] P. Clemons,et al. Target identification and mechanism of action in chemical biology and drug discovery. , 2013, Nature chemical biology.
[24] Jens Palsberg,et al. Concurrent Collections , 2010, Sci. Program..
[25] Fei Yuan,et al. Chemical Descriptors Are More Important Than Learning Algorithms for Modelling , 2012, Molecular informatics.
[26] Andreas Bender,et al. In Silico Target Predictions: Defining a Benchmarking Data Set and Comparison of Performance of the Multiclass Naïve Bayes and Parzen-Rosenblatt Window , 2013, J. Chem. Inf. Model..
[27] Nikil Wale,et al. Machine learning in drug discovery and development , 2011 .