Optimising parallel R correlation matrix calculations on gene expression data using MapReduce

[1]  Philip Sedgwick,et al.  Pearson’s correlation coefficient , 2012, BMJ : British Medical Journal.

[2]  P. Sedgwick Spearman’s rank correlation coefficient , 2018, British Medical Journal.

[3]  Torsten Haferlach,et al.  An international standardization programme towards the application of gene expression profiling in routine leukaemia diagnostics: the Microarray Innovations in LEukemia study prephase , 2008, British journal of haematology.

[4]  E. Perakslis,et al.  Effective knowledge management in translational medicine , 2010, Journal of Translational Medicine.

[5]  Qi Li,et al.  A Chunking Method for Euclidean Distance Matrix Calculation on Large Dataset Using Multi-GPU , 2010, 2010 Ninth International Conference on Machine Learning and Applications.

[6]  Stephen H. Friend,et al.  How molecular profiling could revolutionize drug discovery , 2005, Nature Reviews Drug Discovery.

[7]  BMC Bioinformatics , 2005 .

[8]  Leming Shi,et al.  Effect of training-sample size and classification difficulty on the accuracy of genomic predictors , 2010, Breast Cancer Research.

[9]  H. Abdi The Kendall Rank Correlation Coefficient , 2007 .

[10]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[11]  F. Zhan,et al.  Prognostic value of Cyclin D2 mRNA expression in newly diagnosed multiple myeloma treated with high-dose chemotherapy and tandem autologous stem cell transplantations , 2006, Leukemia.

[12]  M. Metzker Sequencing technologies — the next generation , 2010, Nature Reviews Genetics.

[13]  C. Spearman The proof and measurement of association between two things. By C. Spearman, 1904. , 1987, The American journal of psychology.

[14]  S. Shurtleff,et al.  Clinical utility of microarray-based gene expression profiling in the diagnosis and subclassification of leukemia: report from the International Microarray Innovations in Leukemia Study Group. , 2010, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[15]  Steven J. M. Jones,et al.  Comprehensive molecular portraits of human breast tumours , 2013 .

[16]  Bowei Xi,et al.  Large complex data: divide and recombine (D&R) with RHIPE , 2012 .

[17]  Hao Yu,et al.  Programming with Big Data – Interface to MPI , 2016 .

[18]  Scott Shenker,et al.  Spark: Cluster Computing with Working Sets , 2010, HotCloud.

[19]  Yike Guo,et al.  IC Cloud: A Design Space for Composable Cloud Computing , 2010, 2010 IEEE 3rd International Conference on Cloud Computing.

[20]  S. Williams,et al.  Pearson's correlation coefficient. , 1996, The New Zealand medical journal.

[21]  O. Cope,et al.  Multiple myeloma. , 1948, The New England journal of medicine.

[22]  Guido Schwarzer,et al.  Easier parallel computing in R with snowfall and sfCluster , 2009, R J..

[23]  Samuel P. Midkiff,et al.  RABID -- A General Distributed R Processing Framework Targeting Large Data-Set Problems , 2013, 2013 IEEE International Congress on Big Data.

[24]  Tom White,et al.  Hadoop: The Definitive Guide , 2009 .

[25]  Steven J. M. Jones,et al.  Comprehensive molecular portraits of human breast tumors , 2012, Nature.

[26]  Maqc Consortium The MicroArray Quality Control ( MAQC )-II study of common practices for the development and validation of microarray-based predictive models , 2012 .