Get Training Numbers

3 The NCI60/Novartis Data There are actually several different sets of array profiles of the NCI60 cell lines. Most can be accessed from http://discover.nci.nih.gov. As noted by Coombes et al. [2], the particular data used by Potti et al. [3] came from U95Av2 arrays run by Novartis, with 59 samples (MDA-N is missing) run in triplicate. The arrays are indexed with an A, B, or C to designate the replicate set. Only the numbers from the A set were used. Further, probesets corresponding to Affymetrix control probes were excluded, leaving 12558 probesets as opposed to 12625. More details on the Novartis samples and the R data objects we use here are available at http://bioinformatics.mdanderson.org/Supplements/ReproRsch-Chemo; see particularly reports SR1 and SR2. We also load the lists of cell lines used for each drug (as inferred by Coombes et al. [2]) to help order the data matrix.