The identification and assessment of prognostic factors is one of the major tasks in clinical research. The assessment of one single prognostic factor can be done by recently established methods for using optimal cutpoints. Here, we suggest a method to consider an optimal selected prognostic factor from a set of prognostic factors of interest. This can be viewed as a variable selection method and is the underlying decision problem at each node of various tree building algorithms.
We propose to use maximally selected statistics where the selection is defined over the set of prognostic factors and over all cutpoints in each prognostic factor. We demonstrate that it is feasible to compute the approximate null distribution. We illustrate the new variable selection test with data of the German Breast Cancer Study Group and of a small study on patients with diffuse large B-cell lymphoma. Using the null distribution for a p-value adjusted regression trees algorithm, we adjust for the number of variables analysed at each node as well. (© 2004 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim)