Performance Analysis and Prediction for Data Mining Systems

We establish theoretical limits on the performance of certain data mining algorithms based only on the properties of the data sets being considered. We demonstrate the use of the bounds with an example based on data generated by an artificial world simulator. We point to extensions of this work and to connections with other fields.