Resolution and predictability: An approach to the scaling problem

We analyzed the relationship between resolution and predictability and found that while increasing resolution provides more descriptive information about the patterns in data, it also increases the difficulty of accurately modeling those patterns. There are limits to the predictability of natural phenomenon at particular resolutions, and “fractal-like” rules determine how both “data” and “model” predictability change with resolution.We analyzed land use data by resampling map data sets at several different spatial resolutions and measuring predictability at each. Spatial auto-predictability (Pa) is the reduction in uncertainty about the state of a pixel in a scene given knowledge of the state of adjacent pixels in that scene, and spatial cross-predictability (Pc) is the reduction in uncertainty about the state of a pixel in a scene given knowledge of the state of corresponding pixels in other scenes. Pa is a measure of the internal pattern in the data while Pc is a measure of the ability of some other “model” to represent that pattern. We found a strong linear relationship between the log of Pa and the log of resolution (measured as the number of pixels per square kilometer). This fractal-like characteristic of “self-similarity” with decreasing resolution implies that predictability may be best described using a unitless dimension that summarizes how it changes with resolution. While Pa generally increases with increasing resolution (because more information is being included), Pc generally falls or remains stable (because it is easier to model aggregate results than fine grain ones). Thus one can define an “optimal” resolution for a particular modeling problem that balances the benefit in terms of increasing data predictability (Pa) as one increases resolution, with the cost of decreasing model predictability (Pc).