The Future of Data Analysis

For a long time I have thought I was a statistician, interested in inferences from the particular to the general. But as I have watched mathematical statistics evolve, I have had cause to wonder and to doubt. And when I have pondered about why such techniques as the spectrum analysis of time series have proved so useful, it has become clear that their “dealing with fluctuations” aspects are, in many circumstances, of lesser importance than the aspects that would already have been required to deal effectively with the simpler case of very extensive data, where fluctuations would no longer be a problem. All in all, I have come to feel that my central interest is in data analysis, which I take to include, among other things: procedures for analyzing data, techniques for interpreting the results of such procedures, ways of planning the gathering of data to make its analysis easier, more precise or more accurate, and all the machinery and results of (mathematical) statistics which apply to analyzing data.

[1]  K. Pearson Contributions to the Mathematical Theory of Evolution , 1894 .

[2]  K. Pearson Contributions to the Mathematical Theory of Evolution. II. Skew Variation in Homogeneous Material , 1895 .

[3]  K. Pearson On the Criterion that a Given System of Deviations from the Probable in the Case of a Correlated System of Variables is Such that it Can be Reasonably Supposed to have Arisen from Random Sampling , 1900 .

[4]  Anwendungen der Wahrscheinlichkeitsrechnung auf Statistik , 1904 .

[5]  Student Errors of Routine Analysis , 1927 .

[6]  N. Wiener Generalized harmonic analysis , 1930 .

[7]  R. A. Fisher,et al.  Design of Experiments , 1936 .

[8]  B. L. Welch ON THE z-TEST IN RANDOMIZED BLOCKS AND LATIN SQUARES , 1937 .

[9]  E. Pitman Significance Tests Which May be Applied to Samples from Any Populations , 1937 .

[10]  E. Pitman SIGNIFICANCE TESTS WHICH MAY BE APPLIED TO SAMPLES FROM ANY POPULATIONS III. THE ANALYSIS OF VARIANCE TEST , 1938 .

[11]  A. Wald,et al.  On the Choice of the Number of Class Intervals in the Application of the Chi Square Test , 1942 .

[12]  L. Guttman,et al.  Statistical Adjustment of Data , 1944 .

[13]  D. J. Finney Probit analysis; a statistical treatment of the sigmoid response curve. , 1947 .

[14]  R. Geary,et al.  Testing for Normality , 2003 .

[15]  L S PENROSE,et al.  Some notes on discrimination. , 1947, Annals of eugenics.

[16]  H. Hartley,et al.  Tests of significance in harmonic analysis. , 1949, Biometrika.

[17]  J. Tukey,et al.  Dyadic anova, an analysis of variance for vectors. , 1949, Human biology.

[18]  J. Tukey One Degree of Freedom for Non-Additivity , 1949 .

[19]  A. Gayen The distribution of student's t in random samples of any size drawn from non-normal universes. , 1949, Biometrika.

[20]  J. Walsh,et al.  Applications of some significance tests for the median which are valid under very general conditions. , 1949, Journal of the American Statistical Association.

[21]  N. L. Johnson,et al.  Systems of frequency curves generated by methods of translation. , 1949, Biometrika.

[22]  W. Hollander,et al.  Introgressive Hybridization , 1949, The Yale Journal of Biology and Medicine.

[23]  John E. Walsh,et al.  On the Range-Midrange Test and Some Tests with Bounded Significance Levels , 1949 .

[24]  I. Lerner,et al.  Population genetics and animal improvement. , 1950 .

[25]  H. Eysenck,et al.  Criterion analysis; an application of the hypothetico-deductive method to factor analysis. , 1950, Psychological review.

[26]  W. Stevens,et al.  Fiducial limits of the parameter of a discontinuous distribution. , 1950, Biometrika.

[27]  J. Tukey,et al.  Components in regression. , 1951, Biometrics.

[28]  W. G. Cochran,et al.  Improvement by Means of Selection , 1951 .

[29]  Calyampudi R. Rao,et al.  Advanced Statistical Methods in Biometric Research. , 1953 .

[30]  David Lindley,et al.  Advanced Statistical Methods in Biometric Research. , 1953 .

[31]  Hans J. Eysenck,et al.  The Scientific Study of Personality , 1953 .

[32]  S. C. Pearce,et al.  Field experimentation with fruit trees and other perennial plants , 1953 .

[33]  Gerald J. Lieberman,et al.  Use of Normal Probability Paper , 1954 .

[34]  C. P. Blacker,et al.  The scientific study of personality , 1955 .

[35]  A. Bernard,et al.  The plotting of observations on probability-paper , 1955 .

[36]  A. E. Sarhan,et al.  Estimation of Location and Scale Parameters by Order Statistics from Singly and Doubly Censored Samples , 1956 .

[37]  J. Tukey,et al.  AVERAGE VALUES OF MEAN SQUARES IN FACTORIALS , 1956 .

[38]  Gerald J. Lieberman,et al.  The Use of Generalized Probability Paper for Continuous Distributions , 1956 .

[39]  E. Anderson A SEMIGRAPHICAL METHOD FOR THE ANALYSIS OF COMPLEX PROBLEMS. , 1957, Proceedings of the National Academy of Sciences of the United States of America.

[40]  P. Sneath,et al.  Some thoughts on bacterial classification. , 1957, Journal of general microbiology.

[41]  S. Vajda,et al.  Symposium on Monte Carlo Methods , 1957, The Mathematical Gazette.

[42]  M. A. Creasy Analysis of Variance as an Alternative to Factor Analysis , 1957 .

[43]  John W. Tukey,et al.  Antithesis or regression? , 1957, Mathematical Proceedings of the Cambridge Philosophical Society.

[44]  P. Sneath The application of computers to taxonomy. , 1957, Journal of general microbiology.

[45]  R. Gnanadesikan,et al.  Further contributions to multivariate confidence bounds , 1957 .

[46]  W. J. Dixon Estimates of the Mean and Standard Deviation of a Normal Population , 1957 .

[47]  David R. Cox,et al.  THE USE OF A CONCOMITANT VARIABLE IN SELECTING AN EXPERIMENTAL DESIGN , 1957 .

[48]  R. Sokal,et al.  A QUANTITATIVE APPROACH TO A PROBLEM IN CLASSIFICATION† , 1957, Evolution; International Journal of Organic Evolution.

[49]  N. L. Johnson,et al.  Multivariate Analysis , 1958, Nature.

[50]  D. Cox Some problems connected with statistical inference , 1958 .

[51]  A. Dempster A HIGH DIMENSIONAL TWO SAMPLE SIGNIFICANCE TEST , 1958 .

[52]  R. Gnanadesikan,et al.  A Note on `Further Contributions to Multivariate Confidence Bounds' , 1958 .

[53]  S. T. Cowan,et al.  An electro-taxonomic survey of bacteria. , 1958, Journal of general microbiology.

[54]  E. Hannan The Estimation of the Spectral Density after Trend Removal , 1958 .

[55]  R. Maurice Selection of the population with the largest mean when comparisons can be made only in pairs , 1958 .

[56]  A. E. Sarhan,et al.  Estimation of Location and Scale Parameters by Order Statistics from Singly and Doubly Censored Samples: Part II. Tables for the Normal Distribution for Samples of Size $11 \leqq n \leqq 15$ , 1958 .

[57]  J. Roy,et al.  STEP-DOWN PROCEDURE IN MULTIVARIATE ANALYSIS , 1958 .

[58]  L. Tucker An inter-battery method of factor analysis , 1958 .

[59]  G. Barnard Control Charts and Stochastic Processes , 1959 .

[60]  C. Daniel Use of Half-Normal Plots in Interpreting Factorial Two-Level Experiments , 1959 .

[61]  J. Kiefer Optimum Experimental Designs , 1959 .

[62]  F. E. Satterthwaite Random Balance Experimentation , 1959 .

[63]  Comments on “the Simplest Signed-Rank Tests” , 1959 .

[64]  Calyampudi R. Rao SOME PROBLEMS INVOLVING LINEAR HYPOTHESES IN MULTIVARIATE ANALYSIS , 1959 .

[65]  D J Rogers,et al.  A Computer Program for Classifying Plants. , 1960, Science.

[66]  W. R. Buckland,et al.  Contributions to Probability and Statistics , 1960 .

[67]  C. Dunnett On Selecting the Largest of k Normal Population Means , 1960 .

[68]  W. Dixon Simplified Estimation from Censored Normal Samples , 1960 .

[69]  J. Tukey A survey of sampling from contaminated distributions , 1960 .

[70]  J. Tukey,et al.  A Nonparametric Sum of Ranks Procedure for Relative Spread in Unpaired Samples , 1960 .

[71]  Gunnar Blom,et al.  Statistical Estimates and Transformed Beta-Variables. , 1960 .

[72]  A. Dempster A significance test for the separation of two highly multivariate small samples , 1960 .

[73]  J. Tukey Discussion, Emphasizing the Connection Between Analysis of Variance and Spectrum Analysis* , 1961 .

[74]  R. Gnanadesikan,et al.  GRAPHICAL ANALYSIS OF MULTI-RESPONSE EXPERIMENTAL DATA USING ORDERED DISTANCES. , 1961, Proceedings of the National Academy of Sciences of the United States of America.

[75]  F. J. Anscombe,et al.  The Examination and Analysis of Residuals , 1963 .