STATISTICAL AND MEASUREMENT PROPERTIES OF FEATURES USED IN ESSAY ASSESSMENT

Statistical and measurement properties are examined for features used in essay assessment to determine the generalizability of the features across populations, prompts, and individuals. Data are employed from TOEFL® and GMAT® examinations and from writing for CriterionSM.

[1]  C. Gini Variabilità e mutabilità : contributo allo studio delle distribuzioni e delle relazioni statistiche , 1912 .

[2]  R. Flesch A new readability yardstick. , 1948, The Journal of applied psychology.

[3]  E. H. Simpson Measurement of Diversity , 1949, Nature.

[4]  H. Gulliksen Theory of mental tests , 1952 .

[5]  Rupert G. Miller A Trustworthy Jackknife , 1964 .

[6]  Calyampudi Radhakrishna Rao,et al.  Linear Statistical Inference and its Applications , 1967 .

[7]  N. Draper,et al.  Applied Regression Analysis. , 1967 .

[8]  Ellis B. Page,et al.  Statistical and Linguistic Strategies in the Computer Grading of Essays , 1967, COLING.

[9]  E. B. Page,et al.  The use of the computer in analyzing student essays , 1968 .

[10]  J. Peter Kincaid,et al.  Derivation and Validation of the Automated Readability Index for Use with Technical Materials , 1970 .

[11]  Henry B. Slotnick TOWARD A THEORY OF COMPUTER ESSAY GRADING , 1972 .

[12]  R. P. Fishburne,et al.  Derivation of New Readability Formulas (Automated Readability Index, Fog Count and Flesch Reading Ease Formula) for Navy Enlisted Personnel , 1975 .

[13]  M. Kupperman Linear Statistical Inference and Its Applications 2nd Edition (C. Radhakrishna Rao) , 1975 .

[14]  R. D. Bock,et al.  Multivariate Statistical Methods in Behavioral Research , 1978 .

[15]  M. Coleman,et al.  A computer readability formula designed for machine scoring. , 1975 .

[16]  B. Efron,et al.  Estimating the number of unseen species: How many words did Shakespeare know? Biometrika 63 , 1976 .

[17]  R. Brennan Elements of generalizability theory , 1983 .

[18]  Stephen Reid,et al.  Writer's workbench analysis of holistically scored essays , 1986 .

[19]  S. Weisberg,et al.  Applied Linear Regression (2nd ed.). , 1986 .

[20]  E. B. Page Computer Grading of Student Prose, Using Modern Concepts and Software , 1994 .

[21]  H. Breland,et al.  THE COLLEGE BOARD VOCABULARY STUDY , 1994 .

[22]  H. Breland Word Frequency and Word Difficulty: A Comparison of Counts in Four Corpora , 1996 .

[23]  N. Draper,et al.  Applied Regression Analysis: Draper/Applied Regression Analysis , 1998 .

[24]  Peter W. Foltz,et al.  An introduction to latent semantic analysis , 1998 .

[25]  Brent Bridgeman,et al.  Writing Assessment in Admission to Higher Education: Review and Framework. College Board Report No. 99-3. GRE Board Research Report No. 96-12R. , 1999 .

[26]  W. Bossert,et al.  The Measurement of Diversity , 2001 .

[27]  Jill Burstein,et al.  Automated Essay Scoring : A Cross-disciplinary Perspective , 2003 .

[28]  Martin Chodorow,et al.  Automated Essay Evaluation: The Criterion Online Writing Service , 2004, AI Mag..