论文信息 - Using latent semantic analysis to assess knowledge: Some technical considerations

Using latent semantic analysis to assess knowledge: Some technical considerations

In another article (Wolfe et al., 1998/this issue) we showed how Latent Semantic Analysis (LSA) can be used to assess student knowledge—how essays can be graded by LSA and how LSA can match students with appropriate instructional texts. We did this by comparing an essay written by a student with one or more target instructional texts in terms of the cosine between the vector representation of the student's essay and the instructional text in question. This simple method was effective for the purpose, but questions remain about how LSA achieves its results and how the results might be improved. Here, we address four such questions: (a) What role does the use of technical vocabulary play? (b) how long should the student essays be? (c) is the cosine the optimal measure of semantic relatedness? and (d) how does one deal with the directionality of knowledge in the high‐dimensional space?

[1] C. Coombs. A theory of data. , 1965, Psychology Review.

[2] Donna K. Harman,et al. An experimental study of factors important in document ranking , 1986, SIGIR '86.

[3] G. Vining,et al. Data Analysis: A Model-Comparison Approach , 1989 .

[4] E. B. Page. Computer Grading of Student Prose, Using Modern Concepts and Software , 1994 .

[5] T. Landauer,et al. A Solution to Plato's Problem: The Latent Semantic Analysis Theory of Acquisition, Induction, and Representation of Knowledge. , 1997 .

[6] Peter W. Foltz,et al. Learning from text: Matching readers and texts by latent semantic analysis , 1998 .