论文信息 - Data‐driven approaches to information access

Data‐driven approaches to information access

This paper summarizes three lines of research that are motivated by the practical problem of helping users find information from external data sources, most notably computers. The application areas include information retrieval, text categorization, and question answering. A common theme in these applications is that practical information access problems can be solved by analyzing the statistical properties of words in large volumes of real world texts. The same statistical properties constrain human performance, thus we believe that solutions to practical information access problems can shed light on human knowledge representation and reasoning.

Susan T. Dumais | S. Dumais

[1] Thomas K. Landauer,et al. On the computational basis of learning and cognition: Arguments from LSA , 2002 .

[2] J. Deese. The structure of associations in language and thought , 1966 .

[3] Margaret G. McKeown,et al. The Contribution of Prior Knowledge and Coherent Text to Comprehension , 1992 .

[4] T. Landauer,et al. A Solution to Plato's Problem: The Latent Semantic Analysis Theory of Acquisition, Induction, and Representation of Knowledge. , 1997 .

[5] A. Graesser,et al. The Psychology of Questions , 1985 .

[6] Thomas Hofmann,et al. Probabilistic Latent Semantic Analysis , 1999, UAI.

[7] Nello Cristianini,et al. Latent Semantic Kernels , 2001, Journal of Intelligent Information Systems.

[8] Elizabeth D. Liddy,et al. Categorization and Standardizing Proper Nouns for Efficient Information Retrieval , 1996 .

[9] George A. Miller,et al. WordNet: A Lexical Database for English , 1995, HLT.

[10] Peter W. Foltz,et al. Learning from text: Matching readers and texts by latent semantic analysis , 1998 .

[11] Susan T. Dumais,et al. Hierarchical classification of Web content , 2000, SIGIR '00.

[12] W. Kintsch,et al. Are Good Texts Always Better? Interactions of Text Coherence, Background Knowledge, and Levels of Understanding in Learning From Text , 1996 .

[13] Susan T. Dumais,et al. Automatic Cross-Language Information Retrieval Using Latent Semantic Indexing , 1998 .

[14] W. Kintsch,et al. Time course of priming for associate and inference words in a discourse context , 1988, Memory & cognition.

[15] Dragomir R. Radev,et al. Question-answering by predictive annotation , 2000, SIGIR '00.

[16] Susan T. Dumais,et al. Personalized information delivery: an analysis of information filtering methods , 1992, CACM.

[17] Donna Harman,et al. How effective is suffixing , 1991 .

[18] Robert L. Goldstone,et al. Concepts and Categorization , 2003 .

[19] Peter W. Foltz,et al. The Measurement of Textual Coherence with Latent Semantic Analysis. , 1998 .

[20] Peter D. Turney. Mining the Web for Synonyms: PMI-IR versus LSA on TOEFL , 2001, ECML.

[21] James Pustejovsky,et al. Corpus processing for lexical acquisition , 1996 .

[22] Philip J. Hayes,et al. TCS: a shell for content-based text categorization , 1990, Sixth Conference on Artificial Intelligence for Applications.

[23] E. B. Page. Computer Grading of Student Prose, Using Modern Concepts and Software , 1994 .

[24] John C. Platt,et al. Fast training of support vector machines using sequential minimal optimization, advances in kernel methods , 1999 .

[25] Peter W. Foltz,et al. The intelligent essay assessor: Applications to educational technology , 1999 .

[26] Peter W. Foltz,et al. Reasoning from Multiple Texts: An Automatic Analysis of Readers? Situation Models , 1996 .

[27] Peter J. Rousseeuw,et al. Finding Groups in Data: An Introduction to Cluster Analysis , 1990 .

[28] Thomas L. Griffiths,et al. A probabilistic approach to semantic representation , 2019, Proceedings of the Twenty-Fourth Annual Conference of the Cognitive Science Society.

[29] Marcia J. Bates,et al. Subject access in online catalogs: A design model , 1986 .

[30] Vladimir N. Vapnik,et al. The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[31] Peter W. Foltz,et al. An introduction to latent semantic analysis , 1998 .

[32] Hinrich Schütze,et al. A comparison of classifiers and document representations for the routing problem , 1995, SIGIR '95.

[33] Darrell Laham,et al. Latent Semantic Analysis Approaches to Categorization , 1997 .

[34] John R. Anderson,et al. A rational analysis of human memory. , 1989 .

[35] Sanda M. Harabagiu,et al. High performance question/answering , 2001, SIGIR '01.

[36] John R. Anderson,et al. Reflections of the Environment in Memory Form of the Memory Functions , 2022 .

[37] David D. Lewis,et al. Applying Support Vector Machines to the TREC-2001 Batch Filtering and Routing Tasks , 2001, TREC.

[38] Susan T. Dumais,et al. Optimizing search by showing results in context , 2001, CHI.

[39] Leon Flicker,et al. Latent Semantic Analysis: A New Method to Measure Prose Recall , 2002, Journal of clinical and experimental neuropsychology.

[40] Corinna Cortes,et al. Support-Vector Networks , 1995, Machine Learning.