SIGCHI Lifetime Research Award Talk: Interdisciplinary Perspectives on Search

I have long been interested in how people seek information from external sources and make sense of the results. While the sources of information have continued to evolve over the years from libraries, to web search engines, to virtual assistants, many important challenges and opportunities remain. The success of any information retrieval systems depends critically on both the ability to support people in articulating their information needs and making sense of the results to solve the problem that motivated their search in the first place, as well as the need to efficiently and effectively find relevant information. My research combines these two dimensions into an interdisciplinary, user-centered perspective on information systems. My interest in information retrieval started in the early 1980's with the observation that different people use a surprisingly wide variety of words to describe the same object or concept. This fundamental characteristic of human language set limits on how well simple word-matching techniques can do in satisfying information needs. In a paper at the pre-CHI Gaithersburg conference in 1982 [6] we describe this problem as statistical semantics. (It is symptomatic of the problem that we subsequently used vocabulary mismatch, verbal disagreement, and statistical semantics to refer to the same problem.) Over the next decade, with colleagues at Bell Labs, I developed and evaluated solutions that involved collecting multiple descriptors for objects, and reducing the dimensionality of the representation using techniques like Latent Semantic Indexing (LSI) [3][7] to mitigate the disagreement between the vocabulary that authors use in writing and searchers use to express their information needs. Similar approaches (combined with a lot more data and compute) are used to power modern word-embedding techniques that widely used in natural language processing.