Predicting natural language descriptions of smells

There has been recent progress in predicting whether common verbal descriptors such as “fishy”, “floral” or “fruity” apply to the smell of odorous molecules. However, the number of descriptors for which such a prediction is possible to date is very small compared to the large number of descriptors that have been suggested for the profiling of smells. We show here that the use of natural language semantic representations on a small set of general olfactory perceptual descriptors allows for the accurate inference of perceptual ratings for mono-molecular odorants over a large and potentially arbitrary set of descriptors. This is a noteworthy approach given that the prevailing view is that human’s capacity to identify or characterize odors by name is poor [1, 2, 3, 4, 5]. Our methods, when combined with a molecule-to-ratings model using chemoinformatic features, also allow for the zero-shot learning inference [6, 7] of perceptual ratings for arbitrary molecules. We successfully applied our semantics-based approach to predict perceptual ratings with an accuracy higher than 0.5 for up to 70 olfactory perceptual descriptors in a well-known dataset, a ten-fold increase in the number of descriptors from previous attempts. Moreover we accurately predict paradigm odors of four common families of molecules with an AUC of up to 0.75. Our approach solves the need for the consuming task of handcrafting domain specific sets of descriptors in olfaction and collecting ratings for large numbers of descriptors and odorants [8, 9, 10, 11] while establishing that the semantic distance between descriptors defines the equivalent of an odorwheel.

[1]  W S Cain,et al.  To know with the nose: keys to odor identification. , 1979, Science.

[2]  John B. Goodenough,et al.  Contextual correlates of synonymy , 1965, CACM.

[3]  R. G. Crowder,et al.  Memory for Odors , 1991 .

[4]  Andreas Keller,et al.  Olfactory perception of chemically diverse molecules , 2016, BMC Neuroscience.

[5]  M. Larsson,et al.  Semantic factors in episodic recognition of common odors in early and late adulthood: a review. , 1997, Chemical senses.

[6]  N. Schwarz Self-reports: How the questions shape the answers. , 1999 .

[7]  R. A. Arnold,et al.  Progress Towards a Standardized System of Wine Aroma Terminology , 1984, American Journal of Enology and Viticulture.

[8]  H. Zou,et al.  Regularization and variable selection via the elastic net , 2005 .

[9]  P. Kay,et al.  Language, thought, and color: Whorf was half right , 2009, Trends in Cognitive Sciences.

[10]  R. Dolan,et al.  The Nose Smells What the Eye Sees Crossmodal Visual Facilitation of Human Olfactory Perception , 2003, Neuron.

[11]  Friedrich Mueller,et al.  Odor classification: a review of factors influencing perception-based odor arrangements. , 2013, Chemical senses.

[12]  B. Bahrami,et al.  Motion Detection and Motion Verbs , 2007, Psychological science.

[13]  W. Kintsch,et al.  Are Good Texts Always Better? Interactions of Text Coherence, Background Knowledge, and Levels of Understanding in Learning From Text , 1996 .

[14]  T. Engen Remembering Odors and Their Names , 1987 .

[15]  J. H. Steiger Tests for comparing elements of a correlation matrix. , 1980 .

[16]  A. Dravnieks,et al.  Atlas of odor character profiles , 1992 .

[17]  Scott A. McDonald,et al.  Environmental Determinants of Lexical Processing Effort , 2000 .

[18]  Stephen Clark,et al.  Grounding Semantics in Olfactory Perception , 2015, ACL.

[19]  Zellig S. Harris,et al.  Distributional Structure , 1954 .

[20]  Tomas Mikolov,et al.  Enriching Word Vectors with Subword Information , 2016, TACL.

[21]  Amit Dhurandhar,et al.  Predicting human olfactory perception from chemical features of odor molecules , 2017, Science.

[22]  Karthik S. Gurumoorthy,et al.  ProtoDash: Fast Interpretable Prototype Selection , 2017, ArXiv.

[23]  A. Keller,et al.  Humans Can Discriminate More than 1 Trillion Olfactory Stimuli , 2014, Science.

[24]  Jeffrey Dean,et al.  Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.

[25]  Cheryl Corcoran,et al.  Olfactory deficits, cognition and negative symptoms in early onset psychosis , 2005, Schizophrenia Research.

[26]  Peter W. Foltz,et al.  The Measurement of Textual Coherence with Latent Semantic Analysis. , 1998 .

[27]  W. Cain,et al.  Quantification of odor quality. , 2000, Chemical senses.

[28]  Gary Lupyan,et al.  Language can boost otherwise unseen objects into visual awareness , 2013, Proceedings of the National Academy of Sciences.

[29]  Richard A. Harshman,et al.  Indexing by Latent Semantic Analysis , 1990, J. Am. Soc. Inf. Sci..

[30]  N. Sobel,et al.  An odor is not worth a thousand words: from multidimensional odors to unidimensional odor objects. , 2013, Annual review of psychology.

[31]  T. Landauer,et al.  A Solution to Plato's Problem: The Latent Semantic Analysis Theory of Acquisition, Induction, and Representation of Knowledge. , 1997 .

[32]  J. Cronly-Dillon,et al.  Visual Agnosias and Other Disturbances of Visual Perception and Cognition , 1991 .

[33]  Tom Michael Mitchell,et al.  Predicting Human Brain Activity Associated with the Meanings of Nouns , 2008, Science.

[34]  Geoffrey E. Hinton,et al.  Zero-shot Learning with Semantic Output Codes , 2009, NIPS.

[35]  Gail Vance Civille,et al.  Developing Lexicons: A Review , 2013 .

[36]  P. Kuhl Early language acquisition: cracking the speech code , 2004, Nature Reviews Neuroscience.

[37]  K. Marder,et al.  Olfactory deficits in patients with mild cognitive impairment predict Alzheimer's disease at follow-up. , 2000, The American journal of psychiatry.