On the difficulty of making concreteness concrete

Labels for semantic properties such as ‘concreteness’ are common in syntactic studies, but their exact meaning is often unclear. In this article, we compare different definitions of concreteness and use them, in different implementations, to annotate nouns in two data sets: (1) all nouns with word sense annotations in the SemCor corpus, and (2) nouns in a particular lexico-syntactic context, viz. the theme (e.g. a book) in prepositional dative (gave a book to him) and double object (gave him a book) constructions. The results show that the definitions and implementations used in different approaches differ greatly, and that these differences can considerably affect the conclusions drawn in syntactic research. A follow-up crowdsourcing experiment showed that some instances are clearly concrete or abstract, but that there are also many instances on which humans disagree. Results concerning concreteness in syntactic research can therefore only be interpreted by taking into account the annotation scheme used and the type of data being analysed.
