Sentifiers: Interpreting Vague Intent Modifiers in Visual Analysis using Word Co-occurrence and Sentiment Analysis

Natural language interaction with data visualization tools often involves the use of vague subjective modifiers in utterances such as "show me the sectors that are performing" and "where is a good neighborhood to buy a house?." Interpreting these modifiers is often difficult for these tools because their meanings lack clear semantics and are in part defined by context and personal user preferences. This paper presents a system called \system that makes a first step in better understanding these vague predicates. The algorithm employs word co-occurrence and sentiment analysis to determine which data attributes and filters ranges to associate with the vague predicates. The provenance results from the algorithm are exposed to the user as interactive text that can be repaired and refined. We conduct a qualitative evaluation of the Sentifiers system that indicates the usefulness of the interface as well as opportunities for better supporting subjective utterances in visual analysis tasks through natural language.

[1]  Erez Lieberman Aiden,et al.  Quantitative Analysis of Culture Using Millions of Digitized Books , 2010, Science.

[2]  John T. Stasko,et al.  Orko: Facilitating Multimodal Interaction for Visual Exploration and Analysis of Networks , 2018, IEEE Transactions on Visualization and Computer Graphics.

[3]  S. Shapiro Vagueness in Context , 2006 .

[4]  W. Bruce Croft,et al.  Learning concept importance using a weighted dependence model , 2010, WSDM '10.

[5]  Stefano Vegnaduzzo Acquisition of Subjective Adjectives with Limited Resources , 2004 .

[6]  Marti Hearst,et al.  Toward Interface Defaults for Vague Modifiers in Natural Language Interfaces for Visual Analysis , 2019, 2019 IEEE Visualization Conference (VIS).

[7]  Yiwen Sun,et al.  Articulate: A Semi-automated Model for Translating Natural Language Queries into Meaningful Visualizations , 2010, Smart Graphics.

[8]  Johan Bos,et al.  Computational Semantics in Discourse: Underspecification, Resolution, and Inference , 2004, J. Log. Lang. Inf..

[9]  Mukund Sundararajan,et al.  Analyza: Exploring Data with Conversation , 2017, IUI.

[10]  Chirag Shah,et al.  Extracting Information Seeking Intentions for Web Search Sessions , 2016, SIGIR.

[11]  Christopher Potts,et al.  Recursive Deep Models for Semantic Compositionality Over a Sentiment Treebank , 2013, EMNLP.

[12]  Qiang Yang,et al.  Building bridges for web query classification , 2006, SIGIR.

[13]  Rada Mihalcea,et al.  Improving the Impact of Subjectivity Word Sense Disambiguation on Contextual Opinion Analysis , 2011, CoNLL.

[14]  Vidya Setlur,et al.  Applying Pragmatics Principles for Interaction with Visual Analytics , 2018, IEEE Transactions on Visualization and Computer Graphics.

[15]  Tie-Yan Liu,et al.  Actively predicting diverse search intent from user browsing behaviors , 2010, WWW '10.

[16]  Karrie Karahalios,et al.  DataTone: Managing Ambiguity in Natural Language Interfaces for Data Visualization , 2015, UIST.

[17]  Petra Hendriks,et al.  Review of C. Kennedy, Projecting the adjective. The syntax and semantics of gradability and comparison , 1999 .

[18]  Watts BuildingBrighton Generating Vague Descriptions , 2000 .

[19]  Paul Cairns,et al.  Doing Better Statistics in Human-Computer Interaction , 2019 .

[20]  Umberto Straccia,et al.  Fuzzy Ontology Representation using OWL 2 , 2010, Int. J. Approx. Reason..

[21]  Kevin Chen-Chuan Chang,et al.  Towards rich query interpretation: walking back and forth for mining query templates , 2010, WWW '10.

[22]  Jeffrey Heer,et al.  SpanningAspectRatioBank Easing FunctionS ArrayIn ColorIn Date Interpolator MatrixInterpola NumObjecPointI Rectang ISchedu Parallel Pause Scheduler Sequen Transition Transitioner Transiti Tween Co DelimGraphMLCon IData JSONCon DataField DataSc Dat DataSource Data DataUtil DirtySprite LineS RectSprite , 2011 .

[23]  Christopher D. Manning,et al.  Introduction to Information Retrieval , 2010, J. Assoc. Inf. Sci. Technol..

[24]  HeerJeffrey,et al.  D3 Data-Driven Documents , 2011 .

[25]  Ellen Riloff,et al.  Creating Subjective and Objective Sentence Classifiers from Unannotated Texts , 2005, CICLing.

[26]  Dan Klein,et al.  Feature-Rich Part-of-Speech Tagging with a Cyclic Dependency Network , 2003, NAACL.

[27]  Vidya Setlur,et al.  Inferencing underspecified natural language utterances in visual analysis , 2019, IUI.

[28]  Xiao Li,et al.  Understanding the Semantic Structure of Noun Phrase Queries , 2010, ACL.

[29]  Gerard de Melo,et al.  Good, Great, Excellent: Global Inference of Semantic Intensities , 2013, Transactions of the Association for Computational Linguistics.

[30]  Jonas Schreiber,et al.  The Analysis of Mind , 1935, Nature.

[31]  ChengXiang Zhai,et al.  Mining Coordinated Intent Representation for Entity Search and Recommendation , 2015, CIKM.

[32]  Vidya Setlur,et al.  Eviza: A Natural Language Interface for Visual Analysis , 2016, UIST.

[33]  Kenneth Ward Church,et al.  Word Association Norms, Mutual Information, and Lexicography , 1989, ACL.

[34]  Janyce Wiebe,et al.  Articles: Recognizing Contextual Polarity: An Exploration of Features for Phrase-Level Sentiment Analysis , 2009, CL.

[35]  Helen Nissenbaum,et al.  Bias in computer systems , 1996, TOIS.

[36]  Amanda Spink,et al.  Determining the user intent of web search engine queries , 2007, WWW '07.