Concept discovery and argument bundles in the web of experiences

Millions of people interact and share interesting information every day in the Social Web. From daily conversations to comments about products in e-commerce sites, the content generated by people in these sites is huge and diverse. Among the wide diversity of user-contributed content on the web, there is a particular kind that has the potential of being put to good use by intelligent systems: human experiences. People very often use other people's experiences before making decisions, and when these kind of human experiences are expressed and recorded on the web, they can be shared with by large number of people. Nevertheless sometimes this content is not easily accessible, so a person trying to book a hotel may read a few reviews over a few hotels - but cannot possibly read them all. There is a clear need for an in-depth analysis of this kind of information, based on textual expressions of human particular experiences. Our approach, in the framework of the Web of Experiences, aims at acquiring practical knowledge from individual experiences with entities in the real world expressed in textual form. Moreover, this knowledge has to be represented in a way that facilitates the reuse of the experiential knowledge by other individuals with different preferences. Our approach has three stages: First, we extract the most salient set of aspects used by the individuals to describe their experiences with the entities in a domain. Second, using the set of extracted aspects, we group them in concepts to create a concept vocabulary that models the set of issues addressed in the reviews. Third, using the vocabulary of concepts, we create a bundle of arguments for each entity. An argument bundle characterizes the pros and cons of an entity, aggregating practical knowledge from judgments written by individuals with different biases and preferences. Moreover, we show how argument bundles allow us to define the notions of user query and the satisfaction degree of a bundle by a user query, proving that argument bundles are not only capable of representing practical knowledge but they are also useful to perform inference given a set of user preferences specified in a query. We evaluate the argument bundles of our approach with the Amazon score ratings and the camera characterizations of Dpreview. We show that pro and con arguments are very close to those listed in Dpreview. Evaluating entity rankings, we show that Dpreview and our approach give congruent rankings, while Amazon's is not congruent neither with Dpreview's or ours.

[1]  Aidan Waugh,et al.  Using Experience on the Read/Write Web: The GhostWriter System , 2009 .

[2]  Enric Plaza,et al.  Aspect Selection for Social Recommender Systems , 2015, ICCBR.

[3]  A. Baddeley Working Memory and Conscious Awareness , 2019, Theories of Memory.

[4]  Enric Plaza,et al.  Sentiment and Preference Guided Social Recommendation , 2014, ICCBR.

[5]  Rosina O. Weber,et al.  Adapting Sentiments with Context , 2015, ICCBR.

[6]  Trevor J. M. Bench-Capon,et al.  Argumentation in artificial intelligence , 2007, Artif. Intell..

[7]  Equivalence between Tie-corrected Spearman Test and a Chi-square Test in a Fourfold Contingency Table , 1988 .

[8]  Enric Plaza,et al.  Preference and Sentiment Guided Social Recommendations with Temporal Dynamics , 2014, SGAI Conf..

[9]  Konrad P. Körding,et al.  A high-reproducibility and high-accuracy method for automated topic classification , 2014, ArXiv.

[10]  Cristina Bosco,et al.  Developing Corpora for Sentiment Analysis: The Case of Irony and Senti-TUT , 2013, IEEE Intelligent Systems.

[11]  S. Toulmin The uses of argument , 1960 .

[12]  Ivan Titov,et al.  Modeling online reviews with multi-grain topic models , 2008, WWW.

[13]  Barry Smyth,et al.  Further Experiments in Opinionated Product Recommendation , 2014, ICCBR.

[14]  S. Dumais Latent Semantic Analysis. , 2005 .

[15]  Thomas Hofmann,et al.  Probabilistic Latent Semantic Analysis , 1999, UAI.

[16]  Peter,et al.  Semantic Unification , 2008 .

[17]  P. Anderson What is Web 2.0? Ideas, technologies and implications for education , 2007 .

[18]  Yehuda Koren,et al.  Collaborative filtering with temporal dynamics , 2009, KDD.

[19]  E. Tulving Elements of episodic memory , 1983 .

[20]  Andrew McCallum,et al.  Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data , 2001, ICML.

[21]  Janusz S. Bień,et al.  Beliefs, Points of View, and Multiple Environments , 1983, Cogn. Sci..

[22]  Alice H. Oh,et al.  Aspect and sentiment unification model for online review analysis , 2011, WSDM '11.

[23]  Xuanjing Huang,et al.  Phrase Dependency Parsing for Opinion Mining , 2009, EMNLP.

[24]  Amélie Marian,et al.  Beyond the Stars: Improving Rating Predictions using Review Text Content , 2009, WebDB.

[25]  Niels Pinkwart,et al.  Computer-Supported Collaborative Learning DOI 10.1007/s11412-009-9080-x Computer-supported argumentation: A review of the state of the art , 2009 .

[26]  Hae-Chang Rim,et al.  Lexicalized Hidden Markov Models for Part-of-Speech Tagging , 2000, COLING.

[27]  Eric Chang,et al.  Red Opal: product-feature scoring from reviews , 2007, EC '07.

[28]  Stewart Massie,et al.  Unsupervised Feature Selection for Text Data , 2006, ECCBR.

[29]  Oren Etzioni,et al.  RevMiner: an extractive interface for navigating reviews on a smartphone , 2012, UIST.

[30]  Ronald Loui A Citation-based Reflection on Toulmin and Argument , 2005 .

[31]  Qiang Yang,et al.  Transferring topical knowledge from auxiliary long texts for short text clustering , 2011, CIKM '11.

[32]  H. Abdi The Kendall Rank Correlation Coefficient , 2007 .

[33]  Li Chen,et al.  Recommender systems based on user reviews: the state of the art , 2015, User Modeling and User-Adapted Interaction.

[34]  Enric Plaza,et al.  Case-Based Sequential Ordering of Songs for Playlist Recommendation , 2006, ECCBR.

[35]  Fabrizio Sebastiani,et al.  Machine learning in automated text categorization , 2001, CSUR.

[36]  Sutanu Chakraborti,et al.  Mining user trails in critiquing based recommenders , 2014, WWW.

[37]  Ralph H. Johnson Manifest Rationality: A Pragmatic Theory of Argument , 2000 .

[38]  Regina Barzilay,et al.  Automatic Aggregation by Joint Modeling of Aspects and Values , 2014, J. Artif. Intell. Res..

[39]  Harith Alani,et al.  Adapting Sentiment Lexicons Using Contextual Semantics for Sentiment Analysis of Twitter , 2014, ESWC.

[40]  Ralph Grishman,et al.  Information Extraction: Techniques and Challenges , 1997, SCIE.

[41]  Peter D. Turney Thumbs Up or Thumbs Down? Semantic Orientation Applied to Unsupervised Classification of Reviews , 2002, ACL.

[42]  Rob Malouf,et al.  A Comparison of Algorithms for Maximum Entropy Parameter Estimation , 2002, CoNLL.

[43]  Rohini K. Srihari,et al.  OpinionMiner: a novel machine learning system for web opinion mining and extraction , 2009, KDD.

[44]  Bing Liu,et al.  Mining and summarizing customer reviews , 2004, KDD.

[45]  Rob Huddleston,et al.  What is the web? , 2018, Introduction to HTML and CSS.

[46]  Tao Li,et al.  Product recommendation with temporal dynamics , 2012, Expert Syst. Appl..

[47]  Bing Liu,et al.  Opinion observer: analyzing and comparing opinions on the Web , 2005, WWW '05.

[48]  Barry Smyth,et al.  Personalized Opinion-Based Recommendation , 2016, ICCBR.

[49]  Yu Sun,et al.  Automatic Extraction for Product Feature Words from Comments on the Web , 2009, AIRS.

[50]  Himabindu Lakkaraju,et al.  Exploiting Coherence for the Simultaneous Discovery of Latent Facets and associated Sentiments , 2011, SDM.

[51]  Derek Bridge,et al.  The GhostWriter-2 . 0 System : Creating a Virtuous Circle in Web 2 . 0 Product Reviewing , 2010 .

[52]  Martin Ester,et al.  ILDA: interdependent LDA model for learning latent aspects and their ratings from online product reviews , 2011, SIGIR.

[53]  Nuria Oliver,et al.  I Like It... I Like It Not: Evaluating User Ratings Noise in Recommender Systems , 2009, UMAP.

[54]  Lingling Meng,et al.  A Review of Semantic Similarity Measures in WordNet 1 , 2013 .

[55]  Wayne D. Gray,et al.  Basic objects in natural categories , 1976, Cognitive Psychology.

[56]  Andrew McCallum,et al.  Information extraction from research papers using conditional random fields , 2006, Inf. Process. Manag..

[57]  Zhen Hai,et al.  Implicit Feature Identification via Co-occurrence Association Rule Mining , 2011, CICLing.

[58]  Ted Briscoe,et al.  Corpus Annotation for Parser Evaluation , 1999, ArXiv.

[59]  Vincent Aleven,et al.  Toward Legal Argument Instruction with Graph Grammars and Collaborative Filtering Techniques , 2006, Intelligent Tutoring Systems.

[60]  Stewart Massie,et al.  Term Similarity and Weighting Framework for Text Representation , 2011, ICCBR.

[61]  Tiejun Zhao,et al.  Target-dependent Twitter Sentiment Classification , 2011, ACL.

[62]  Enric Plaza,et al.  A Case-Based Song Scheduler for Group Customised Radio , 2007, ICCBR.

[63]  Martin Ester,et al.  Opinion digger: an unsupervised opinion miner from unstructured product reviews , 2010, CIKM.

[64]  German Rigau,et al.  Exploring the Automatic Selection of Basic Level Concepts , 2006 .

[65]  Adam L. Berger,et al.  A Maximum Entropy Approach to Natural Language Processing , 1996, CL.

[66]  Yue Lu,et al.  Latent aspect rating analysis without aspect keyword supervision , 2011, KDD.

[67]  Iyad Rahwan,et al.  Laying the foundations for a World Wide Argument Web , 2007, Artif. Intell..

[68]  Phan Minh Dung,et al.  An Abstract, Argumentation-Theoretic Approach to Default Reasoning , 1997, Artif. Intell..

[69]  Christopher D. Manning,et al.  Generating Typed Dependency Parses from Phrase Structure Parses , 2006, LREC.

[70]  Iryna Gurevych,et al.  Extracting Opinion Targets in a Single and Cross-Domain Setting with Conditional Random Fields , 2010, EMNLP.

[71]  Yoon Ho Cho,et al.  Mining changes in customer buying behavior for collaborative recommendations , 2005, Expert Syst. Appl..

[72]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[73]  Diana Maynard,et al.  Who cares about Sarcastic Tweets? Investigating the Impact of Sarcasm on Sentiment Analysis. , 2014, LREC.

[74]  E. Tulving Episodic and semantic memory: Where should we go from here? , 1986, Behavioral and Brain Sciences.

[75]  Claire Cardie,et al.  Multi-aspect Sentiment Analysis with Topic Models , 2011, 2011 IEEE 11th International Conference on Data Mining Workshops.

[76]  R. Suganya,et al.  Data Mining Concepts and Techniques , 2010 .

[77]  Klaus-Dieter Althoff,et al.  Extraction of Adaptation Knowledge from Internet Communities , 2009, LWA.

[78]  Soo-Min Kim,et al.  Automatic Identification of Pro and Con Reasons in Online Reviews , 2006, ACL.

[79]  Royi Ronen,et al.  Selecting content-based features for collaborative filtering recommenders , 2013, RecSys.

[80]  S. S. Culbert,et al.  Cognition and Categorization , 1979 .

[81]  Guillermo Ricardo Simari,et al.  Towards an argument interchange format , 2006, The Knowledge Engineering Review.

[82]  Paul Thagard,et al.  Evaluating Explanations in Law, Science, and Everyday Life , 2006 .

[83]  Janyce Wiebe,et al.  A Computational Theory of Perspective and Reference in Narrative , 1988, ACL.

[84]  Razvan C. Bunescu,et al.  Sentiment analyzer: extracting sentiments about a given topic using natural language processing techniques , 2003, Third IEEE International Conference on Data Mining.

[85]  Ellen Riloff,et al.  Exploiting Subjectivity Classification to Improve Information Extraction , 2005, AAAI.

[86]  Enric Plaza,et al.  Concept Discovery and Argument Bundles in the Experience Web , 2016, ICCBR.

[87]  Anne Kao,et al.  Natural Language Processing and Text Mining , 2006 .

[88]  Xue Li,et al.  Time weight collaborative filtering , 2005, CIKM '05.

[89]  Janyce Wiebe,et al.  Recognizing Contextual Polarity in Phrase-Level Sentiment Analysis , 2005, HLT.

[90]  Stephen Toulmin [Excerpt from "Human Understanding, Volume 1: General Introduction and Part I"] , 1972 .

[91]  Philip S. Yu,et al.  A holistic lexicon-based approach to opinion mining , 2008, WSDM '08.

[92]  Jimeng Sun,et al.  Temporal recommendation on graphs via long- and short-term preference fusion , 2010, KDD.

[93]  Oren Etzioni,et al.  Extracting Product Features and Opinions from Reviews , 2005, HLT.

[94]  Nicolas Nicolov,et al.  Targeting Sentiment Expressions through Supervised Ranking of Linguistic Configurations , 2009, ICWSM.

[95]  Andrea Esuli,et al.  SENTIWORDNET: A Publicly Available Lexical Resource for Opinion Mining , 2006, LREC.

[96]  Derek G. Bridge,et al.  The GhostWriter-2.0 Case-Based Reasoning system for making content suggestions to the authors of product reviews , 2012, Knowl. Based Syst..

[97]  Shi Bing,et al.  Inductive learning algorithms and representations for text categorization , 2006 .

[98]  Chun-hung Li,et al.  Semantic Dependent Word Pairs Generative Model for Fine-Grained Product Feature Mining , 2011, PAKDD.

[99]  Pablo Noriega,et al.  A Framework for Argumentation-Based Negotiation , 1997, ATAL.

[100]  Xiaoyan Zhu,et al.  Movie review mining and summarization , 2006, CIKM '06.

[101]  R. Kies,et al.  Online Forums and Deliberative Democracy , 2005 .

[102]  Andrea Esuli,et al.  SentiWordNet 3.0: An Enhanced Lexical Resource for Sentiment Analysis and Opinion Mining , 2010, LREC.

[103]  Suk Hwan Lim,et al.  Extracting and Ranking Product Features in Opinion Documents , 2010, COLING.

[104]  Michal Karpowicz,et al.  Opinion Mining on the Web 2.0 - Characteristics of User Generated Content and Their Impacts , 2013, CHI-KDD.

[105]  Dan Klein,et al.  Accurate Unlexicalized Parsing , 2003, ACL.

[106]  Bing Liu,et al.  Web Data Mining: Exploring Hyperlinks, Contents, and Usage Data , 2006, Data-Centric Systems and Applications.

[107]  Markus Schaal,et al.  Opinionated Product Recommendation , 2013, ICCBR.

[108]  Barry Smyth,et al.  Mining Features and Sentiment from Review Experiences , 2013, ICCBR.

[109]  Enric Plaza,et al.  Semantics and Experience in the Future Web , 2008, ECCBR.

[110]  Enric Plaza,et al.  Principle and Praxis in the Experience Web : A Case Study in Social Music , 2009 .

[111]  William E. Winkler,et al.  The State of Record Linkage and Current Research Problems , 1999 .

[112]  Maarten de Rijke,et al.  Identifying entity aspects in microblog posts , 2012, SIGIR '12.

[113]  Mary Dalrymple,et al.  The PARC 700 Dependency Bank , 2003, LINC@EACL.

[114]  Douglas Walton,et al.  Informal Logic: A Pragmatic Approach , 2008 .

[115]  Phan Minh Dung,et al.  On the Acceptability of Arguments and its Fundamental Role in Nonmonotonic Reasoning, Logic Programming and n-Person Games , 1995, Artif. Intell..

[116]  S. Kapur,et al.  The seats of reason? An imaging study of deductive and inductive reasoning , 1997, Neuroreport.

[117]  Sasha Blair-Goldensohn,et al.  Building a Sentiment Summarizer for Local Service Reviews , 2008 .

[118]  Kerstin Denecke,et al.  Using SentiWordNet for multilingual sentiment analysis , 2008, 2008 IEEE 24th International Conference on Data Engineering Workshop.

[119]  J. Keziya Rani,et al.  Mining Opinion Features in Customer Reviews. , 2016 .

[120]  Amanda Spink,et al.  Searching the Web: the public and their queries , 2001 .

[121]  Ivan Titov,et al.  A Joint Model of Text and Aspect Ratings for Sentiment Summarization , 2008, ACL.

[122]  Enric Plaza,et al.  On Reusing Other People's Experiences , 2009, Künstliche Intell..

[123]  Yuji Matsumoto,et al.  Opinion Mining on the Web by Extracting Subject-Aspect-Evaluation Relations , 2006, AAAI Spring Symposium: Computational Approaches to Analyzing Weblogs.

[124]  Shlomo Yitzhaki,et al.  Relative Deprivation and the Gini Coefficient , 1979 .

[125]  Christopher D. Manning,et al.  Analyzing the Dynamics of Research by Extracting Key Aspects of Scientific Papers , 2011, IJCNLP.

[126]  Meng Wang,et al.  Product Aspect Ranking and Its Applications , 2014, IEEE Transactions on Knowledge and Data Engineering.

[127]  Marcus A. Maloof,et al.  Dynamic weighted majority: a new ensemble method for tracking concept drift , 2003, Third IEEE International Conference on Data Mining.

[128]  David B. Dunson,et al.  Probabilistic topic models , 2011, KDD '11 Tutorials.

[129]  Guohong Fu,et al.  Chinese named entity recognition using lexicalized HMMs , 2005, SKDD.

[130]  Andrew McCallum,et al.  Rethinking LDA: Why Priors Matter , 2009, NIPS.

[131]  Daniel D. Suthers,et al.  Architectures for computer supported collaborative learning , 2001, Proceedings IEEE International Conference on Advanced Learning Technologies.

[132]  Barry Smyth,et al.  Many cases make light work for visualization in many eyes , 2009 .

[133]  Mário J. Silva,et al.  Clues for detecting irony in user-generated contents: oh...!! it's "so easy" ;-) , 2009, TSA@CIKM.