A Semantic Layer on Semi-Structured Data Sources for Intuitive Chatbots

The main limits of chatbot technology are related to the building of their knowledge representation and to their rigid information retrieval and dialogue capabilities, usually based on simple "pattern matching rules". The analysis of distributional properties of words in a texts corpus allows the creation of semantic spaces where represent and compare natural language elements. This space can be interpreted as a "conceptual" space where the axes represent the latent primitive concepts of the analyzed corpus.The presented work aims at exploiting the properties of a data-driven semantic/conceptual space built using semistructured data sources freely available on the web, like Wikipedia. This coding is equivalent to adding, into the Wikipedia graph, a conceptual similarity relationship layer.The chatbot can exploit this layer in order to simulate an "intuitive" behavior, attempting to retrieve semantic relations between Wikipedia resources also through associative sub-symbolic paths.

[1]  Simone Paolo Ponzetto,et al.  Deriving a Large-Scale Taxonomy from Wikipedia , 2007, AAAI.

[2]  Jens Lehmann,et al.  What Have Innsbruck and Leipzig in Common? Extracting Semantics from Wiki Content , 2007, ESWC.

[3]  Gerhard Weikum,et al.  WWW 2007 / Track: Semantic Web Session: Ontologies ABSTRACT YAGO: A Core of Semantic Knowledge , 2022 .

[4]  Andrea Corradini,et al.  Developing a Conversational Agent Using Ontologies , 2007, HCI.

[5]  Mitsuru Ishizuka,et al.  Exploiting Syntactic and Semantic Information for Relation Extraction from Wikipedia , 2006 .

[6]  Martin Hepp,et al.  Harvesting Wiki Consensus - Using Wikipedia Entries as Ontology Elements , 2006, SemWiki.

[7]  Daniel S. Weld,et al.  Autonomously semantifying wikipedia , 2007, CIKM '07.

[8]  Markus Krötzsch,et al.  Semantic Wikipedia , 2006, WikiSym '06.

[9]  Sung-Bae Cho,et al.  A semantic Bayesian network approach to retrieving information with intelligent conversational agents , 2007, Inf. Process. Manag..

[10]  Richard S. Wallace,et al.  The Anatomy of A.L.I.C.E. , 2009 .

[11]  George A. Miller,et al.  Introduction to WordNet: An On-line Lexical Database , 1990 .

[12]  Patrick F. Reidy An Introduction to Latent Semantic Analysis , 2009 .

[13]  Lance Chun Che Fung,et al.  An Embodied Conversational Agent for Intelligent Web Interaction on Pandemic Crisis Communication , 2006, 2006 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology Workshops.

[14]  Jens Lehmann,et al.  DBpedia: A Nucleus for a Web of Open Data , 2007, ISWC/ASWC.

[15]  Giovanni Pilato,et al.  A Conversational Agent Based on a Conceptual Interpretation of a Data Driven Semantic Space , 2005, AI*IA.

[16]  Evgeniy Gabrilovich,et al.  Computing Semantic Relatedness Using Wikipedia-based Explicit Semantic Analysis , 2007, IJCAI.