Classification and Indexing of Web Content Based on a Model of Semantic Social Bookmarking

One of the key challenges in Information Technology is finding a way to organize the knowledge present on the Web. This led to years of research on the integration of information, on the Semantic Web and related technologies. Information Search and Retrieval from the Web occur through a process of content disambiguation and search engines use algorithms and software agents in order to meet the needs of users and advertising buyers. Ex-post analytical agents and tools are becoming more pervasive, so much to cause increasing problems of privacy. Our work proposes an innovative approach of content disambiguation that overturns the ex-post semantic analysis of contents, because it deals with an ex-ante classification conducted on two axes: vertical one (hierarchical and taxonomic axis) and horizontal one (folksonomic axis through tags or keywords). This method, which is based on the logic of social bookmarking and focuses on semantic tagging, represents a new frontier in information architecture because it introduces a new way of classification made by people using keywords that have a specific lexical and semantic value. This approach will allow people to create a knowledge base of Web contents characterized by a precise semantic

[1]  Gerhard Weikum,et al.  YAGO: A Large Ontology from Wikipedia and WordNet , 2008, J. Web Semant..

[2]  Robert Meersman,et al.  From Folksologies to Ontologies: How the Twain Meet , 2006, OTM Conferences.

[3]  Michele Marchesi,et al.  A machine learning approach for text categorization of fixing-issue commits on CVS , 2010, ESEM '10.

[4]  Giulio Concas,et al.  An Approach to Manage the Web Knowledge , 2013, KEOD.

[5]  Jens Lehmann,et al.  DBpedia - A large-scale, multilingual knowledge base extracted from Wikipedia , 2015, Semantic Web.

[6]  Jens Lehmann,et al.  DBpedia and the live extraction of structured data from Wikipedia , 2012, Program.

[7]  Eneko Agirre,et al.  Personalizing PageRank for Word Sense Disambiguation , 2009, EACL.

[8]  Eneko Agirre,et al.  A Study on Linking Wikipedia Categories to Wordnet Synsets using Text Similarity , 2009, RANLP.

[9]  Simone Paolo Ponzetto,et al.  BabelNet: Building a Very Large Multilingual Semantic Network , 2010, ACL.

[10]  Sandro Pinna,et al.  Open Source Communities as Social Networks: An Analysis of Some Peculiar Characteristics , 2008 .

[11]  Giulio Concas,et al.  Using a Standards-based Approach for a Multimedia Knowledge-base , 2011, KMIS.

[12]  Dan Brickley,et al.  Resource description framework (RDF) schema specification , 1998 .

[13]  Giulio Concas,et al.  Knowledge Formalization and Management in KMS , 2012, KMIS.

[14]  Giulio Concas,et al.  Approach to Manage Semantic Informations from UGC , 2011, KEOD.

[15]  Hwee Tou Ng,et al.  Word Sense Disambiguation Using OntoNotes: An Empirical Study , 2008, EMNLP.

[16]  Michael Skinner,et al.  Information arbitrage across multi-lingual Wikipedia , 2009, WSDM '09.

[17]  Simone Paolo Ponzetto,et al.  BabelNet: The automatic construction, evaluation and application of a wide-coverage multilingual semantic network , 2012, Artif. Intell..

[18]  George A. Miller,et al.  WordNet: A Lexical Database for English , 1995, HLT.

[19]  Oscar Corcho,et al.  Preliminary Results in Tag Disambiguation using DBpedia , 2009 .

[20]  Alexandre Passant,et al.  Meaning Of A Tag: A collaborative approach to bridge the gap between tagging and Linked Data , 2008, LDOW.

[21]  Christiane Fellbaum,et al.  Book Reviews: WordNet: An Electronic Lexical Database , 1999, CL.

[22]  David R. Karger,et al.  Piggy Bank: Experience the Semantic Web Inside Your Web Browser , 2005, International Semantic Web Conference.