Wisdom extraction in knowledge-based information systems

This paper aims to provide a number of distinct approaches towards this goal, i.e. to translate the information contained in the repositories into knowledge. For centuries, humans have gathered and generated data to study the different phenomena around them. Consequently, there are a variety of information repositories available in many different fields of study. However, the ability to access, integrate and properly interpret the relevant data sets in these repositories has mainly been limited by their ever expanding volumes. The goal of translating the available data to knowledge, eventually leading to wisdom, requires an understanding of the relations, ordering and associations among the data sets.,While the existing information repositories are rich in content, there are no easy means of understanding the relevance or influence of the different facts contained therein. Therefore, the interest of the general populace in terms of prioritizing some data items (or facts) over others is usually lost. In this paper, the goal is to provide approaches for transforming the available facts in the information repositories to wisdom. The authors target the lack of order in the facts presented in the repositories to create a hierarchical distribution based on the common understanding, expectations, opinions and judgments of the different users.,The authors present multiple approaches to extract and order the facts related to each concept, using both automatic and semi-automatic methods. The experiments show that the results of these approaches are similar and very close to the instinctive ordering of facts by users.,The authors believe that the work presented in this paper, with some additions, can be a feasible step to convert the available knowledge to wisdom and a step towards the future of online information systems.

[1]  Alessandro Moschitti,et al.  UNITN: Training Deep Convolutional Neural Network for Twitter Sentiment Classification , 2015, *SEMEVAL.

[2]  Kôiti Hasida,et al.  POLYPHONET: An advanced social network extraction system from the Web , 2007, J. Web Semant..

[3]  Marti A. Hearst Automatic Acquisition of Hyponyms from Large Text Corpora , 1992, COLING.

[4]  A. Mehri,et al.  Keyword extraction by nonextensivity measure. , 2011, Physical review. E, Statistical, nonlinear, and soft matter physics.

[5]  W. Bruce Croft,et al.  Deriving concept hierarchies from text , 1999, SIGIR '99.

[6]  Abraham Kandel,et al.  DegExt - A Language-Independent Graph-Based Keyphrase Extractor , 2011, AWIC.

[7]  George A. Miller,et al.  WordNet: A Lexical Database for English , 1995, HLT.

[8]  Peter D. Turney Learning Algorithms for Keyphrase Extraction , 2000, Information Retrieval.

[9]  Gerhard Weikum,et al.  WebChild: harvesting and organizing commonsense knowledge from the web , 2014, WSDM.

[10]  Yarden Katz,et al.  Pellet: A practical OWL-DL reasoner , 2007, J. Web Semant..

[11]  Peter W. Foltz,et al.  An introduction to latent semantic analysis , 1998 .

[12]  Catherine Havasi,et al.  ConceptNet 5: A Large Semantic Network for Relational Knowledge , 2013, The People's Web Meets NLP.

[13]  Ken Barker,et al.  Using Noun Phrase Heads to Extract Document Keyphrases , 2000, Canadian Conference on AI.

[14]  Habib Ullah Khan,et al.  ConceptOnto: An upper ontology based on ConceptNet , 2014, 2014 IEEE/ACS 11th International Conference on Computer Systems and Applications (AICCSA).

[15]  Terje Aven,et al.  A conceptual framework for linking risk and the elements of the data-information-knowledge-wisdom (DIKW) hierarchy , 2013, Reliab. Eng. Syst. Saf..

[16]  Luis von Ahn Games with a Purpose , 2006, Computer.

[17]  Adam Kilgarriff Googleology is Bad Science , 2007, Computational Linguistics.

[18]  F. Ren,et al.  Multilingual single document keyword extraction for information retrieval , 2005, 2005 International Conference on Natural Language Processing and Knowledge Engineering.

[19]  Jennifer E. Rowley,et al.  The wisdom hierarchy: representations of the DIKW hierarchy , 2007, J. Inf. Sci..

[20]  John Langford,et al.  Telling humans and computers apart automatically , 2004, CACM.

[21]  Haixun Wang,et al.  Probase: a probabilistic taxonomy for text understanding , 2012, SIGMOD Conference.

[22]  Leora Morgenstern,et al.  The Winograd Schema Challenge: Evaluating Progress in Commonsense Reasoning , 2015, AAAI.

[23]  Laura A. Dabbish,et al.  Labeling images with a computer game , 2004, AAAI Spring Symposium: Knowledge Collection from Volunteer Contributors.

[24]  Michael L. Littman,et al.  Measuring praise and criticism: Inference of semantic orientation from association , 2003, TOIS.

[25]  Satoshi Sekine,et al.  A survey of named entity recognition and classification , 2007 .

[26]  Doug Downey,et al.  Web-scale information extraction in knowitall: (preliminary results) , 2004, WWW '04.

[27]  Pedro A. Pury,et al.  Statistical keyword detection in literary corpora , 2007, ArXiv.

[28]  Susan Bull,et al.  Bringing chatbots into education: Towards natural language negotiation of open learner models , 2006, Knowl. Based Syst..

[29]  Xiaohua Hu,et al.  The Evaluation of Sentence Similarity Measures , 2008, DaWaK.

[30]  A. M. Turing,et al.  Computing Machinery and Intelligence , 1950, The Philosophy of Artificial Intelligence.

[31]  Manuel Blum,et al.  Peekaboom: a game for locating objects in images , 2006, CHI.

[32]  Preslav Nakov,et al.  SemEval-2015 Task 10: Sentiment Analysis in Twitter , 2015, *SEMEVAL.

[33]  Ramanathan V. Guha,et al.  Cyc: toward programs with common sense , 1990, CACM.

[34]  Preslav Nakov,et al.  SemEval-2014 Task 9: Sentiment Analysis in Twitter , 2014, *SEMEVAL.

[35]  Saif Mohammad,et al.  NRC-Canada-2014: Recent Improvements in the Sentiment Analysis of Tweets , 2014, SemEval@COLING.

[36]  Preslav Nakov,et al.  SemEval-2016 Task 4: Sentiment Analysis in Twitter , 2016, *SEMEVAL.

[37]  Haixun Wang,et al.  Automatic taxonomy construction from keywords , 2012, KDD.

[38]  Jens Lehmann,et al.  DBpedia: A Nucleus for a Web of Open Data , 2007, ISWC/ASWC.

[39]  Hugo Liu,et al.  ConceptNet — A Practical Commonsense Reasoning Tool-Kit , 2004 .

[40]  Manuel Blum,et al.  Verbosity: a game for collecting common-sense facts , 2006, CHI.

[41]  Djoerd Hiemstra,et al.  A probabilistic justification for using tf×idf term weighting in information retrieval , 2000, International Journal on Digital Libraries.

[42]  Anette Hulth,et al.  Improved Automatic Keyword Extraction Given More Linguistic Knowledge , 2003, EMNLP.

[43]  James Mayfield,et al.  Character N-Gram Tokenization for European Language Text Retrieval , 2004, Information Retrieval.

[44]  Francesco Piazza,et al.  Towards an RDF Encoding of ConceptNet , 2011, ISNN.

[45]  Arkady B. Zaslavsky,et al.  Context Aware Computing for The Internet of Things: A Survey , 2013, IEEE Communications Surveys & Tutorials.

[46]  Erik T. Mueller,et al.  Open Mind Common Sense: Knowledge Acquisition from the General Public , 2002, OTM.

[47]  Paul Buitelaar,et al.  SemEval-2015 Task 17: Taxonomy Extraction Evaluation (TExEval) , 2015, SemEval@NAACL-HLT.

[48]  Alessandro Moschitti,et al.  Twitter Sentiment Analysis with Deep Convolutional Neural Networks , 2015, SIGIR.

[49]  Kevin Warwick,et al.  Good Machine Performance in Turing's Imitation Game , 2014, IEEE Transactions on Computational Intelligence and AI in Games.

[50]  Christian Wartena,et al.  Thesaurus Based Term Ranking for Keyword Extraction , 2010, 2010 Workshops on Database and Expert Systems Applications.

[51]  P. Carpena,et al.  Level statistics of words: finding keywords in literary texts and symbolic sequences. , 2009, Physical review. E, Statistical, nonlinear, and soft matter physics.

[52]  Clement T. Yu,et al.  A theory of term importance in automatic text analysis , 1974, J. Am. Soc. Inf. Sci..

[53]  Bill Serra,et al.  People, Places, Things: Web Presence for the Real World , 2002, Mob. Networks Appl..

[54]  Paul Tarau,et al.  Knowledge-based conversational agents and virtual storytelling , 2004, SAC '04.

[55]  Timothy Baldwin,et al.  Automatic keyphrase extraction from scientific articles , 2013, Lang. Resour. Evaluation.

[56]  Martin Hepp,et al.  Games with a Purpose for the Semantic Web , 2008, IEEE Intelligent Systems.

[57]  Kenneth Ward Church,et al.  Word Association Norms, Mutual Information, and Lexicography , 1989, ACL.

[58]  W. Bruce Croft,et al.  Similarity measures for tracking information flow , 2005, CIKM '05.

[59]  Ning Zhong,et al.  In Search of the Wisdom Web , 2002, Computer.

[60]  Owen Conlan,et al.  The Challenge of Content Creation to Facilitate Personalized E-Learning Experiences , 2006 .

[61]  Praveen Paritosh,et al.  Freebase: a collaboratively created graph database for structuring human knowledge , 2008, SIGMOD Conference.