HOLACONF - Cloud Forward: From Distributed to Complete Computing Towards Cloud-Based Knowledge Capturing Based on Natural Language Processing

The organized capturing and sharing of knowledge is very important, and a lot of tools, such as wikis, social communities and knowledge-management or e-learning portals, exist for supporting this purpose. The community content- and knowledgecapturing, management and sharing portal of the European project “Realising an Applied Gaming Eco-system” (RAGE) † combines such tools. The goal of the RAGE project is to boost the collaborative knowledge asset management for software development in European applied gaming (AG) research and development (R&D). To support this process, the so-called RAGE ecosystem implements a portal to support the related asset, content and knowledge exchange between diverse actors in AG communities. Therefore, the community portal in RAGE is designed as a so-called ecosystem and is intended to provide its users different tools for the capturing, management, and sharing of knowledge. In this study, we rely on the term and model definition of spiraling knowledge exchange between explicit and tacit knowledge given by Nonaka and Takeuchi. 1 To achieve the goal of extracting, i.e., externalizing and explicitly representing and sharing this knowledge to its users, we propose to generate a taxonomy for faceted search automatically by extracting named entities form the knowledge sources and to classify documents using Support Vector Machines (SVM). In this paper we present our architectural approach for the NLP-based IR concepts and discuss how cloud services based on data distribution and cloud computing can improve the outcome of our system.

[1]  Hang Li,et al.  Named entity recognition in query , 2009, SIGIR.

[2]  Oren Etzioni,et al.  Named Entity Recognition in Tweets: An Experimental Study , 2011, EMNLP.

[3]  Satoshi Sekine,et al.  A survey of named entity recognition and classification , 2007 .

[4]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[5]  Thorsten Joachims,et al.  Text Categorization with Support Vector Machines: Learning with Many Relevant Features , 1998, ECML.

[6]  Abdulsalam Yassine,et al.  Cloud-based SVM for food categorization , 2015, Multimedia tools and applications.

[7]  Kalina Bontcheva,et al.  GATECloud.net: a platform for large-scale, open-source text processing on the cloud , 2013, Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences.

[8]  Tingting Mu,et al.  Supporting the education evidence portal via text mining , 2010, Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences.

[9]  Stefan Trausan-Matu,et al.  ReaderBench, an Environment for Analyzing Text Complexity and Reading Strategies , 2013, AIED.

[10]  W. Bruce Croft,et al.  Search Engines - Information Retrieval in Practice , 2009 .

[11]  I. Nonaka,et al.  How Japanese Companies Create the Dynamics of Innovation , 1995 .

[12]  Keith B. Hall,et al.  Improved video categorization from text metadata and user comments , 2011, SIGIR '11.

[13]  Hinrich Schütze,et al.  Introduction to information retrieval , 2008 .

[14]  P. Mell,et al.  The NIST Definition of Cloud Computing , 2011 .

[15]  Naoaki Okazaki,et al.  Kleio: a knowledge-enriched information retrieval system for biology , 2008, SIGIR '08.

[16]  Mihai Surdeanu,et al.  The Stanford CoreNLP Natural Language Processing Toolkit , 2014, ACL.

[17]  David A. Ferrucci,et al.  UIMA: an architectural approach to unstructured information processing in the corporate research environment , 2004, Natural Language Engineering.

[18]  Duc Binh Vu,et al.  Towards Social Media Platform Integration with an Applied Gaming Ecosystem , 2015 .

[19]  Bogdan Dit,et al.  Feature location in source code: a taxonomy and survey , 2013, J. Softw. Evol. Process..

[20]  Fabrizio Sebastiani,et al.  Machine learning in automated text categorization , 2001, CSUR.

[21]  Alexander Klenner,et al.  Large scale chemical patent mining with UIMA and UNICORE , 2012, Journal of Cheminformatics.

[22]  Ameet Talwalkar,et al.  Foundations of Machine Learning , 2012, Adaptive computation and machine learning.