Supporting Data Analytics Applications Which Utilize Cognitive Services

A wide variety of services are available over the Web which can dramatically improve the functionality of applications. These services include information retrieval (including data lookups from a variety of sources and Web searches), natural language understanding, visual recognition, and data storage. A key problem is how to provide support for applications which use these services. This paper presents a rich software development kit (SDK) which accesses these services and provides a variety of features applications need to use these services, optimize performance, and compare them. A key aspect of our SDK is its support for natural language understanding services. We also present a personalized knowledge base built on top of our rich SDK that uses publically available data sources as well as private information. The knowledge base supports data analysis and reasoning over data.

[1]  J. Carroll,et al.  Jena: implementing the semantic web recommendations , 2004, WWW Alt. '04.

[2]  Rajkumar Buyya,et al.  2011 Fourth IEEE International Conference on Utility and Cloud Computing SMICloud: A Framework for Comparing and Ranking Cloud Services , 2022 .

[3]  Roberto Navigli,et al.  Entity Linking meets Word Sense Disambiguation: a Unified Approach , 2014, TACL.

[4]  Doug Downey,et al.  Local and Global Algorithms for Disambiguation to Wikipedia , 2011, ACL.

[5]  Christian Bizer,et al.  DBpedia spotlight: shedding light on the web of documents , 2011, I-Semantics '11.

[6]  Dan Lin,et al.  A Brokerage-Based Approach for Cloud Service Selection , 2012, 2012 IEEE Fifth International Conference on Cloud Computing.

[7]  Ian H. Witten,et al.  Learning to link with wikipedia , 2008, CIKM '08.

[8]  Elizabeth Chang,et al.  Cloud service selection: State-of-the-art and future research directions , 2014, J. Netw. Comput. Appl..

[9]  Markus Krötzsch,et al.  Wikidata , 2014, Commun. ACM.

[10]  Lillian Lee,et al.  Opinion Mining and Sentiment Analysis , 2008, Found. Trends Inf. Retr..

[11]  Silviu Cucerzan,et al.  Large-Scale Named Entity Disambiguation Based on Wikipedia Data , 2007, EMNLP.

[12]  Razvan C. Bunescu,et al.  Using Encyclopedic Knowledge for Named entity Disambiguation , 2006, EACL.

[13]  Michael Granitzer,et al.  Robust and Collective Entity Disambiguation through Semantic Embeddings , 2016, SIGIR.

[14]  Arun Iyengar,et al.  Providing Enhanced Functionality for Data Store Clients , 2017, 2017 IEEE 33rd International Conference on Data Engineering (ICDE).

[15]  Rajkumar Buyya,et al.  A framework for ranking of cloud computing services , 2013, Future Gener. Comput. Syst..

[16]  Zibin Zheng,et al.  QoS Ranking Prediction for Cloud Services , 2013, IEEE Transactions on Parallel and Distributed Systems.

[17]  Jens Lehmann,et al.  DBpedia: A Nucleus for a Web of Open Data , 2007, ISWC/ASWC.

[18]  Marcelo Arenas,et al.  Semantics and Complexity of SPARQL , 2006, International Semantic Web Conference.

[19]  Deborah L. McGuinness,et al.  Owl web ontology language guide , 2003 .

[20]  Jigyasa Sharma,et al.  Microsoft Cognitive Services , 2018 .

[21]  Xianpei Han,et al.  Named entity disambiguation by leveraging wikipedia semantic knowledge , 2009, CIKM.

[22]  Gerhard Weikum,et al.  Robust Disambiguation of Named Entities in Text , 2011, EMNLP.

[23]  Mark Dredze,et al.  Entity Disambiguation for Knowledge Base Population , 2010, COLING.

[24]  Ismailcem Budak Arpinar,et al.  Ontology-Driven Automatic Entity Disambiguation in Unstructured Text , 2006, SEMWEB.

[25]  Praveen Paritosh,et al.  Freebase: a collaboratively created graph database for structuring human knowledge , 2008, SIGMOD Conference.

[26]  Sören Auer,et al.  AGDISTIS - Graph-Based Disambiguation of Named Entities Using Linked Data , 2014, International Semantic Web Conference.

[27]  Gerhard Weikum,et al.  WWW 2007 / Track: Semantic Web Session: Ontologies ABSTRACT YAGO: A Core of Semantic Knowledge , 2022 .

[28]  Mehmet A. Orgun,et al.  Cloud Service Selection Based on the Aggregation of User Feedback and Quantitative Performance Assessment , 2013, 2013 IEEE International Conference on Services Computing.

[29]  Elena García Barriocanal,et al.  On integrating learning object metadata inside the OpenCyc knowledge base , 2004, IEEE International Conference on Advanced Learning Technologies, 2004. Proceedings..

[30]  Zibin Zheng,et al.  CloudRank: A QoS-Driven Component Ranking Framework for Cloud Computing , 2010, 2010 29th IEEE Symposium on Reliable Distributed Systems.

[31]  Douglas B. Lenat,et al.  CYC: a large-scale investment in knowledge infrastructure , 1995, CACM.

[32]  Laks V. S. Lakshmanan,et al.  Proceedings of the 2008 ACM SIGMOD international conference on Management of data , 2008, SIGMOD 2008.

[33]  George A. Miller,et al.  WordNet: A Lexical Database for English , 1995, HLT.