SoftKG: Building A Software Development Knowledge Graph through Wikipedia Taxonomy

At present, software development is an important way to make our life more convenient and intelligent. With the development of software programming, we have accumulated a lot of expert experience and common sense of domain knowledge. How to effectively organize and reuse these high-quality knowledge has become an urgent problem because reusing high-quality expert knowledge can greatly improve the efficiency of solving problems, especially for the novices of programming. When we encounter the programming problems, the usual solutions to get the answers is to query the search engines or consult the senior developers. However, these solutions have the following limitations: 1) the information in the field of software development is relatively scattered, for example, the demanded information is distributed on different websites. The developers need to query the search engine several times to get the information that they want, which is unfriendly, especially for the novices; 2) there is no such a unified organization form for the information we retrieve, and we need to process it further to get the answers, which is inefficient. To address the above limitations, we propose to build a software development knowledge graph (SoftKG) through Wikipedia taxonomy. Specifically, we propose a framework to build SoftKG based on open source knowledge communities.

[1]  Sanda M. Harabagiu,et al.  Open-domain textual question answering techniques , 2003, Natural Language Engineering.

[2]  Jens Lehmann,et al.  DBpedia: A Nucleus for a Web of Open Data , 2007, ISWC/ASWC.

[3]  Praveen Paritosh,et al.  Freebase: a collaboratively created graph database for structuring human knowledge , 2008, SIGMOD Conference.

[4]  Gopinath Ganapathy,et al.  To Generate the Ontology from Java Source Code , 2011 .

[5]  Gerhard Friedrich,et al.  A Taxonomy for Generating Explanations in Recommender Systems , 2011, AI Mag..

[6]  Denny Vrandecic,et al.  Wikidata: a new platform for collaborative data collection , 2012, WWW.

[7]  Andrew Chou,et al.  Semantic Parsing on Freebase from Question-Answer Pairs , 2013, EMNLP.

[8]  Hannah Bast,et al.  More Accurate Question Answering on Freebase , 2015, CIKM.

[9]  Haixun Wang,et al.  Understand Short Texts by Harvesting and Analyzing Semantic Knowledge , 2017, IEEE Transactions on Knowledge and Data Engineering.

[10]  Gang Yin,et al.  Intelligent Development Environment and Software Knowledge Graph , 2017, Journal of Computer Science and Technology.

[11]  Maurizio Atzori,et al.  CodeOntology: RDF-ization of Source Code , 2017, International Semantic Web Conference.

[12]  Ming Zhou,et al.  Dialog-to-Action: Conversational Question Answering Over a Large-Scale Knowledge Base , 2018, NeurIPS.

[13]  Ruslan Salakhutdinov,et al.  Open Domain Question Answering Using Early Fusion of Knowledge Bases and Text , 2018, EMNLP.

[14]  Alessandro Bozzon,et al.  Recurrent knowledge graph embedding for effective recommendation , 2018, RecSys.

[15]  Gerhard Weikum,et al.  Never-Ending Learning for Open-Domain Question Answering over Knowledge Bases , 2018, WWW.

[16]  Mitesh M. Khapra,et al.  Complex Sequential Question Answering: Towards Learning to Converse Over Linked Question Answer Pairs with a Knowledge Graph , 2018, AAAI.

[17]  Jens Lehmann,et al.  Pretrained Transformers for Simple Question Answering over Knowledge Graphs , 2019, SEMWEB.

[18]  Xiaoli Tang,et al.  Unifying Task-Oriented Knowledge Graph Learning and Recommendation , 2019, IEEE Access.

[19]  Min Wang,et al.  Searching Software Knowledge Graph with Question , 2019, ICSR.

[20]  Anthony K. H. Tung,et al.  An Efficient Parallel Keyword Search Engine on Knowledge Graphs , 2019, 2019 IEEE 35th International Conference on Data Engineering (ICDE).

[21]  Yixin Cao,et al.  KGAT: Knowledge Graph Attention Network for Recommendation , 2019, KDD.

[22]  Evgeny Kharlamov,et al.  Keyword Search over Knowledge Graphs via Static and Dynamic Hub Labelings , 2020, WWW.

[23]  Nan Duan,et al.  Graph-Based Reasoning over Heterogeneous External Knowledge for Commonsense Question Answering , 2019, AAAI.

[24]  Yiqun Liu,et al.  Jointly Non-Sampling Learning for Knowledge Graph Enhanced Recommendation , 2020, SIGIR.

[25]  Alan Ritter,et al.  Code and Named Entity Recognition in StackOverflow , 2020, ACL.

[26]  Xu Wang,et al.  HGMAN: Multi-Hop and Multi-Answer Question Answering Based on Heterogeneous Knowledge Graph (Student Abstract) , 2020, AAAI.

[27]  Pierre Maret,et al.  QAnswer KG: Designing a Portable Question Answering System over RDF Data , 2020, ESWC.