From Freebase to Wikidata: The Great Migration

Collaborative knowledge bases that make their data freely available in machine-readable form are central to the data strategy of many projects and organizations. The two major collaborative knowledge bases are Wikimedia's Wikidata and Google's Freebase. Due to the success of Wikidata, Google decided in 2014 to offer the content of Freebase to the Wikidata community. In this paper, we report on the ongoing transfer efforts and data mapping challenges, and provide an analysis of the effort so far. We describe the Primary Sources Tool, which aims to facilitate this and future data migrations. During the migration, we gained deep insights into both Wikidata and Freebase, and we share and discuss detailed statistics on both knowledge bases.
