论文信息 - Knowledge Base Construction in the Machine-learning Era

Knowledge Base Construction in the Machine-learning Era

More information is accessible today than at any other time in human history. From a software perspective, however, the vast majority of this data is unusable, as it is locked away in unstructured formats such as text, PDFs, web pages, images, and other hard-to-parse formats. The goal of knowledge base construction is to extract structured information automatically from this "dark data," so that it can be used in downstream applications for search, question-answering, link prediction, visualization, modeling and much more. Today, knowledge bases are the central components of systems that help fight human trafficking, accelerate biomedical discovery, and, increasingly, power web-search and question-answering technologies.

Christopher Ré | Alexander Ratner | C. Ré | Alexander J. Ratner

[1] Christopher Ré,et al. Large-scale extraction of gene interactions from full-text literature using DeepDive , 2015, Bioinform..

[2] Christopher De Sa,et al. DeepDive: Declarative Knowledge Base Construction , 2016, SGMD.

[3] Sebastian Ruder,et al. An Overview of Multi-Task Learning in Deep Neural Networks , 2017, ArXiv.

[4] Gideon S. Mann,et al. Generalized Expectation Criteria for Semi-Supervised Learning with Weakly Labeled Data , 2010, J. Mach. Learn. Res..

[5] Christopher Ré,et al. Extracting Databases from Dark Data with DeepDive , 2016, SIGMOD Conference.

[6] Jeffrey Dean,et al. Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.

[7] Jens Lehmann,et al. DBpedia - A large-scale, multilingual knowledge base extracted from Wikipedia , 2015, Semantic Web.

[8] Douwe Kiela,et al. Poincaré Embeddings for Learning Hierarchical Representations , 2017, NIPS.

[9] Fabian M. Suchanek,et al. YAGO3: A Knowledge Base from Multilingual Wikipedias , 2015, CIDR.

[10] Rich Caruana,et al. Multitask Learning: A Knowledge-Based Source of Inductive Bias , 1993, ICML.

[11] Jure Leskovec,et al. node2vec: Scalable Feature Learning for Networks , 2016, KDD.

[12] Daniel Jurafsky,et al. Distant supervision for relation extraction without labeled data , 2009, ACL.

[13] Doug Downey,et al. KnowItNow: Fast, Scalable Information Extraction from the Web , 2005, HLT.

[14] Razvan C. Bunescu,et al. Learning to Extract Relations from the Web using Minimal Supervision , 2007, ACL.

[15] Wei Zhang,et al. Knowledge vault: a web-scale approach to probabilistic knowledge fusion , 2014, KDD.

[16] Heng Ji,et al. Label Noise Reduction in Entity Typing by Heterogeneous Partial-Label Embedding , 2016, KDD.

[17] Luke S. Zettlemoyer,et al. Knowledge-Based Weak Supervision for Information Extraction of Overlapping Relations , 2011, ACL.