Towards Managing Knowledge Collection from Volunteer Contributors

A new generation of intelligent applications can be enabled by broad-coverage, up-to-date repositories of knowledge. One emerging approach to constructing such repositories is proactive knowledge collection from volunteer contributors. In this paper, we study the quality of the knowledge repository resulting from collecting spontaneous, little guided contributions of volunteers. In a representative collection of part-of information contributed by volunteers, we study the coverage and quality of the resulting collection. As a possible way to address the deficiencies, we outline a more managed, three-stage approach to the collection process, consisting of collection, evaluation & revision, and publication.

[1]  Douglas Herrmann,et al.  A Taxonomy of Part-Whole Relations , 1987, Cogn. Sci..

[2]  George A. Miller,et al.  Nouns in WordNet: A Lexical Inheritance System , 1990 .

[3]  Marti A. Hearst Automatic Acquisition of Hyponyms from Large Text Corpora , 1992, COLING.

[4]  Douglas B. Lenat,et al.  CYC: a large-scale investment in knowledge infrastructure , 1995, CACM.

[5]  Ellen Riloff,et al.  Learning Dictionaries for Information Extraction by Multi-Level Bootstrapping , 1999, AAAI/IAAI.

[6]  Eugene Charniak,et al.  Finding Parts in Very Large Corpora , 1999, ACL.

[7]  Erik T. Mueller,et al.  Open Mind Common Sense: Knowledge Acquisition from the General Public , 2002, OTM.

[8]  Lenhart K. Schubert Can we derive general world knowledge from texts , 2002 .

[9]  Nicola Guarino,et al.  Sweetening WORDNET with DOLCE , 2003, AI Mag..

[10]  Dan I. Moldovan,et al.  Learning Semantic Constraints for the Automatic Discovery of Part-Whole Relations , 2003, NAACL.

[11]  Timothy Chklovski,et al.  Learner: a system for acquiring commonsense knowledge by analogy , 2003, K-CAP '03.

[12]  Matthew Richardson,et al.  Building large knowledge bases by mass collaboration , 2003, K-CAP '03.

[13]  Timothy Chklovski,et al.  Using analogy to acquire commonsense knowledge from human Contributors , 2003 .

[14]  Rada Mihalcea,et al.  Building sense tagged corpora with volunteer contributions over the Web , 2003, RANLP.

[15]  David G. Stork,et al.  Evaluating Classifiers by Means of Test Data with Noisy Labels , 2003, IJCAI.

[16]  Henry Lieberman,et al.  Beating Common Sense into Interactive Applications , 2004, AI Mag..

[17]  P. Pantel,et al.  Path Analysis for Refining Verb Relations , 2004 .

[18]  Doug Downey,et al.  Methods for Domain-Independent Information Extraction from the Web: An Experimental Comparison , 2004, AAAI.

[19]  Rakesh Gupta,et al.  Common Sense Data Acquisition for Indoor Mobile Robots , 2004, AAAI.

[20]  Jon Curtis,et al.  Representing Knowledge Gaps Effectively , 2004, PAKM.

[21]  Steffen Staab,et al.  Project Halo: Towards a Digital Aristotle , 2004, AI Mag..

[22]  Timothy Chklovski,et al.  Designing interfaces for guided collection of knowledge about everyday objects from volunteers , 2005, IUI.