Nearly-Automated Metadata Hierarchy Creation

Currently, information architects create metadata category hierarchies manually. We present a nearly-automated approach for deriving such hierarchies by converting the lexical hierarchy WordNet into a form that reflects the contents of a target information collection. We use the term "nearly-automated" because an information architect should need to make only small adjustments to produce an acceptable metadata structure. We contrast the results with those of an algorithm that uses lexical co-occurrence statistics.
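
The abstract does not spell out the conversion procedure, so the following is only a loose sketch of what adapting a lexical hierarchy to a collection might look like, not the authors' algorithm. It assumes a tiny hand-written hypernym map (`TOY_HYPERNYMS`, hypothetical, standing in for WordNet) and a hypothetical set of collection terms; it merges the hypernym paths of those terms into one tree and then collapses single-child chains so that only branching categories and the collection's own terms remain.

```python
from collections import defaultdict

# Hypothetical toy hypernym map (child -> parent), standing in for WordNet.
TOY_HYPERNYMS = {
    "poodle": "dog",
    "dog": "canine",
    "canine": "carnivore",
    "carnivore": "mammal",
    "cat": "feline",
    "feline": "carnivore",
    "mammal": "animal",
    "sparrow": "bird",
    "bird": "animal",
}


def hypernym_path(term, hypernyms):
    """Return the chain of categories from the root down to `term`."""
    path = [term]
    while path[-1] in hypernyms:
        path.append(hypernyms[path[-1]])
    return list(reversed(path))


def build_tree(terms, hypernyms):
    """Merge root-to-term paths into a parent -> set-of-children mapping."""
    children = defaultdict(set)
    for term in terms:
        path = hypernym_path(term, hypernyms)
        for parent, child in zip(path, path[1:]):
            children[parent].add(child)
    return children


def compress(node, children, keep):
    """Skip over nodes that have exactly one child, unless they are in `keep`."""
    kids = children.get(node, set())
    while len(kids) == 1 and node not in keep:
        node = next(iter(kids))
        kids = children.get(node, set())
    return {node: [compress(kid, children, keep) for kid in sorted(kids)]}


# Hypothetical terms found in a target collection.
collection_terms = {"poodle", "cat", "sparrow"}
tree = build_tree(collection_terms, TOY_HYPERNYMS)
hierarchy = compress("animal", tree, keep=collection_terms)
print(hierarchy)
```

In this sketch the intermediate levels "mammal", "canine", "dog", "feline", and "bird" disappear because each has a single child, leaving a shallower structure that an information architect could then adjust by hand.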
