Evaluating Task-Dependent Taxonomies for Navigation

Taxonomies of concepts are important across many application domains; for instance, online shopping portals use catalogs to help users navigate and search for products. Task-dependent taxonomies, i.e., taxonomies adapted to a specific task such as serving a particular cohort of users, can greatly improve the effectiveness of navigation and search. However, taxonomies are usually created by domain experts, so designing task-dependent taxonomies can be an expensive process; this cost often restricts applications to deploying generic taxonomies. Crowdsourcing-based techniques have the potential to provide a cost-efficient alternative for building task-dependent taxonomies. In this paper, we present the first quantitative study evaluating the effectiveness of these crowdsourcing-based techniques. Our experimental study compares task-dependent taxonomies built via crowdsourcing against generic taxonomies built by experts. We design randomized behavioral experiments on the Amazon Mechanical Turk platform in which participants perform navigation tasks over these taxonomies, resembling real-world applications such as product search. We record metrics such as navigation time, the number of clicks performed, and the search path a participant takes through the taxonomy to locate a desired object. Our findings show that task-dependent taxonomies built with crowdsourcing techniques can reduce navigation time by up to $20\%$. Our results, in turn, demonstrate the power of crowdsourcing for learning complex structures such as semantic taxonomies.
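
To make the recorded metrics concrete, the sketch below shows how a single navigation trial over a taxonomy tree might be logged. This is a minimal illustration of the kind of instrumentation the study describes, not the authors' experimental harness; all names here (TaxonomyNode, NavigationLog, record_click) are hypothetical.

```python
# Hypothetical sketch of logging one navigation trial over a taxonomy.
# Not the paper's code; names and structure are assumptions for illustration.
import time
from dataclasses import dataclass, field


@dataclass
class TaxonomyNode:
    label: str
    children: list["TaxonomyNode"] = field(default_factory=list)


@dataclass
class NavigationLog:
    path: list[str] = field(default_factory=list)        # labels clicked, in order
    clicks: int = 0
    start: float = field(default_factory=time.monotonic)  # trial start time

    def record_click(self, node: TaxonomyNode) -> None:
        # One participant click: extend the search path and count it.
        self.path.append(node.label)
        self.clicks += 1

    @property
    def elapsed(self) -> float:
        # Navigation time in seconds; time.monotonic() avoids wall-clock
        # adjustments corrupting the measurement.
        return time.monotonic() - self.start


# Example trial: a participant descends a toy product taxonomy.
root = TaxonomyNode("products", [
    TaxonomyNode("electronics", [TaxonomyNode("cameras"), TaxonomyNode("phones")]),
    TaxonomyNode("clothing"),
])
log = NavigationLog()
log.record_click(root.children[0])               # click "electronics"
log.record_click(root.children[0].children[0])   # click "cameras" (target found)
print(log.path, log.clicks, f"{log.elapsed:.3f}s")
```

Comparing two taxonomies would then amount to running such trials for both and contrasting the distributions of elapsed time, click counts, and path lengths across participants.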
