Zero-Shot Learning and Detection of Teeth in Images of Bat Skulls

Biologists collect and analyze phenomic (e.g., anatomical or non-genomic) data to discover relationships among species in the Tree of Life. The domain is seeking to modernize this very time-consuming and largely manual process. We have developed an approach to detect and localize object parts in standardized images of bat skulls. This approach has been further developed for unannotated images by leveraging knowledge learned from a few annotated images. The key challenge is that the unlabeled images show bat skulls of "unknown" species that may have types, total numbers, and layouts of the teeth that differ from the "known" species appearing in the labeled images. Our method begins by matching the unlabeled images to the labeled ones. This allows a transfer of tooth annotations to the unlabeled images. We then learn a tree parts model on the transferred annotations, and apply this model to detect and label teeth in the unlabeled images. Our evaluation demonstrates good performance, which is close to our upper bound performance by the fully supervised model.

[1]  Thomas Hofmann,et al.  Support vector machine learning for interdependent and structured output spaces , 2004, ICML.

[2]  Nathan D. Ratliff Subgradient Methods for Structured Prediction , 2007 .

[3]  Seth Kaufman,et al.  MorphoBank: phylophenomics in the “cloud” , 2011, Cladistics : the international journal of the Willi Hennig Society.

[4]  Christoph H. Lampert,et al.  Learning to detect unseen object classes by between-class attribute transfer , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[5]  Antonio Torralba,et al.  Part and appearance sharing: Recursive Compositional Models for multi-view , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[6]  J. Andrew Bagnell,et al.  (Approximate) Subgradient Methods for Structured Prediction , 2007, International Conference on Artificial Intelligence and Statistics.

[7]  P. Lewis A likelihood approach to estimating phylogeny from discrete morphological character data. , 2001, Systematic biology.

[8]  C. Marshall Encyclopedia of Life , 2008 .

[9]  Antonio Torralba,et al.  Sharing features: efficient boosting procedures for multiclass object detection , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[10]  Joshua B. Tenenbaum,et al.  Learning to Learn with Compound HD Models , 2011, NIPS.

[11]  Andrea L. Cirranello,et al.  The Placental Mammal Ancestor and the Post–K-Pg Radiation of Placentals , 2013, Science.

[12]  Jin Meng,et al.  Response to Comment on “The Placental Mammal Ancestor and the Post–K-Pg Radiation of Placentals” , 2013, Science.

[13]  Yi Yang,et al.  Articulated pose estimation with flexible mixtures-of-parts , 2011, CVPR 2011.

[14]  J. Wiens The role of morphological data in phylogeny reconstruction. , 2004, Systematic biology.

[15]  Daniel P. Huttenlocher,et al.  Pictorial Structures for Object Recognition , 2004, International Journal of Computer Vision.

[16]  Ali Farhadi,et al.  Attribute-centric recognition for cross-category generalization , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[17]  Antonio Torralba,et al.  Nonparametric scene parsing: Label transfer via dense scene alignment , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[18]  Yang Wang,et al.  A Discriminative Latent Model of Object Classes and Attributes , 2010, ECCV.

[19]  Antonio Torralba,et al.  Transfer Learning by Borrowing Examples for Multiclass Object Detection , 2011, NIPS.

[20]  David A. McAllester,et al.  Object Detection with Discriminatively Trained Part Based Models , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.