Previous work in hierarchical categorization focuses on the hierarchical perceptron (Hieron) algorithm. Hierarchical perceptron works on the principles of the perceptron,that is each class label in the hierarchy has an associated weight vector. To account for the hierarchy, we begin at the root of the tree and sum all weights to the target label.We make a prediction by considering the label that yields the maximum inner product of its feature set with its path-summed weights. Learning is done by adjusting the weights along the path from the predicted node to the correct node by a specific loss function that adheres to the large margin principal. There are several problems with applying this approach to a multiple class problem. In many cases we could end up punishing weights that gave a correct prediction, because the algorithm can only take a single case at a time. In this paper we present an extended hierarchical perceptron algorithm capable of solving the multiple categorization problem (MultiHieron). We introduce new aggregate loss function for multiple label learning. We make weight updates simultaneously instead of serially. Then, significant improvement over the basic Hieron algorithm is demonstrated on the aviation safety reporting system (ASRS) flight anomaly database and OntoNews corpus using both flat and hierarchical categorization metrics.
[1]
Yoram Singer,et al.
Large margin hierarchical classification
,
2004,
ICML.
[2]
Diana Maynard,et al.
Metrics for Evaluation of Ontology-based Information Extraction
,
2006,
EON@WWW.
[3]
Kalina Bontcheva,et al.
Hierarchical, perceptron-like learning for ontology-based information extraction
,
2007,
WWW '07.
[4]
Ramanathan V. Guha,et al.
SemTag and seeker: bootstrapping the semantic web via automated semantic annotation
,
2003,
WWW '03.
[5]
Nicola Guarino,et al.
OntoSeek: content-based access to the Web
,
1999,
IEEE Intell. Syst..