Clinical Practice Guidelines (CPGs) are increasingly common in clinical medicine for prescribing a set of rules that a physician should follow. Recent interest is in accurate retrieval of CPGs at the point of care. Examples are the CPGs digital libraries National Guideline Clearinghouse (NGC) or Vaidurya, which are organized along predefined concept hierarchies. In this case, both browsing and concept-based search can be applied. However, mandatory step in enabling both ways to CPGs retrieval is manual classification of CPGs along the concepts hierarchy, which is extremely time consuming. Supervised learning approaches are usually not satisfying, since commonly too few or no CPGs are provided as training set for each class.
In this paper we apply TaxSOM for multiple classification. TaxSOM is an unsupervised model that supports the physician in the classification of CPGs along the concepts hierarchy, even when no labeled examples are available. This model exploits lexical and topological information on the hierarchy to elaborate a classification hypothesis for any given CPG. We argue that such a kind of unsupervised classification can support a physician to classify CPGs by recommending the most probable classes. An experimental evaluation on various concept hierarchies with hundreds of CPGs and categories provides the empirical evidence of the proposed technique.
[1]
Daniel A. Keim,et al.
On Knowledge Discovery and Data Mining
,
1997
.
[2]
Diego Sona,et al.
Clustering documents in a web directory
,
2003,
WIDM '03.
[3]
Pedro M. Domingos,et al.
Learning to Match the Schemas of Data Sources: A Multistrategy Approach
,
2003,
Machine Learning.
[4]
Susan T. Dumais,et al.
Hierarchical classification of Web content
,
2000,
SIGIR '00.
[5]
Yuval Shahar,et al.
Multiple hierarchical classification of free-text clinical guidelines
,
2006,
Artif. Intell. Medicine.
[6]
Yuval Shahar,et al.
Vaidurya - A Concept-Based, Context-Sensitive Search Engine For Clinical Guidelines
,
2004,
MedInfo.
[7]
Michelangelo Ceci,et al.
Hierarchical Classification of HTML Documents with WebClassII
,
2003,
ECIR.
[8]
Ee-Peng Lim,et al.
Hierarchical text classification and evaluation
,
2001,
Proceedings 2001 IEEE International Conference on Data Mining.
[9]
Jian Tang,et al.
Hierarchical Classification of Documents with Error Control
,
2001,
PAKDD.
[10]
Padmini Srinivasan,et al.
Hierarchical Text Categorization Using Neural Networks
,
2004,
Information Retrieval.
[11]
Diego Sona,et al.
Bootstrapping for hierarchical document classification
,
2003,
CIKM '03.