Analysis on Drug Dosage Form Name Based on N-gram Technique and Network Analysis
暂无分享,去创建一个
In this paper, we analyzed drug dosage form names. We created the network structure whose nodes are dosage form names. Its edges between dosage form names denote that they share some of sub-strings generated based on N-gram technique. We employed Simpson coefficient to define the weight of an edge. We proposed a new clustering method and applied it to the network. The results showed that “dosage forms” can be categorized based on not only physical form information but their application site,purpose,processing and so on.
[1] M E J Newman,et al. Modularity and community structure in networks. , 2006, Proceedings of the National Academy of Sciences of the United States of America.
[2] M. Newman,et al. Finding community structure in networks using the eigenvectors of matrices. , 2006, Physical review. E, Statistical, nonlinear, and soft matter physics.