Fuzzy clustering algorithms for identification of Exocarpium Citrus Grandis through chromatography

Chromatography has been extensively applied in identification and quality control of Chinese medicines (CMs). However, regular analytical methods are not suitable if labeled patterns or reference patterns are not available. Unsupervised and semi-supervised recognition approaches for chromatographic patterns, namely nonrandomized fuzzy C-Means clustering (FCM) with weighted principal components (NWPC-FCM) and partial supervised FCM with weighted PCs (PSWPC-FCM) are proposed in this work. The basic ideas of the proposed algorithms are as follows: PCs are extracted and weighted according to corresponding variances via principal component analysis to search for more complicated geometry of fuzzy clusters, then nonrandomized methodology and partial supervised clustering with seeds are employed, respectively, in NWPC-FCM and PSWPC-FCM to determine initial cluster centers for reliable cluster results. Satisfactory results were achieved with this method in identification of Exocarpium Citrus Grandis, a genuine herbal medicine of Guangdong Province. The presented algorithms improve cluster effectiveness and reliability significantly compared with standard FCM, PC-FCM, and two widely utilized clustering methods on chromatographic analysis. The research indicates the proposed algorithms exhibit functional applicability and interpretability for pattern recognition in chromatographic fingerprints of CMs in the presence of limited labeling or reference information.

[1]  Tao Li,et al.  Entropy-based criterion in categorical clustering , 2004, ICML.

[2]  Witold Pedrycz,et al.  Enhancement of fuzzy clustering by mechanisms of partial supervision , 2006, Fuzzy Sets Syst..

[3]  Hinrich Schütze,et al.  Introduction to information retrieval , 2008 .

[4]  Ren Fang Multiseed clustering algorithm based on max-min distance means , 2006 .

[5]  Rui Xu,et al.  Survey of clustering algorithms , 2005, IEEE Transactions on Neural Networks.

[6]  Yi-Zeng Liang,et al.  Pretreatments of chromatographic fingerprints for quality control of herbal medicines. , 2006, Journal of chromatography. A.

[7]  Yi-zeng Liang,et al.  Perspective of chemical fingerprinting of Chinese herbs. , 2010, Planta medica.

[8]  Alessandro Laio,et al.  Clustering by fast search and find of density peaks , 2014, Science.

[9]  Chen Shao-hong,et al.  Chinese Patent Medicines Containing Eighteen Incompatible Herbs or Nineteen Mutual Antagonistic Herbs in Pharmacopoeia of People's Republic of China(volume I,2010) , 2011 .

[10]  Christian Esposito,et al.  Smart Cloud Storage Service Selection Based on Fuzzy Logic, Theory of Evidence and Game Theory , 2016, IEEE Transactions on Computers.

[11]  Liu Bing Fuzzy clustering algorithm using two weighting methods , 2011 .

[12]  C. China Pharmacopoeia,et al.  Pharmacopoeia of the People's Republic of China , 2010 .

[13]  Witold Pedrycz,et al.  Kernel-based fuzzy clustering and fuzzy clustering: A comparative experimental study , 2010, Fuzzy Sets Syst..

[14]  Shaida Fariza Sulaiman,et al.  The application of pattern recognition techniques in metabolite fingerprinting of six different Phyllanthus spp. , 2011 .

[15]  Victor V. Toporkov,et al.  Heuristic strategies for preference-based scheduling in virtual organizations of utility grids , 2015, J. Ambient Intell. Humaniz. Comput..

[16]  Kuldip K. Paliwal,et al.  Feature extraction and dimensionality reduction algorithms and their applications in vowel recognition , 2003, Pattern Recognit..

[17]  Christian Esposito,et al.  Interconnecting Federated Clouds by Using Publish-Subscribe Service , 2013, Cluster Computing.

[18]  Arindam Banerjee,et al.  Semi-supervised Clustering by Seeding , 2002, ICML.

[19]  Chu-Sing Yang,et al.  A fast particle swarm optimization for clustering , 2015, Soft Comput..

[20]  Hu Xiaowei Identification of Fructus Schisandrae Chinensis and Fructus Schisandrae Sphenantherae by RP-HPLC and its Cluster Analysis and Discriminant Analysis , 2009 .

[21]  Pavel Berkhin,et al.  A Survey of Clustering Data Mining Techniques , 2006, Grouping Multidimensional Data.

[22]  Witold Pedrycz,et al.  Knowledge-based clustering - from data to information granules , 2007 .

[23]  Ni Yong-nian Application of chemical pattern recognition techniques in food quality control , 2009 .

[24]  Peerasak Intarapaiboon A hierarchy-based similarity measure for intuitionistic fuzzy sets , 2016, Soft Comput..

[25]  José Manuel Amigo,et al.  ChroMATHography: solving chromatographic issues with mathematical models and intuitive graphics. , 2010, Chemical reviews.

[26]  Wang Yi-min Comparison of different methods for evaluating the similarity of the fingerprints of traditional Chinese medicines , 2005 .

[27]  Yuan Zhang,et al.  [Research on the application of grey system theory in the pattern recognition for chromatographic fingerprints of traditional Chinese medicine]. , 2013, Se pu = Chinese journal of chromatography.

[28]  Xiao-ke Zheng,et al.  [Chemical pattern recognition for HPLC fingerprint analysis of Flos Lonicerae Japonicae in different collecting time]. , 2007, Zhong yao cai = Zhongyaocai = Journal of Chinese medicinal materials.

[29]  Dingying Tan,et al.  Research on the Similarity Algorithm of Chromatographic Fingerprint Based on Information Entropy , 2012, EMEIT 2012.

[30]  Witold Pedrycz,et al.  Distributed proximity-based granular clustering: towards a development of global structural relationships in data , 2015, Soft Comput..

[31]  Iacopo Carreras,et al.  An analysis of distance estimation to detect proximity in social interactions , 2013, Journal of Ambient Intelligence and Humanized Computing.

[32]  Nadziroh Nadziroh,et al.  KONSEP PEMBELAJARAN PKN DALAM MENANAMKAN PENDIDIKAN ANTI KORUPSI SEJAK DINI DISEKOLAH DASAR , 2017 .

[33]  Ameet Talwalkar,et al.  Foundations of Machine Learning , 2012, Adaptive computation and machine learning.

[34]  Zhi Xue-zhi Quality control system of overall qualitative similarities and overall quantitative similarities of chromatographic fingerprints , 2007 .