Clustering Analysis for Vasculitic Diseases

In this paper we apply knowledge discovery to vasculitic diseases. Vasculitic diseases affect various organs and tissues, and diagnosing them can be quite difficult. The biomedical literature can contain hidden knowledge that is useful for biomedical research, so we carry out a study based on co-occurrence analysis of articles in MEDLINE, a widely used database. The most common vasculitic diseases are selected in order to explore hidden patterns. We use the PolySearch system, a web-based biomedical text mining tool, to find the organs and tissues mentioned in the articles, and we create two separate datasets, one for organs and one for tissues, containing their frequencies for each disease. After forming these datasets, we apply hierarchical clustering analysis to them, which reveals some similarities between the diseases. We think that the resulting clusters can benefit medical research on vasculitic diseases, especially at the diagnosis stage, and that certain similarities can offer medical specialists different perspectives.
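The clustering step described above can be illustrated with a minimal sketch. It assumes each disease is represented by a vector of organ co-occurrence frequencies and groups the diseases by agglomerative clustering. The disease names, organ list, and counts are hypothetical toy values, not data from the study, and the cosine distance and average linkage are assumptions, since the abstract does not state which distance or linkage was used.

```python
from itertools import combinations
from math import sqrt

# Hypothetical toy counts (NOT data from the study): how often each organ
# term co-occurs with each disease in the retrieved abstracts.
ORGANS = ["kidney", "lung", "skin", "artery"]
COUNTS = {
    "disease_A": [40, 35, 5, 2],
    "disease_B": [38, 30, 8, 1],
    "disease_C": [3, 2, 20, 45],
    "disease_D": [1, 4, 25, 50],
}


def cosine_distance(u, v):
    """Return 1 - cosine similarity, so similar frequency profiles are close."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = sqrt(sum(a * a for a in u))
    norm_v = sqrt(sum(b * b for b in v))
    return 1.0 - dot / (norm_u * norm_v)


def average_linkage(cluster_1, cluster_2, data):
    """Mean pairwise distance between the members of two clusters."""
    dists = [
        cosine_distance(data[a], data[b])
        for a in cluster_1
        for b in cluster_2
    ]
    return sum(dists) / len(dists)


def agglomerate(data):
    """Repeatedly merge the two closest clusters; return the merge history."""
    clusters = [(name,) for name in sorted(data)]
    merges = []
    while len(clusters) > 1:
        # Pick the pair of clusters with the smallest average-linkage distance.
        left, right = min(
            combinations(clusters, 2),
            key=lambda pair: average_linkage(pair[0], pair[1], data),
        )
        dist = average_linkage(left, right, data)
        clusters = [c for c in clusters if c not in (left, right)]
        clusters.append(left + right)
        merges.append((left, right, dist))
    return merges


merges = agglomerate(COUNTS)
for left, right, dist in merges:
    print(f"merge {left} + {right} at distance {dist:.4f}")
```

Reading the merge history from first to last gives the same information as a dendrogram: diseases that merge early at a small distance have similar organ profiles. Applying the same procedure to a second table of tissue frequencies would give the tissue-based view.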
