DEODORANT: a novel approach for early detection and prevention of polycystic ovary syndrome using association rule in hypergraph with the dominating set property

Present online health discussion forums generate the bountiful amount of digitized data through health-blogs, posts, tweets, and chats in the social media. People post queries, health issues they undergo along with the symptoms, diagnosis, and clinical reports to get direction for preventive measures and medical relief. As a case study, this work focus on detection of Polycystic Ovary Syndrome, a prevalent condition that affects a woman’s hormone levels. This PCOS problem has been investigated as it forms high-risk factor for infertility, heart disease, diabetes, stroke and many such diseases. We propose a novel model named as DEODORANT (Detection and prEvention of polycystic Ovary synDrome using assOciation rule hypeRgrAph and domiNating set properTy) to derive prospective use of real-time mining data. The unstructured data collected from various media sources are preprocessed using NLTK and association rules are derived by applying apriori algorithm. These association rules are represented in hypergraph and then regenerated as line graph to make it suitable for cluster construction. Spectral clustering is performed on line graph to partition into clusters of hypergraphs. By applying dominating set property on the resultant hypergraph, required inferences can be elicited. From the experimental results, support value of the outcome derived from the dominating set of each cluster has exhibited the symptoms and the causes with percentage ranking. It is evident that they get aligned with precise result portraying real statistics. This type of analysis will empower doctors and health organizations to keep track of the diseases, their symptoms for early detection and safe recovery.

[1]  Philip S. Yu,et al.  Top 10 algorithms in data mining , 2007, Knowledge and Information Systems.

[2]  Tomasz Imielinski,et al.  Mining association rules between sets of items in large databases , 1993, SIGMOD Conference.

[3]  Jitendra Malik,et al.  Normalized Cuts and Image Segmentation , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[4]  Francisco Herrera,et al.  What do people think about this monument? Understanding negative reviews via deep learning, clustering and descriptive rules , 2020, J. Ambient Intell. Humaniz. Comput..

[5]  Rosane Minghim,et al.  Visual text mining using association rules , 2007, Comput. Graph..

[6]  Xiangbin Yan,et al.  Mining patient opinion to evaluate the service quality in healthcare: a deep-learning approach , 2020, J. Ambient Intell. Humaniz. Comput..

[7]  M. Anandhavalli,et al.  Association Rule Mining in Genomics , 2010 .

[8]  Vipin Kumar,et al.  Association Analysis Techniques for Bioinformatics Problems , 2009, BICoB.

[9]  Alain Bretto,et al.  Applications of Hypergraph Theory: A Brief Overview , 2013 .

[10]  Bernhard Schölkopf,et al.  Learning with Hypergraphs: Clustering, Classification, and Embedding , 2006, NIPS.

[11]  Pavel Berkhin,et al.  A Survey of Clustering Data Mining Techniques , 2006, Grouping Multidimensional Data.

[12]  Frédérique Segond,et al.  Eventfully Safapp: hybrid approach to event detection for social media mining , 2018, Journal of Ambient Intelligence and Humanized Computing.

[13]  Csilla Bujtás,et al.  Transversals and domination in uniform hypergraphs , 2012, Eur. J. Comb..

[14]  Manpreet Kaur,et al.  Market Basket Analysis: Identify the Changing Trends of Market Data Using Association Rule Mining , 2016 .

[15]  Azucena Montes Rendón,et al.  Sentence level matrix representation for document spectral clustering , 2017, Pattern Recognit. Lett..

[16]  Andreas Hotho,et al.  A Brief Survey of Text Mining , 2005, LDV Forum.

[17]  T. Velmurugan,et al.  A survey of association rule mining in text applications , 2013 .

[18]  Shishir Kumar,et al.  Effective surveillance and predictive mapping of mosquito-borne diseases using social media , 2017, J. Comput. Sci..

[19]  Ulrike von Luxburg,et al.  A tutorial on spectral clustering , 2007, Stat. Comput..

[20]  Jiawei Han,et al.  Data Mining: Concepts and Techniques , 2000 .

[21]  Enhua Hu,et al.  Data mining model for food safety incidents based on structural analysis and semantic similarity , 2020 .

[22]  Xiao Wang,et al.  Research on a New Automatic Generation Algorithm of Concept Map Based on Text Clustering and Association Rules Mining , 2018, ICIC.

[23]  S. M. Kamruzzaman,et al.  Text Classification using the Concept of Association Rule of Data Mining , 2010, ArXiv.

[24]  Zhongfei Zhang,et al.  Context-Aware Hypergraph Construction for Robust Spectral Clustering , 2014, 1401.0764.

[25]  Jenq-Neng Hwang,et al.  Association Rule Mining of Personal Hobbies in Social Networks , 2014, BigData Congress.

[26]  N. C. Chauhan,et al.  Interesting association rule mining with consistent and inconsistent rule detection from big sales data in distributed environment , 2017 .

[27]  Tat-Jun Chin,et al.  Clustering with Hypergraphs: The Case for Large Hyperedges , 2017, IEEE Trans. Pattern Anal. Mach. Intell..

[28]  Deepak Chowdary Edara,et al.  Sentiment analysis and text categorization of cancer medical records with LSTM , 2019, Journal of Ambient Intelligence and Humanized Computing.

[29]  Anton Borg,et al.  Finding Influential Users in Social Media Using Association Rule Learning , 2016, Entropy.

[30]  Vipin Kumar,et al.  Hypergraph Based Clustering in High-Dimensional Data Sets: A Summary of Results , 1998, IEEE Data Eng. Bull..

[31]  Hyun Yoo,et al.  Ambient context-based modeling for health risk assessment using deep neural network , 2018, J. Ambient Intell. Humaniz. Comput..

[32]  R. Janani,et al.  Text document clustering using Spectral Clustering algorithm with Particle Swarm Optimization , 2019, Expert Syst. Appl..