Granular Mining and Big Data Analytics: Rough Models and Challenges

Data analytics in granular computing framework is considered for several mining applications, such as in video analysis, bioinformatics and online social networks which have all the characteristics of Big data. The role of granulation, lower approximation and r – f information measure is exhibited. While the lower approximation over a video sequence signifies the object model for unsupervised tracking, it characterizes the probability (relative frequency) of definite regions in ranking miRNAs for normal and cancer classification. For neural learning, the information on definite region is used as the initial knowledge for encoding while generating the networks through evolution. Granules considered are of different sizes and dimensions with fuzzy and crisp boundaries. The tracking method is effective in handling different ambiguous situations, e.g., overlapping objects, newly appeared object(s), multiple objects in different directions and speeds, in unsupervised mode. The ranking algorithm could find only 1% miRNAs to result in significantly higher F-score than the entire set. Fuzzy–rough communities detected over the granular model of social networks are suitable in dealing with overlapping virtual communities in Big data. The knowledge encoding based on fuzzy–rough set provides superior performance than that of rough set. Future directions of research and challenges including the significance of z -numbers in precisiation of granules are stated. The article includes some of the results published elsewhere.

[1]  Andrea Cavallaro,et al.  Measures of Effective Video Tracking , 2014, IEEE Transactions on Image Processing.

[2]  Sankar K. Pal,et al.  Roughness of a Fuzzy Set , 1996, Inf. Sci..

[3]  Li-Chen Fu,et al.  Multitarget Visual Tracking Based Effective Surveillance With Cooperation of Multiple Active Cameras , 2011, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[4]  Hanzi Wang,et al.  Generalized Kernel-Based Visual Tracking , 2009, IEEE Transactions on Circuits and Systems for Video Technology.

[5]  Sankar K. Pal,et al.  Rough Self Organizing Map , 2004, Applied Intelligence.

[6]  Sankar Kumar Pal,et al.  Granular Flow Graph, Adaptive Rule Generation and Tracking , 2017, IEEE Transactions on Cybernetics.

[7]  Sankar K. Pal,et al.  Identifying relevant group of miRNAs in cancer using fuzzy mutual information , 2015, Medical & Biological Engineering & Computing.

[8]  Sankar K. Pal,et al.  Handbook on Soft Computing for Video Surveillance , 2012 .

[9]  Soumitra Dutta,et al.  Class-dependent rough-fuzzy granular space, dispersion index and classification , 2012, Pattern Recognit..

[10]  Kathleen M. Carley,et al.  Clearing the FOG: Fuzzy, overlapping groups for social networks , 2008, Soc. Networks.

[11]  Rita Cucchiara,et al.  Detecting Moving Objects, Ghosts, and Shadows in Video Streams , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[12]  R. Weiss,et al.  A Method for the Analysis of the Structure of Complex Organizations , 1955 .

[13]  Lucia Maddalena,et al.  Object Motion Detection and Tracking by an Artificial Intelligence Approach , 2008, Int. J. Pattern Recognit. Artif. Intell..

[14]  Andrzej Skowron,et al.  Rudiments of rough sets , 2007, Inf. Sci..

[15]  Lotfi A. Zadeh,et al.  Fuzzy logic = computing with words , 1996, IEEE Trans. Fuzzy Syst..

[16]  Hossein Rabbani,et al.  Stable Gene Signature Selection for Prediction of Breast Cancer Recurrence Using Joint Mutual Information , 2015, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[17]  C. A. Murthy,et al.  Fitting truncated geometric distributions in large scale real world networks , 2014, Theor. Comput. Sci..

[18]  Lotfi A. Zadeh,et al.  A New Direction in AI: Toward a Computational Theory of Perceptions , 2001, AI Mag..

[19]  Zdzislaw Pawlak,et al.  Flow Graphs and Data Mining , 2005, Trans. Rough Sets.

[20]  Sankar K. Pal,et al.  Rough fuzzy MLP: knowledge encoding and classification , 1998, IEEE Trans. Neural Networks.

[21]  Witold Pedrycz,et al.  Granular Computing: Perspectives and Challenges , 2013, IEEE Transactions on Cybernetics.

[22]  A. Bittner,et al.  Characterization of global microRNA expression reveals oncogenic potential of miR-145 in metastatic colorectal cancer , 2009, BMC Cancer.

[23]  Yanqing Zhang,et al.  Granular Neural Networks With Evolutionary Interval Learning , 2008, IEEE Transactions on Fuzzy Systems.

[24]  Sankar K. Pal,et al.  Granulation, rough entropy and spatiotemporal moving object detection , 2013, Appl. Soft Comput..

[25]  Sankar K. Pal,et al.  Neighborhood Rough Filter and Intuitionistic Entropy in Unsupervised Tracking , 2018, IEEE Transactions on Fuzzy Systems.

[26]  M Cieplak 蛋白質の折りたたみにおける協調性と接触秩序 | 文献情報 | J-GLOBAL 科学技術総合リンクセンター , 2004 .

[27]  Sankar K. Pal,et al.  Generalized Rough Sets, Entropy, and Image Ambiguity Measures , 2009, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[28]  A. Shelton,et al.  Changes in neuronal activation patterns in response to androgen deprivation therapy: a pilot study , 2010, BMC Cancer.

[29]  J.C. Rajapakse,et al.  SVM-RFE With MRMR Filter for Gene Selection , 2010, IEEE Transactions on NanoBioscience.

[30]  C. A. Murthy,et al.  Unsupervised Feature Selection Using Feature Similarity , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[31]  Lotfi A. Zadeh,et al.  A Note on Z-numbers , 2011, Inf. Sci..

[32]  Soumitra Dutta,et al.  Fuzzy rough granular neural networks, fuzzy granules, and classification , 2011, Theor. Comput. Sci..

[33]  Sankar K. Pal,et al.  Case generation using rough sets with fuzzy representation , 2004, IEEE Transactions on Knowledge and Data Engineering.

[34]  Andrzej Skowron,et al.  Data science, big data and granular mining , 2015, Pattern Recognit. Lett..

[35]  Sankar K. Pal,et al.  Z*-numbers: Augmented Z-numbers for machine-subjectivity representation , 2015, Inf. Sci..

[36]  Emilio Maggio,et al.  Video Tracking - Theory and Practice , 2011 .

[37]  Jason Weston,et al.  Gene Selection for Cancer Classification using Support Vector Machines , 2002, Machine Learning.

[38]  Yue Han,et al.  Stable Gene Selection from Microarray Data via Sample Weighting , 2012, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[39]  Lei Zhang,et al.  Fast Compressive Tracking , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[40]  M E J Newman,et al.  Finding and evaluating community structure in networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[41]  Z. Pawlak Rough Sets: Theoretical Aspects of Reasoning about Data , 1991 .

[42]  He Huang,et al.  Implementation of Robot Visual Tracking System Based on Rough Set Theory , 2008, 2008 Fifth International Conference on Fuzzy Systems and Knowledge Discovery.

[43]  Sankar K. Pal,et al.  Fuzzy–Rough Sets for Information Measures and Selection of Relevant Genes From Microarray Data , 2010, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[44]  Sankar K. Pal,et al.  A Granular Self-Organizing Map for Clustering and Gene Selection in Microarray Data , 2016, IEEE Transactions on Neural Networks and Learning Systems.

[45]  Sankar K. Pal,et al.  Fuzzy rough granular self-organizing map and fuzzy rough entropy , 2012, Theor. Comput. Sci..

[46]  Dorin Comaniciu,et al.  Kernel-Based Object Tracking , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[47]  Witold Pedrycz,et al.  Granular Computing - The Emerging Paradigm , 2007 .

[48]  Yiyu Yao,et al.  MGRS: A multi-granulation rough set , 2010, Inf. Sci..

[49]  M E J Newman,et al.  Community structure in social and biological networks , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[50]  Eckart Meese,et al.  High-throughput miRNA profiling of human melanoma blood samples , 2010, BMC Cancer.

[51]  Sankar K. Pal,et al.  Context granulation and subjective-information quantification , 2013, Theor. Comput. Sci..

[52]  Fuhui Long,et al.  Feature selection based on mutual information criteria of max-dependency, max-relevance, and min-redundancy , 2003, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[53]  Michalis Vazirgiannis,et al.  Clustering and Community Detection in Directed Networks: A Survey , 2013, ArXiv.

[54]  Dan Schonfeld,et al.  Video Tracking Based on Sequential Particle Filtering on Graphs , 2011, IEEE Transactions on Image Processing.

[55]  Sankar K. Pal,et al.  Fuzzy-rough community in social networks , 2015, Pattern Recognit. Lett..

[56]  Sung-Bae Cho,et al.  Fuzzy-Rough Entropy Measure and Histogram Based Patient Selection for miRNA Ranking in Cancer , 2018, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[57]  Sankar K. Pal,et al.  FGSN: Fuzzy Granular Social Networks - Model and applications , 2015, Inf. Sci..

[58]  Sankar K. Pal,et al.  A computational model for the endogenous arousal of thoughts through Z*-numbers , 2017, Inf. Sci..

[59]  Sankar K. Pal,et al.  Fuzzy mutual information based grouping and new fitness function for PSO in selection of miRNAs in cancer , 2017, Comput. Biol. Medicine.

[60]  Lotfi A. Zadeh,et al.  Toward a theory of fuzzy information granulation and its centrality in human reasoning and fuzzy logic , 1997, Fuzzy Sets Syst..

[61]  Jerzy W. Grzymala-Busse,et al.  Rough Sets , 1995, Commun. ACM.

[62]  A. Skowron,et al.  Towards adaptive calculus of granules , 1998 .

[63]  D. Kavanagh,et al.  Is overexpression of HER-2 a predictor of prognosis in colorectal cancer? , 2009, BMC Cancer.

[64]  Hui Fang,et al.  A fuzzy logic approach for detection of video shot boundaries , 2006, Pattern Recognit..

[65]  Sankar K. Pal,et al.  Title Paper: Natural computing: A problem solving paradigm with granular information processing , 2013, Appl. Soft Comput..

[66]  T. Vicsek,et al.  Uncovering the overlapping community structure of complex networks in nature and society , 2005, Nature.

[67]  Xizhao Wang,et al.  On the generalization of fuzzy rough sets , 2005, IEEE Transactions on Fuzzy Systems.

[68]  Sankar K. Pal,et al.  Fuzzy rough sets, and a granular neural network for unsupervised feature selection , 2013, Neural Networks.

[69]  M E J Newman,et al.  Fast algorithm for detecting community structure in networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[70]  Sankar K. Pal,et al.  Granular Neural Networks, Pattern Recognition and Bioinformatics , 2017, Studies in Computational Intelligence.

[71]  S. Boorman,et al.  Social Structure from Multiple Networks. II. Role Structures , 1976, American Journal of Sociology.

[72]  Sankar K. Pal,et al.  Neighborhood granules and rough rule-base in tracking , 2016, Natural Computing.