A modified K-means clustering for mining of multimedia databases based on dimensionality reduction and similarity measures

With the rapid innovations in digital technology and cloud computing of late, there has been a large volume of research on web-based storage, cloud management, and mining of data from the cloud. Large data sets are stored and processed daily on virtual or physical storage and processing equipment. Hence, there is a continuing need for research in these areas to minimize computational complexity and thereby reduce time and cost. This paper focuses on handling and mining multimedia data in a database that holds a mixed composition of data in the form of graphic arts and pictures, hypertext, text, video, and audio. Since audio and video data generally require large amounts of storage, the management and mining of such data from a multimedia database need special attention. Experimental observations using well-known data sets of varying features and dimensions indicate that the proposed cluster-based mining technique achieves promising results in comparison with other well-known methods. Every attribute denoting the efficiency of the mining process has been compared component-wise with recent mining techniques. The proposed system addresses effectiveness, robustness, and efficiency for a high-dimensional multimedia database.
