Enhanced DBSCAN with Hierarchical Tree for Web Rule Mining

Like other mining, web mining is also necessary to increase the power of web search engine to identify the intended web page and web document. While processing with large datasets, there arises several issues associated with space availability, similarity relationships between different webpage’s and running time. Hence, this paper intends to develop an enhanced web mining model based on two contributions. At first, the hierarchical tree is framed, which produces different categories of the searching queries (different web pages). Next, to hierarchical tree model, enhanced Density-Based Spatial Clustering of Applications with Noise (DBSCAN) technique model is developed by modifying the traditional DBSCAN. This technique results in proper session identification from raw data. Moreover, this technique offers the optimal level of clusters necessitated for hierarchical clustering. After hierarchical clustering, the rule mining is adopted. The traditional rule mining technique is generally based on the frequency; however, this paper intends to enhance the traditional rule mining based on utility factor as the second contribution. Hence the proposed model for web rule mining is termed as Enhanced DBSCAN-based Hierarchical Tree (EDBHT). It benefits in providing the search results depending on high level information (e.g., location), so that the ability of search engine in providing the interesting association rules can be improved. Next, to the implementation, the performance of proposed EDBHT is found to be enhanced when compared over several traditional models.

[1]  Xiao Qin,et al.  Interrelation analysis of celestial spectra data using constrained frequent pattern trees , 2013, Knowl. Based Syst..

[2]  V Sujatha,et al.  Improved user Navigation Pattern Prediction Technique from Web Log Data , 2012 .

[3]  Ashutosh Gupta,et al.  Improvised Apriori Algorithm using frequent pattern tree for real time applications in data mining , 2014, ArXiv.

[4]  Zhenjun Ma,et al.  A decision tree based data-driven diagnostic strategy for air handling units , 2016 .

[5]  Araceli Sanchis,et al.  Web news mining in an evolving framework , 2016, Inf. Fusion.

[6]  Elena Baralis,et al.  Frequent Itemsets Mining for Big Data: A Comparative Analysis , 2017, Big Data Res..

[7]  Xiaoli Zhang,et al.  Web-video-mining-supported workflow modeling for laparoscopic surgeries , 2016, Artif. Intell. Medicine.

[8]  Aiiad Albeshri,et al.  Analysis of Eight Data Mining Algorithms for Smarter Internet of Things (IoT) , 2016, EUSPN/ICTH.

[9]  Sattar Hashemi,et al.  DFP-SEPSF: A dynamic frequent pattern tree to mine strong emerging patterns in streamwise features , 2015, Eng. Appl. Artif. Intell..

[10]  G. Singh,et al.  Adaptive network architecture and firefly algorithm for biogas heating model aided by photovoltaic thermal greenhouse system , 2018 .

[11]  Archana H. Sable,et al.  Modified Double Bilateral Filter for Sharpness Enhancement and Noise Removal , 2010, 2010 International Conference on Advances in Computer Engineering.

[12]  Li Hanguang,et al.  Intrusion Detection Technology Research Based on Apriori Algorithm , 2012 .

[13]  B. Sivakumar,et al.  Cross-entropy clustering framework for catchment classification , 2017 .

[14]  Francisco Herrera,et al.  Tutorial on practical tips of the most influential data preprocessing algorithms in data mining , 2016, Knowl. Based Syst..

[15]  José Sena-Cruz,et al.  Using data mining algorithms to predict the bond strength of NSM FRP systems in concrete , 2016 .

[16]  Raymond Y. K. Lau,et al.  An ontology-based Web mining method for unemployment rate prediction , 2014, Decis. Support Syst..

[17]  Chengfei Liu,et al.  AutoRM: An effective approach for automatic Web data record mining , 2015, Knowl. Based Syst..

[18]  Sreenatha G. Anavatti,et al.  Evolving type-2 web news mining , 2017, Appl. Soft Comput..

[19]  Phong Thanh Nguyen,et al.  A hybrid multi criteria decision analysis for engineering project manager evaluation , 2017 .

[20]  A. Rama Mohan Reddy,et al.  A fast DBSCAN clustering algorithm by accelerating neighbor searching using Groups method , 2016, Pattern Recognit..

[21]  Luiz Flavio Autran Monteiro Gomes,et al.  Multi-criteria Web Mining with DRSA , 2016 .

[22]  Ming Li,et al.  An approach of product usability evaluation based on Web mining in feature fatigue analysis , 2014, Comput. Ind. Eng..

[23]  Eduardo Sany Laber,et al.  Decision tree classification with bounded number of errors , 2017, Inf. Process. Lett..

[24]  Fang Liu,et al.  Learning simultaneous adaptive clustering and classification via MOEA , 2016, Pattern Recognit..

[25]  Corrado Moiso,et al.  Identifying user habits through data mining on call data records , 2016, Eng. Appl. Artif. Intell..

[26]  Marco Comuzzi,et al.  Combining Apriori heuristic and bio-inspired algorithms for solving the frequent itemsets mining problem , 2017, Inf. Sci..

[27]  Seyed-Hassan Mirian-Hosseinabadi,et al.  Event-driven web application testing based on model-based mutation testing , 2015, Inf. Softw. Technol..

[28]  Dirk Thorleuchter,et al.  Weak signal identification with semantic web mining , 2013, Expert Syst. Appl..

[29]  Gintautas Dzemyda,et al.  A new web-based solution for modelling data mining processes , 2017, Simul. Model. Pract. Theory.

[30]  María N. Moreno García,et al.  Web mining based framework for solving usual problems in recommender systems. A case study for movies' recommendation , 2016, Neurocomputing.

[31]  C. Brühl,et al.  Stratospheric aerosol data records for the climate change initiative: Development, validation and application to chemistry-climate modelling , 2017 .

[32]  Reda Alhajj,et al.  Effective web log mining and online navigational pattern prediction , 2013, Knowl. Based Syst..

[33]  F Simeonov,et al.  Web-based platform for patient dose surveys in diagnostic and interventional radiology in Bulgaria: Functionality testing and optimisation. , 2017, Physica medica : PM : an international journal devoted to the applications of physics to medicine and biology : official journal of the Italian Association of Biomedical Physics.