Verification of the fulfilment of the purposes of Basel II, Pillar 3 through application of the web log mining methods

The objective of the paper is the verification of the fulfilment of the purposes of Basel II, Pillar 3 – market discipline during the recent financial crisis. The objective of the paper is to describe the current state of the working out of the project that is focused on the analysis of the market participants’ interest in mandatory disclosure of financial information by a commercial bank by means of advanced methods of web log mining. The output of the realized project will be the verification of the assumptions related to the purposes of Basel III by means of the web mining methods, the recommendations for possible reduction of mandatory disclosure of information under Basel II and III, the proposal of the methodology for data preparation for web log mining in this application domain and the generalised procedure for users’ behaviour modelling dependent on time. The schedule of the project has been divided into three phases. The paper deals with its first phase that is focusing on the data pre-processing, analysis and evaluation of the required information under Basel II, Pillar 3 since 2008 and its disclosure into the web site of a commercial bank. The authors introduce the methodologies for data preparation and known heuristic methods for path completion into web log files with respect to the particularity of investigated application domain. They propose scientific methods for modelling users’ behaviour of the webpages related to Pillar 3 with respect to time.

[1]  Jaideep Srivastava,et al.  Data Preparation for Mining World Wide Web Browsing Patterns , 1999, Knowledge and Information Systems.

[2]  Jiawei Han,et al.  WebML: Querying the World-Wide Web for Resources and Knowledge , 1998, Workshop on Web Information and Data Management.

[3]  Peter Svec,et al.  Data preprocessing evaluation for web log mining: reconstruction of activities of a web visitor , 2010, ICCS.

[4]  Anália Lourenço,et al.  Catching web crawlers in the act , 2006, ICWE '06.

[5]  Bing Liu,et al.  Web Data Mining: Exploring Hyperlinks, Contents, and Usage Data , 2006, Data-Centric Systems and Applications.

[6]  Martin Drlík,et al.  Influence of Different Session Timeouts Thresholds on Results of Sequence Rule Analysis in Educational Data Mining , 2011, DICTAP.

[7]  Michal Munk,et al.  Probability modeling of accesses to the web parts of portal , 2011, WCIT.

[8]  Saul Greenberg,et al.  Revisitation patterns in World Wide Web navigation , 1997, CHI.

[9]  Myra Spiliopoulou,et al.  Analysis of navigation behaviour in web sites integrating multiple information systems , 2000, The VLDB Journal.

[10]  V. Chitraa,et al.  A Survey on Preprocessing Methods for Web Usage Data , 2010, ArXiv.

[11]  Javier Lorenzo,et al.  A Tool for Web Usage Mining , 2007, IDEAL.

[12]  Sebastián Ventura,et al.  Applying Web usage mining for personalizing hyperlinks in Web-based adaptive educational systems , 2009, Comput. Educ..

[13]  Myra Spiliopoulou,et al.  WUM: A tool for Web Utilization analysis , 1999 .

[14]  Peter Svec,et al.  Data advance preparation factors affecting results of sequence rule analysis in web log mining , 2010 .

[15]  Martin Drlík,et al.  Impact of Different Pre-Processing Tasks on Effective Identification of Users' Behavioral Patterns in Web-based Educational System , 2011, ICCS.

[16]  Philip S. Yu,et al.  Data mining for path traversal patterns in a web environment , 1996, Proceedings of 16th International Conference on Distributed Computing Systems.

[17]  Martin Drlík,et al.  Probability Modelling of Accesses to the Course Activities in the Web-Based Educational System , 2011, ICCSA.