Big Data Analytics Implementation In Banking Industry – Case Study Cross Selling Activity In Indonesia’s Commercial Bank

In 21st century, the big data revolution is happening and has found its place, also within the banking Industry, Bank can leverage big data analytics to gain deeper insights for customers, channels, and the entire market. Integrating predictive analytics with automatic decision making, a bank can better understand the preference of its customers, identify customers with high spending potential, promote the right products to the right customers / cross selling, and improve customer experience, and drive revenue. One of Indonesia’s commercial bank is having an issue with cross selling activity of loan product, and seeks a big data analytics solution to help them. This paper aims to design paper aims to create a design of big data analytics application architecture, suitable business rule and model for cross selling analysis in the Bank. By leveraging Cloudera Hadoop, Aster Analytics as big data analytics engine, TeraData RDMS as their target storage for analytics result, and tableau for data visualization, also Talend data integrator for ETL engine we can perform cross selling analytics for several Bank loan products with a promising result, also design business rule and algorithm used for performing the analytics by using Propensity model using Random Forest and special tagging using SAX using bank specific threshold, and additional filter. Random forest algorithm is showing a good result measured by ROC / AUC.

[1]  Bankim Patel,et al.  Big Data Analysis: Recommendation System with Hadoop Framework , 2015, 2015 IEEE International Conference on Computational Intelligence & Communication Technology.

[2]  Chao Ren,et al.  SQL-SA for big data discovery polymorphic and parallelizable SQL user-defined scalar and aggregate infrastructure in Teradata Aster 6.20 , 2016, 2016 IEEE 32nd International Conference on Data Engineering (ICDE).

[3]  Terry Anthony Byrd,et al.  Big data analytics: Understanding its capabilities and potential benefits for healthcare organizations , 2018 .

[4]  Emyana Ruth Eritha Sirait,et al.  IMPLEMENTASI TEKNOLOGI BIG DATA DI LEMBAGA PEMERINTAHAN INDONESIA , 2016 .

[5]  Yonggang Wen,et al.  Toward Scalable Systems for Big Data Analytics: A Technology Tutorial , 2014, IEEE Access.

[6]  Chongqing Kang,et al.  Load profiling and its application to demand response: A review , 2015 .

[7]  Yu Xiao,et al.  Large-Scale Graph Analytics in Aster 6: Bringing Context to Big Data Discovery , 2014, Proc. VLDB Endow..

[8]  Michel Wedel,et al.  Cross-Selling Through Database Marketing: A Mixed Data Factor Analyzer for Data Augmentation and Prediction , 2003 .

[9]  Kenli Li,et al.  A Parallel Random Forest Algorithm for Big Data in a Spark Cloud Computing Environment , 2017, IEEE Transactions on Parallel and Distributed Systems.

[10]  Kyoji Kawagoe,et al.  Extended SAX: Extension of Symbolic Aggregate Approximation for Financial Time Series Data Representation , 2006 .

[11]  Weiwei Lin,et al.  An Ensemble Random Forest Algorithm for Insurance Big Data Analysis , 2017, IEEE Access.

[12]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[13]  T. Prokop,et al.  Heterogeneous Medical Data Processing , 2017 .

[14]  Daniel Adomako Asamoah,et al.  Teaching Case: Who Renews? Who Leaves? Identifying Customer Churn in a Telecom Company Using Big Data Techniques , 2016, J. Inf. Syst. Educ..

[15]  Dirk Van den Poel,et al.  Predicting customer retention and profitability by using random forests and regression forests techniques , 2005, Expert Syst. Appl..

[16]  X. Zhu,et al.  iCARE: A framework for big data-based banking customer analytics , 2014, IBM J. Res. Dev..

[17]  R. Mahalakshmi,et al.  Semantic Filtering of IoT Data using Symbolic Aggregate Approximation ( SAX ) , 2016 .

[18]  Alexandros Labrinidis,et al.  Challenges and Opportunities with Big Data , 2012, Proc. VLDB Endow..

[19]  Tan Yi Fei,et al.  Network Equipment Failure Prediction with Big Data Analytics , 2016 .

[20]  Veershetty Dagade,et al.  Big Data Weather Analytics Using Hadoop , .

[21]  Kenneth D. Strang,et al.  Analyzing Relationships in Terrorism Big Data Using Hadoop and Statistics , 2017, J. Comput. Inf. Syst..

[22]  Bart Baesens,et al.  Predicting interpurchase time in a retail environment using customer-product networks: An empirical study and evaluation , 2018, Expert Syst. Appl..

[23]  Anne-Laure Boulesteix,et al.  Overview of random forest methodology and practical guidance with emphasis on computational biology and bioinformatics , 2012, WIREs Data Mining Knowl. Discov..

[24]  Lin Li,et al.  Risk adjustment of patient expenditures: A big data analytics approach , 2013, 2013 IEEE International Conference on Big Data.

[25]  Erik Brynjolfsson,et al.  Big data: the management revolution. , 2012, Harvard business review.

[26]  Aleksey Bondarev,et al.  A comparison of data mining techniques in evaluating retail credit scoring using R programming , 2015, 2015 Twelve International Conference on Electronics Computer and Computation (ICECCO).

[27]  Lujiao Tan Data-Driven Marketing: Purchase Behavioral Targeting in Travel Industry based on Propensity Model , 2017 .

[28]  In Lee,et al.  Big data: Dimensions, evolution, impacts, and challenges , 2017 .

[29]  Yi Wang,et al.  Clustering of Electricity Consumption Behavior Dynamics Toward Big Data Applications , 2016, IEEE Transactions on Smart Grid.

[30]  Awais Ahmad,et al.  Urban planning and building smart cities based on the Internet of Things using Big Data analytics , 2016, Comput. Networks.

[31]  Gianmarco De Francisci Morales,et al.  Random Forests of Very Fast Decision Trees on GPU for Mining Evolving Big Data Streams , 2014, ECAI.

[32]  Robert P. Sheridan,et al.  Random Forest: A Classification and Regression Tool for Compound Classification and QSAR Modeling , 2003, J. Chem. Inf. Comput. Sci..

[33]  Rohith Subramanyam HDFS Heterogeneous Storage Resource Management Based on Data Temperature , 2015, 2015 International Conference on Cloud and Autonomic Computing.

[34]  Utkarsh Srivastava,et al.  Impact of Big Data Analytics on Banking Sector: Learning for Indian Banks , 2015 .

[35]  Wei Shen,et al.  Face recognition model based on privacy protection and random forest algorithm , 2018, 2018 27th Wireless and Optical Communication Conference (WOCC).