Business Intelligence using Machine Learning and Data Mining techniques - An analysis

Today's social informatics is facing the challenge of exponentially increasing data present online. However this massive amount of data is available across several heterogeneous platform making it a challenging task especially to comprehend useful information and effectively use it for business intelligence. Achieving Business intelligence through machine learning is one of the significant issues in recent era. Previously, outliers were being considered as noisy data and disregarded leading to loss of relevant information. This paper highlights the major research challenges in this mining sub domain. It provides a comprehensive taxonomy extracted for Business Intelligence methodologies along with current application sectors. Future research directions and suggestions have been pointed to address this anomaly gap to achieve effective business strategies

[1]  Fabrizio Angiulli,et al.  Outlier Detection Techniques for Data Mining , 2009, Encyclopedia of Data Warehousing and Mining.

[2]  Hans-Peter Kriegel,et al.  Clustering high-dimensional data: A survey on subspace clustering, pattern-based clustering, and correlation clustering , 2009, TKDD.

[3]  Xavier Le Pallec,et al.  Multimodal interaction: a survey from model driven engineering and mobile perspectives , 2013, Journal on Multimodal User Interfaces.

[4]  Elke Achtert,et al.  Visual Evaluation of Outlier Detection Models , 2010, DASFAA.

[5]  Ian H. Witten,et al.  Data Mining: Practical Machine Learning Tools and Techniques, 3/E , 2014 .

[6]  Sridhar Ramaswamy,et al.  Efficient algorithms for mining outliers from large data sets , 2000, SIGMOD '00.

[7]  Charu C. Aggarwal,et al.  An Introduction to Social Network Data Analytics , 2011, Social Network Data Analytics.

[8]  Eamonn Keogh Mining Time Series Data , 2005 .

[9]  Eamonn J. Keogh,et al.  On the Need for Time Series Data Mining Benchmarks: A Survey and Empirical Demonstration , 2002, Data Mining and Knowledge Discovery.

[10]  Surendra Sarnikar,et al.  Introduction to Business Process Intelligence Minitrack , 2014, 2014 47th Hawaii International Conference on System Sciences.

[11]  Mahnoosh Kholghi,et al.  Classification and evaluation of data mining techniques for data stream requirements , 2010, 2010 International Symposium on Computer, Communication, Control and Automation (3CA).

[12]  Yuguang Fang,et al.  A Fine-Grained Reputation System for Reliable Service Selection in Peer-to-Peer Networks , 2007, IEEE Transactions on Parallel and Distributed Systems.

[13]  Hari Om,et al.  Evolving Fuzzy Min-max Neural Network for Outlier Detection , 2015 .

[14]  Hamid R. Nemati,et al.  Organizational Data Mining , 2010, Data Mining and Knowledge Discovery Handbook.

[15]  Dirk Neumann,et al.  Early Warning of Impending Oil Crises Using the Predictive Power of Online News Stories , 2013, 2013 46th Hawaii International Conference on System Sciences.

[16]  Ronald K. Pearson,et al.  BMC Bioinformatics BioMed Central Methodology article , 2005 .

[17]  Matthias W. Seeger,et al.  Gaussian Processes For Machine Learning , 2004, Int. J. Neural Syst..

[18]  Radford M. Neal Pattern Recognition and Machine Learning , 2007, Technometrics.

[19]  Ana R. Cavalli,et al.  Process Mining for Trust Monitoring , 2015, 2015 IEEE 29th International Conference on Advanced Information Networking and Applications Workshops.

[20]  Philip S. Yu,et al.  A General Model for Multiple View Unsupervised Learning , 2008, SDM.

[21]  Qingtian Zeng,et al.  Towards Comprehensive Support for Privacy Preservation Cross-Organization Business Process Mining , 2019, IEEE Transactions on Services Computing.

[22]  Din J. Wasem,et al.  Mining of Massive Datasets , 2014 .

[23]  Robert D. Nowak,et al.  Socioscope: Spatio-Temporal Signal Recovery from Social Media (Extended Abstract) , 2012, IJCAI.

[24]  Reda Alhajj,et al.  A Framework for Periodic Outlier Pattern Detection in Time-Series Sequences , 2014, IEEE Transactions on Cybernetics.

[25]  Gabriele Bavota,et al.  Mining Unstructured Data in Software Repositories: Current and Future Trends , 2016, 2016 IEEE 23rd International Conference on Software Analysis, Evolution, and Reengineering (SANER).