Computational Experimental Study on Social Organization Behavior Prediction Problems

With the development of mobile Internet, behavioral trajectories of human life are more and more recorded, which makes it possible to use computer technology to mine organizational behavior patterns. The mining of organizational behavior patterns based on social computing can not only prepare them in a targeted manner but also predict the consequences of possible measures. The organization behavior pattern mining has achieved a series of achievements in the fields of e-commerce and enterprise management. However, the problem of class imbalance and nonconsistent misclassification cost is common in the field of organizational behavior. For this problem, this article compares and analyzes the performance of the organizational behavior prediction model established by four typical cost-sensitive learning methods based on six classifiers, which provides a basis for the appropriate selection of cost-sensitive learning methods in different situations. Among them, the upsampling learning method is a better cost-sensitive learning method. However, there are some shortcomings in the upper sampling method. In order to avoid the possible overfitting problem of the social organization behavior prediction model established by the upper sampling method, this article proposes a new cost-sensitive learning method suitable for the mining of organizational behavior patterns. Based on the cost curve, this article proposes an effective personalized solution to the problem of class disequilibrium and nonconsistent misclassification cost in organizational behavior prediction modeling.

[1]  Aria Alasty,et al.  On the existence of proper stochastic Markov models for statistical reconstruction and prediction of chaotic time series , 2019, Chaos, Solitons & Fractals.

[2]  Petr Hájek,et al.  Predicting Regional Credit Ratings Using Ensemble Classification with MetaCost , 2019, CSOC.

[3]  S. Athey,et al.  Generalized random forests , 2016, The Annals of Statistics.

[4]  Yuanyuan Qiao,et al.  An improved Markov method for prediction of user mobility , 2016, 2016 12th International Conference on Network and Service Management (CNSM).

[5]  Zhiyong Lu,et al.  TaggerOne: joint named entity recognition and normalization with semi-Markov Models , 2016, Bioinform..

[6]  Hubert Razik,et al.  Hidden Markov Models for the Prediction of Impending Faults , 2016, IEEE Transactions on Industrial Electronics.

[7]  Bennamoun,et al.  Cost-Sensitive Learning of Deep Feature Representations From Imbalanced Data , 2015, IEEE Transactions on Neural Networks and Learning Systems.

[8]  David Zhang,et al.  Evolutionary Cost-Sensitive Extreme Learning Machine , 2015, IEEE Transactions on Neural Networks and Learning Systems.

[9]  V. S. Subrahmanian,et al.  CONVEX: Similarity-Based Algorithms for Forecasting Group Behavior , 2008, IEEE Intelligent Systems.

[10]  Vicky Arnold,et al.  Use of Knowledge Management Systems and the Impact on the Acquisition of Explicit Knowledge , 2008, J. Inf. Syst..

[11]  Chung-Jen Chen,et al.  How organizational climate and structure affect knowledge management - The social interaction perspective , 2007, Int. J. Inf. Manag..

[12]  Fadi A. Thabtah,et al.  A review of associative classification mining , 2007, The Knowledge Engineering Review.

[13]  Sotiris B. Kotsiantis,et al.  Machine learning: a review of classification and combining techniques , 2006, Artificial Intelligence Review.

[14]  Hsiu-Fen Lin,et al.  Impact of organizational learning and knowledge management factors on e‐business adoption , 2005 .

[15]  Kai Ming Ting,et al.  An Instance-weighting Method to Induce Cost-sensitive Trees , 2001 .

[16]  Charles Elkan,et al.  The Foundations of Cost-Sensitive Learning , 2001, IJCAI.

[17]  Pedro M. Domingos MetaCost: a general method for making classifiers cost-sensitive , 1999, KDD '99.

[18]  Martin Krzywinski,et al.  Points of Significance: Classification and regression trees , 2017, Nature Methods.

[19]  Fu Zhong-liang Cost-sensitive Ensemble Learning Algorithm for Multi-label Classification Problems , 2014 .

[20]  Robert C. Holte,et al.  C4.5, Class Imbalance, and Cost Sensitivity: Why Under-Sampling beats Over-Sampling , 2003 .

[21]  Nitesh V. Chawla,et al.  SMOTE: Synthetic Minority Over-sampling Technique , 2002, J. Artif. Intell. Res..

[22]  R. Agrawal,et al.  Fast Algorithms for Mining Association Rules , 1998 .

[23]  Zhi-Hua Zhou,et al.  Ieee Transactions on Knowledge and Data Engineering 1 Training Cost-sensitive Neural Networks with Methods Addressing the Class Imbalance Problem , 2022 .