Fuzzy factorization machine

Abstract: Rational and accurate classification requires both historical information and domain knowledge. We propose the fuzzy factorization machine (fuzzy FM), which integrates fuzzy set theory with factorization machine techniques for knowledge-enhanced classification. Each instance is assigned a membership degree based on experts' estimates, and the instance's contribution to the objective function is weighted by that membership rather than penalized equally as in the standard FM. By adopting different weighting strategies, we derive two variants of fuzzy FM: the unilaterally weighted fuzzy FM (UFFM) and the bilaterally weighted fuzzy FM (BFFM). In UFFM, each instance is assigned to exactly one class. In BFFM, an instance need not be fully assigned to either of the two classes, which improves the classification of imbalanced data. We summarize a set of membership generation approaches for quantifying experts' prior estimates, and we introduce solution methods based on stochastic gradient descent for both UFFM and BFFM. Experiments on real credit datasets show that the proposed fuzzy FM models yield more rational classification than the baselines, including the standard FM. The proposed fuzzy FM is a generic machine learning framework that can be applied to a variety of rational classification tasks.
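The membership-weighted objective described above can be illustrated with a minimal sketch. The abstract does not state the loss function or the exact form of the two weighting schemes, so everything below is an assumption made for illustration: a second-order FM score, a logistic loss on labels in {-1, +1}, a unilateral loss that scales the loss on the observed label by the membership m, and a bilateral loss that treats the instance as positive with weight m and as negative with weight 1 - m. All function names are hypothetical.

```python
import math

def fm_score(x, w0, w, V):
    """Second-order FM score, using the linear-time form of the pairwise term.

    x: feature values, w0: bias, w: linear weights,
    V: one latent vector (list of k floats) per feature.
    """
    n, k = len(x), len(V[0])
    linear = sum(w[j] * x[j] for j in range(n))
    pairwise = 0.0
    for f in range(k):
        s = sum(V[j][f] * x[j] for j in range(n))
        s_sq = sum((V[j][f] * x[j]) ** 2 for j in range(n))
        pairwise += 0.5 * (s * s - s_sq)
    return w0 + linear + pairwise

def log_loss(y, score):
    """Logistic loss for a label y in {-1, +1} (assumed loss, not from the source)."""
    return math.log1p(math.exp(-y * score))

def unilateral_loss(y, m, score):
    """Assumed UFFM-style term: loss on the observed label, scaled by membership m."""
    return m * log_loss(y, score)

def bilateral_loss(m, score):
    """Assumed BFFM-style term: positive with weight m, negative with weight 1 - m."""
    return m * log_loss(+1, score) + (1.0 - m) * log_loss(-1, score)

# Toy instance with two features and one latent factor (made-up numbers).
score = fm_score([1.0, 2.0], w0=0.1, w=[0.2, -0.3], V=[[0.5], [1.0]])
print(score, unilateral_loss(+1, 0.8, score), bilateral_loss(0.8, score))
```

Under these assumptions, a membership of 1 reduces both terms to the standard, equally weighted FM loss for a positive instance, while intermediate memberships let a single instance contribute to both classes in the bilateral case.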
