Significance, relevance and explainability in the machine learning age: an econometrics and financial data science perspective

Although machine learning is frequently associated with neural networks, it also comprises econometric regression approaches and other statistical techniques whose accuracy improves as the number of observations increases. What constitutes high-quality machine learning, however, remains unclear. Proponents of deep learning (i.e. neural networks) value computational efficiency over human interpretability and tolerate the ‘black box’ nature of their algorithms, whereas proponents of explainable artificial intelligence (XAI) employ traceable ‘white box’ methods (e.g. regressions) to enhance explainability for human decision makers. We extend Brooks et al.’s [2019. ‘Financial Data Science: The Birth of a New Financial Research Paradigm Complementing Econometrics?’ European Journal of Finance 25 (17): 1627–36.] work on significance and relevance as assessment criteria in econometrics and financial data science to contribute to this debate. Specifically, we identify explainability as the Achilles heel of classic machine learning approaches such as neural networks, which are not fully replicable, lack transparency and traceability, and therefore do not permit attempts to establish causal inference. We conclude by suggesting routes for future research to advance the design and efficiency of ‘white box’ algorithms.
