A comparative study of stock scoring using regression and genetic-based linear models

Stock selection has long been a challenging and important task in investment and finance. Researchers and practitioners in this area often use regression models to tackle this problem due to their simplicity and effectiveness. Recent advances in machine learning (ML) are leading to significant opportunities to solve these problems more effectively. In this paper, we present a comparative study between the traditional regression-based and ML-based linear models for stock scoring, which is crucial to the success of stock selection. In ML-based models, Genetic Algorithms (GA), a class of well-known search algorithms in the area of ML, is used for optimization of model parameters and selection of input variables to the stock scoring model. We will show that our proposed genetic-based method significantly outperforms the traditional regression-based method as well as the benchmark. As a result, we expect this genetic-based methodology to advance the research in machine learning for finance and provide an attractive alternative to stock selection over the regression-based approach.

[1]  Robert R. Johnson,et al.  New Evidence on Size and Price-to-Book Effects in Stock Returns , 1997 .

[2]  Ingoo Han,et al.  Genetic algorithms approach to feature discretization in artificial neural networks for the prediction of stock price index , 2000 .

[3]  Edwin J. Elton,et al.  Risk Reduction and Portfolio Size: An Analytical Solution , 1977 .

[4]  Ronnie Sadka,et al.  Predictability and the Earnings-Returns Relation , 2008 .

[5]  Ta-Chung Chu,et al.  Application of fuzzy multiple attribute decision making on company analysis for stock selection , 1996, Soft Computing in Intelligent Systems and Information Processing. Proceedings of the 1996 Asian Fuzzy Systems Symposium.

[6]  E. Fama,et al.  Average Returns, B/M, and Share Issues , 2007 .

[7]  Thomas A. Carnes Unexpected Changes in Quarterly Financial-Statement Line Items and Their Relationship to Stock Prices , 2006 .

[8]  Campbell R. Harvey,et al.  Fundamental Determinants of National Equity Market Returns: A Perspective on Conditional Asset Pricing , 1996 .

[9]  Mehdi R. Zargham,et al.  A Web-based information system for stock selection and evaluation , 1999, Proceedings of International Workshop on Advance Issues of E-Commerce and Web-Based Information Systems. (Cat. No.PR00334).

[10]  Josef Lakonishok,et al.  Corporate Governance through the Proxy Contest: Evidence and Implications , 1993 .

[11]  J. Lewellen,et al.  Predicting Returns with Financial Ratios , 2002 .

[12]  Charles Chang,et al.  Trading imbalances, predictable reversals, and cross-stock price pressure , 2008 .

[13]  Miguel A. Ferreira,et al.  Forecasting Stock Market Returns: The Sum of the Parts is More than the Whole , 2008 .

[14]  Mohammed Omran,et al.  Linear Versus Non‐linear Relationships Between Financial Ratios and Stock Returns: Empirical Evidence from Egyptian Firms , 2004 .

[15]  Dongsong Zhang,et al.  Discovering golden nuggets: data mining in financial application , 2004, IEEE Trans. Syst. Man Cybern. Part C.

[16]  Michael Caplan,et al.  Lessons Learned Using Genetic Programming in a Stock Picking Context , 2005 .

[17]  Thomas D. Dowdell,et al.  The Return-Stages Valuation Model and the Expectations within a Firm's P/B and P/E Ratios , 2001 .

[18]  Ying L. Becker,et al.  Stock Selection - an Innovative Application of Genetic Programming Methodology , 2006 .

[19]  Mark T. Soliman,et al.  The Use of Dupont Analysis by Market Participants , 2007 .

[20]  Erik Hjalmarsson,et al.  Predicting Global Stock Returns , 2008, Journal of Financial and Quantitative Analysis.

[21]  Joseph D. Piotroski Value Investing: The Use of Historical Financial Statement Information to Separate Winners from Losers , 2000 .

[22]  Rob Bauer,et al.  Empirical evidence on corporate governance in Europe: The effect on stock returns, firm value and performance , 2003 .

[23]  Sandip Mukherji,et al.  A Fundamental Analysis of Korean Stock Returns , 1997 .

[24]  Jihoon Yang,et al.  Feature Subset Selection Using a Genetic Algorithm , 1998, IEEE Intell. Syst..

[25]  John L. Evans,et al.  DIVERSIFICATION AND THE REDUCTION OF DISPERSION: AN EMPIRICAL ANALYSIS , 1968 .

[26]  Nicolas Chapados,et al.  Cost functions and model combination for VaR-based asset allocation using neural networks , 2001, IEEE Trans. Neural Networks.

[27]  Pedro Isasi Viñuela,et al.  Soft computing techniques applied to finance , 2008, Applied Intelligence.

[28]  Bao Rong Chang,et al.  A study of a hybrid evolutionary fuzzy model for stock selection , 2011, 2011 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE 2011).

[29]  Tong-Seng Quah,et al.  Improving returns on stock investment through neural network selection , 1999 .