QuantCloud: Enabling Big Data Complex Event Processing for Quantitative Finance Through a Data-Driven Execution

Quantitative Finance (QF) utilizes increasingly sophisticated mathematic models and advanced computer techniques to predict the movement of global markets, and price the derivatives and other assets. Being able to react quickly and intelligently to fast-changing markets is a decisive success factor for trading companies. To date, the rise of QF requires an integrated toolchain of enabling technologies to carry out complex event processing on the explosive growth and diversified forms of market metadata, in pursuit of a microsecond latency on an Exabyte-level dataset. Inspired by this, we present a data-driven execution paradigm that untangles the dependencies of complex processing events and integrate the paradigm with a big data infrastructure that streams time series data. This integrated platform is termed as the QuantCloud platform. Essentially, QuantCloud executes the complex event processing in a data-driven mode and manages large amounts of diversified market data in a data-parallel mode. To show its practicability and performance, we develop a prototype and benchmark by applying real-world QF research models on the New York Stock Exchange (NYSE) data. Using this prototype, we demonstrate this platform with an application to: (i) data cleaning and aggregating (including the computing of logarithmic returns from tick data and the finding the medians of grouped data) and (ii) data modeling: the autoregressive-moving average (ARMA) model. The performance results show that (a) this platform obtains a high throughput (usually in the order of millions of tick messages per second) and a sub-microsecond latency; (b) it fully executes data-dependent tasks through a data-driven execution; and (c) it implements a modular design approach for rapidly developing these data-crunching methods and QF research models. This platform resulting from an aggregated effort of the data-driven execution and big data infrastructure, offers the financial engineers with new insights and enhanced capabilities for effective and efficient incorporation of big data complex event processing technologies in their workflow.

[1]  Andrew J. Patton,et al.  What good is a volatility model? , 2001 .

[2]  Jan W Dash Quantitative Finance and Risk Management: A Physicist's Approach , 2004 .

[3]  Francis X. Diebold,et al.  Modeling and Forecasting Realized Volatility , 2001 .

[4]  Peng Zhang,et al.  SNC: A Cloud Service Platform for Symbolic-Numeric Computation Using Just-In-Time Compilation , 2018, IEEE Transactions on Cloud Computing.

[5]  N. Shephard,et al.  Econometric analysis of realized volatility and its use in estimating stochastic volatility models , 2002 .

[6]  J. Manyika,et al.  Are you ready for the era of ‘big data’? , 2010 .

[7]  Bin Fang,et al.  Big Data in Finance , 2016 .

[8]  Xu Dong New development on market microstructure and macrostructure: Patterns of U.S. high frequency data and a unified factor model framework , 2013 .

[9]  Peng Zhang,et al.  A data-driven paradigm for mapping problems , 2015, Parallel Comput..

[10]  Glenn Ellison,et al.  Risk Taking by Mutual Funds as a Response to Incentives , 1995, Journal of Political Economy.

[11]  Veda C. Storey,et al.  Business Intelligence and Analytics: From Big Data to Big Impact , 2012, MIS Q..

[12]  Ronald K. Pearson,et al.  Outliers in process modeling and identification , 2002, IEEE Trans. Control. Syst. Technol..

[13]  Thomas J. Chemmanur,et al.  Investment Bank Reputation, Information Production, and Financial Intermediation , 1994 .

[14]  Yin-Wong Cheung,et al.  Currency Traders and Exchange Rate Dynamics: A Survey of the U.S. Market , 2000, SSRN Electronic Journal.

[15]  Wei Jiang,et al.  On-line outlier detection and data cleaning , 2004, Comput. Chem. Eng..

[16]  Erik Brynjolfsson,et al.  Big data: the management revolution. , 2012, Harvard business review.

[17]  N. B. Anuar,et al.  The rise of "big data" on cloud computing: Review and open research issues , 2015, Inf. Syst..

[18]  Richard A. Brealey,et al.  Hedge Funds and Financial Stability: An Analysis of Their Factor Exposures , 2001 .

[19]  Christian T. Brownlees,et al.  Financial Econometric Analysis at Ultra-High Frequency: Data Handling Concerns , 2006, Comput. Stat. Data Anal..

[20]  Alexander J. McNeil,et al.  Quantitative Risk Management: Concepts, Techniques and Tools : Concepts, Techniques and Tools , 2015 .

[21]  Kai Yu,et al.  QuantCloud: Big Data Infrastructure for Quantitative Finance on the Cloud , 2018, IEEE Transactions on Big Data.

[22]  Helmut Krcmar,et al.  Big Data , 2014, Wirtschaftsinf..

[23]  A. Lo,et al.  Moore’s Law versus Murphy’s Law: Algorithmic Trading and Its Discontents † , 2013 .

[24]  Meikang Qiu,et al.  A Data-Oriented Method for Scheduling Dependent Tasks on High-Density Multi-GPU Systems , 2015, 2015 IEEE 17th International Conference on High Performance Computing and Communications, 2015 IEEE 7th International Symposium on Cyberspace Safety and Security, and 2015 IEEE 12th International Conference on Embedded Software and Systems.

[25]  Ed Thorp A Perspective on Quantitative Finance: Models for Beating the Market , 2004 .

[26]  W. Sharpe The Sharpe Ratio , 1994 .

[27]  Peng Zhang,et al.  Quantitative Data Analysis in Finance , 2017, Handbook of Big Data Technologies.

[28]  J. Manyika Big data: The next frontier for innovation, competition, and productivity , 2011 .

[29]  Yuan Yu,et al.  TensorFlow: A system for large-scale machine learning , 2016, OSDI.