Mapping of option pricing algorithms onto heterogeneous many-core architectures

The rapid development of technologies and applications in recent years poses high demands and challenges for high-performance computing. Because of their competitive performance/price ratio, heterogeneous many-core architectures are widely used in high-performance computing areas. GPU and Xeon Phi are two popular general-purpose many-core accelerators. In this paper, we demonstrate how heterogeneous many-core architectures, powered by multi-core CPUs, CUDA-enabled GPUs and Xeon Phis can be used as an efficient computational platform to accelerate popular option pricing algorithms. In order to make full use of the compute power of this architecture, we have used a hybrid computing model which consists of two types of data parallelism: worker level and device level. The worker level data parallelism uses a distributed computing infrastructure for task distribution, while the device level data parallelism uses both the multi-core CPUs and many-core accelerators for fast option pricing calculation. Experiments show that our implementations achieve good performance and scalability on this architecture and also outperform other state-of-the-art GPU-based solutions for Monte Carlo European/American option pricing and BSDE European option pricing.

[1]  Vladimir Surkov Parallel option pricing with Fourier Space Time-stepping method on Graphics Processing Units , 2008, 2008 IEEE International Symposium on Parallel and Distributed Processing.

[2]  Prabhas Chongstitvatana,et al.  Parallelization of European Monte-Carlo options pricing on graphics processing units , 2011, 2011 Eighth International Joint Conference on Computer Science and Software Engineering (JCSSE).

[3]  Stavros A. Zenios,et al.  High-performance computing in finance: The last 10 years and the next , 1999, Parallel Comput..

[4]  Alexandros V. Gerbessiotis,et al.  Architecture independent parallel binomial tree option price valuations , 2004, Parallel Comput..

[5]  Srinivas Aluru,et al.  Parallel Mutual Information Based Construction of Whole-Genome Networks on the Intel (R) Xeon Phi (TM) Coprocessor , 2014, 2014 IEEE 28th International Parallel and Distributed Processing Symposium.

[6]  Aart J. C. Bik,et al.  Automatic Intra-Register Vectorization for the Intel® Architecture , 2002, International Journal of Parallel Programming.

[7]  R. Farnoosh,et al.  Stochastic Differential Equations and Application of the Kalman-Bucy Filter in the Modeling of RC Circuit , 2010 .

[8]  Ying Peng,et al.  Parallel Option Pricing with BSDE Method on GPU , 2010, 2010 Ninth International Conference on Grid and Cloud Computing.

[9]  Frank Kienle,et al.  An Energy Efficient FPGA Accelerator for Monte Carlo Option Pricing with the Heston Model , 2011, 2011 International Conference on Reconfigurable Computing and FPGAs.

[10]  Hans Werner Meuer,et al.  Top500 Supercomputer Sites , 1997 .

[11]  Süleyman Özekici,et al.  Parallel computing in Asian option pricing , 2007, Parallel Comput..

[13]  Viktor K. Prasanna,et al.  Tiling, Block Data Layout, and Memory Hierarchy Performance , 2003, IEEE Trans. Parallel Distributed Syst..

[14]  Mark Bull,et al.  Development of mixed mode MPI / OpenMP applications , 2001, Sci. Program..

[15]  Roger D. Chamberlain,et al.  Acceleration of Binomial Options Pricing via Parallelizing along time-axis on a GPU , 2011 .

[16]  Paul Glasserman,et al.  Monte Carlo Methods in Financial Engineering , 2003 .

[17]  S. Peng Backward Stochastic Differential Equation, Nonlinear Expectation and Their Applications , 2011 .

[18]  Hamid R. Arabnia,et al.  Parallel Edge-Region-Based Segmentation Algorithm Targeted at Reconfigurable MultiRing Network , 2003, The Journal of Supercomputing.

[19]  Hui Liu,et al.  Option Pricing on the GPU with Backward Stochastic Differential Equation , 2011, 2011 Fourth International Symposium on Parallel Architectures, Algorithms and Programming.

[20]  Gary L. Mullen,et al.  Parallel computing of a quasi-Monte Carlo algorithm for valuing derivatives , 2000, Parallel Comput..

[21]  Sunil Kumar,et al.  Optimizations in financial engineering: The Least-Squares Monte Carlo method of Longstaff and Schwartz , 2008, 2008 IEEE International Symposium on Parallel and Distributed Processing.

[22]  Francis A. Longstaff,et al.  Valuing American Options by Simulation: A Simple Least-Squares Approach , 2001 .

[23]  Gene Cooperman,et al.  STAR/MPI: binding a parallel library to interactive symbolic algebra systems , 1995, ISSAC '95.

[24]  S. Ross,et al.  Option pricing: A simplified approach☆ , 1979 .

[25]  Myungho Lee,et al.  Parallel implementation of a financial application on a GPU , 2009, ICIS.

[26]  Tom VanCourt,et al.  FPGA acceleration of quasi-Monte Carlo in finance , 2008, 2008 International Conference on Field Programmable Logic and Applications.

[27]  Abdeluaab Lidouh,et al.  Parallel Monte Carlo Method for Pricing Asian Options Using Trapezium Scheme , 2011 .

[28]  Massimiliano Fatica,et al.  Pricing American options with least squares Monte Carlo on GPUs , 2013, WHPCF '13.

[29]  J. Bismut Conjugate convex functions in optimal stochastic control , 1973 .

[30]  Hamid R. Arabnia,et al.  Parallel stereocorrelation on a reconfigurable multi-ring network , 1996, The Journal of Supercomputing.

[31]  Ruppa K. Thulasiram,et al.  Parallel algorithm for pricing American Asian options with multi-dimensional assets , 2005, 19th International Symposium on High Performance Computing Systems and Applications (HPCS'05).

[32]  Yongchao Liu,et al.  Mapping of BLASTP Algorithm onto GPU Clusters , 2011, 2011 IEEE 17th International Conference on Parallel and Distributed Systems.

[33]  Ruppa K. Thulasiram,et al.  Performance evaluation of parallel algorithms for pricing multidimensional financial derivatives , 2002, Proceedings. International Conference on Parallel Processing Workshop.

[34]  F. Black,et al.  The Pricing of Options and Corporate Liabilities , 1973, Journal of Political Economy.

[35]  Stéphane Vialle,et al.  Pricing derivatives on graphics processing units using Monte Carlo simulation , 2014, Concurr. Comput. Pract. Exp..