Full-Speed Deterministic Bit-Accurate Parallel Floating-Point Summation on Multi- and Many-Core Architectures
暂无分享,去创建一个
David Defour | Sylvain Collange | Stef Graillat | Roman Iakymchuk | S. Graillat | D. Defour | R. Iakymchuk | Caroline Collange
[1] Nicholas J. Higham,et al. INVERSE PROBLEMS NEWSLETTER , 1991 .
[2] Donald E. Knuth,et al. The art of computer programming. Vol.2: Seminumerical algorithms , 1981 .
[3] Vincent Lefèvre,et al. MPFR: A multiple-precision binary floating-point library with correct rounding , 2007, TOMS.
[4] Ulrich W. Kulisch,et al. Comments on Fast and Exact Accumulation of Products , 2010, PARA.
[5] Alex Fit-Florea,et al. Precision and Performance: Floating Point and IEEE 754 Compliance for NVIDIA GPUs , 2011 .
[6] Guillaume Melquiond,et al. Emulation of a FMA and Correctly Rounded Sums: Proved Algorithms Using Rounding to Odd , 2008, IEEE Transactions on Computers.
[7] James Reinders,et al. Intel® threading building blocks , 2008 .
[8] David Defour,et al. SOFTWARE CARRY-SAVE FOR FAST MULTIPLE-PRECISION ALGORITHMS , 2002 .
[9] Siegfried M. Rump,et al. Ultimately Fast Accurate Summation , 2009, SIAM J. Sci. Comput..
[10] Jonathan M. Borwein,et al. High-precision computation: Mathematical physics and dynamics , 2010, Appl. Math. Comput..
[11] Ulrich W. Kulisch,et al. The exact dot product as basic tool for long interval arithmetic , 2011, Computing.
[12] James Demmel,et al. Design, implementation and testing of extended and mixed precision BLAS , 2000, TOMS.