Abstract
Current general purpose libraries for multiple precision floating point
arithmetic such as Mpfr suffer from a large
performance penalty with respect to hard-wired instructions. The
performance gap tends to become even larger with the advent of wider
SIMD arithmetic in both CPUs and GPUs. In this paper, we present
efficient algorithms for multiple precision floating point arithmetic
that are suitable for implementations on SIMD processors.
Keywords: floating point arithmetic, multiple
precision, SIMD, matrix multiplication
View: Html, TeXmacs, Pdf,
BibTeX