background info on algorithm used for fmpz_set

Niek Bouman

unread,

Nov 15, 2023, 6:09:09 AM11/15/23

to flint-devel

Dear flint-devs,

In a discussion about base-10 integer-literal parsing in LLVM, I concluded from a benchmark that flint's implementation (fmpz_set_str) is far superior over LLVM's APInt implementation. See also:

https://github.com/carbon-language/carbon-lang/issues/980

Due to licensing issues with LGPL, LLVM folks cannot use flint and have to reimplement anything themselves.

Could anyone perhaps provide information about which strategy/algorithm is used in fmpz_set_str ? (Of course the source code is public, but having a pointer to the algorithm would make it much easier)

best,
Niek Bouman

Fredrik Johansson

unread,

Nov 15, 2023, 6:47:58 AM11/15/23

to flint...@googlegroups.com

Hi Niek,

fmpz_set_str just calls GMP for input smaller than about 10000 digits. Unless you're parsing million-digit integers and you compiled FLINT with --enable-avx2 so that the new FFT is used, you're just looking at the performance of GMP's mpn_set_str.

The basecase algorithm used in GMP for numbers up to a few limbs is AFAIK just a nested Horner's rule: you have an inner base-10 Horner loop that accumulates 19 digits in a single word and an outer base-10^19 Horner loop that updates your multi-limb result using mpn_mul_1 and mpn_add_1. For larger numbers GMP uses the standard divide and conquer algorithm, which benefits from reusing a single table of powers 10^(2^n) throughout the tree.

If you implement the same method using GMP's low-level interface you should be able to get close to GMP's speed. If you're comparing GMP against something not based on GMP, probably the difference is that GMP's basic arithmetic is much faster. Other than that, memory management could be an issue; GMP does careful temporary allocations on the stack and tries hard to reuse scratch space within an algorithm; you definitely don't want any mallocs anywhere near the basecase code.

Best,
Fredrik

--

---
You received this message because you are subscribed to the Google Groups "flint-devel" group.
To unsubscribe from this group and stop receiving emails from it, send an email to flint-devel...@googlegroups.com.
To view this discussion on the web, visit https://groups.google.com/d/msgid/flint-devel/c2d29cee-6976-4a3c-8cca-0c7c134a8e01n%40googlegroups.com.

Niek Bouman

unread,

Nov 15, 2023, 7:04:26 AM11/15/23

to flint-devel

hi Fredrik,

thanks for you prompt response..!!
Niek

Reply all

Reply to author

Forward

background info on algorithm used for fmpz_set_str

Niek Bouman

Fredrik Johansson

Niek Bouman