NTL 9.7.1

84 views

Skip to first unread message

Victor Shoup

unread,

Apr 20, 2016, 5:08:30 PM4/20/16

to sage-devel

I uploaded a new NTL version (9.7.1) to

http://www.shoup.net/ntl

This version completes the implementation of faster matrix

arithmetic (mul, inv, gauss, etc) modulo small primes.

These new implementations are more cache friendly, and they

make more intelligent use of available hardware (e.g., AVX).

They also can be accelerated in a multicore environment.

I've run some tests that show that this implementation is "not too

bad" compared to FFLAS/FFPACK based on OpenBLAS.

I've also updated my NTL vs FLINT comparisons to include

some benchmarks for matrix arithmetic mod small primes.

You can also see those results at

http://www.shoup.net/ntl/benchmarks.pdf

Bill Hart

unread,

May 2, 2016, 2:02:46 PM5/2/16

to sage-devel

Thanks for sharing this Victor. It looks like there are some additional really good speedups there.

Flint has always had just reasonable matrix code, nothing extraordinary.

I put quite a bit of work into the recent minpoly and charpoly code I put into Flint (which NTL was already very good at). But I didn't do a comparison with anything except Sage and Magma yet.

I definitely want to spend some time going through your benchmarks and improving Flint, but as usual, so many other things need doing that it probably won't happen soon. Much of what we have done in Flint is just out-of-date for modern CPU's, especially with AVX. For example, I doubt we should be using strassen. Many of our tuning cutoffs are also way out of date. This is particularly bad as Flint has no tuning code.

Most of the recent work in Flint has been to add lots of small utility functions, to write better documentation and to make Flint more reliable (asserts, better test code, etc.)