
Shouldn't intrinsic SUM avoid overflow?


evan

Sep 2, 2016, 1:29:11 PM
Some intrinsic routines, e.g. NORM2, try to avoid overflow, but SUM does not. Shouldn't it? Consider the program below:

double precision, parameter :: x = huge(1d0)
print*, sum((/x,-x/))
print*, sum((/x,x,-x,-x/))
print*, sum((/x,-x,1d0/))
print*, sum((/1d0,x,-x/))
end


With gfortran 4.6.3 I get:

print*, sum((/x,x,-x,-x/))
1
Error: Arithmetic overflow at (1)
f951: internal compiler error: Segmentation fault
Please submit a full bug report,
with preprocessed source if appropriate.
See <file:///usr/share/doc/gcc-4.6/README.Bugs> for instructions.


With ifort 14.0.0 20130728 the program compiles but gives the wrong answer in some cases:

./a.out
0.000000000000000E+000
Infinity
1.00000000000000
0.000000000000000E+000

Richard Maine

Sep 2, 2016, 2:05:13 PM
evan <evan...@gmail.com> wrote:

> Some intrinsic routines, e.g. NORM2 tries to avoid overflow but not SUM.
> Shouldn't it?

What's the cost/benefit tradeoff? For SUM, it seems like a pretty rare
corner case to have something that would overflow if done naively, but
could be massaged to give a meaningful result. I'd have thought that
avoiding loss of precision in SUM would be a more important issue in
practice if one were to go after more sophisticated implementations.

NORM2, on the other hand, can much more easily overflow.

I assume, by the way, that your "shouldn't" refers to quality of
implementation rather than to requirements of the standard.
If you are actually asking about whether the standard requires such
things, that answer is easy. No, it doesn't.

--
Richard Maine
email: last name at domain . net
domain: summer-triangle

kargl

Sep 2, 2016, 2:51:13 PM
evan wrote:

> Some intrinsic routines, e.g. NORM2 tries to avoid overflow but not SUM. Shouldn't it?

What algorithm do you propose a compiler should use? How slow and
memory hungry an implementation are you willing to endure?

--
steve

Louis Krupp

Sep 2, 2016, 4:59:00 PM
On Fri, 2 Sep 2016 10:29:08 -0700 (PDT), evan <evan...@gmail.com>
wrote:

>Some intrinsic routines, e.g. NORM2 tries to avoid overflow but not SUM. Shouldn't it? Consider the program below
>
>double precision, parameter :: x = huge(1d0)
>print*, sum((/x,-x/))
>print*, sum((/x,x,-x,-x/))
>print*, sum((/x,-x,1d0/))
>print*, sum((/1d0,x,-x/))
>end
>
>
>With gfortran 4.6.3 I get:
>
>print*, sum((/x,x,-x,-x/))
> 1
>Error: Arithmetic overflow at (1)
>f951: internal compiler error: Segmentation fault
>Please submit a full bug report,
>with preprocessed source if appropriate.
>See <file:///usr/share/doc/gcc-4.6/README.Bugs> for instructions.

The same internal compiler error happens with gfortran version 7.0.0
20160902 (experimental).

See bug 77641.

Louis

Louis Krupp

Sep 2, 2016, 6:05:54 PM
Which was a duplicate of 77640, so see bug 77640 instead.

Louis

robert....@oracle.com

Sep 6, 2016, 10:58:06 PM
All of the sums printed seem reasonable to me. Remember that floating-point numbers are not the real numbers of mathematics.

Robert Corbett

herrman...@gmail.com

Sep 8, 2016, 3:47:02 PM
On Friday, September 2, 2016 at 10:29:11 AM UTC-7, evan wrote:
> Some intrinsic routines, e.g. NORM2 tries to avoid overflow but not SUM.
> Shouldn't it? Consider the program below

For NORM2, there is a convenient way to avoid the overflow of the intermediates from doing the square, and also it is fairly easy to overflow when squaring.

As Richard notes for SUM, I suspect that the standard doesn't require that, but it is an implementation consideration. But you say "tries". Presumably it can still overflow.

Somewhat in general, if you want something more than the usual way of doing
something, you should do it yourself. It might be nice if SUM did something
like the Kahan summation:

https://en.wikipedia.org/wiki/Kahan_summation_algorithm

especially since it isn't always so easy to do yourself.
(Some optimizations can mess up the algorithm.)

I don't think that helps your case, though.
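
The "convenient way" is usually to scale by the largest magnitude before
squaring. A minimal sketch of that idea (not what any particular
compiler's NORM2 actually does; the function name is just for
illustration):

   real*8 function scaled_norm2 (x, n)
   integer*4 n, i
   real*8 x(n), m, s
   m = maxval(abs(x))
   if (m == 0) then
      scaled_norm2 = 0
      return
   end if
   s = 0
   do i = 1,n
      s = s + (x(i)/m)**2        ! each scaled term is <= 1, so squaring cannot overflow
   end do
   scaled_norm2 = m * sqrt(s)    ! overflows only if the true norm itself overflows
   end function scaled_norm2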

robert....@oracle.com

Sep 9, 2016, 1:48:16 AM
On Thursday, September 8, 2016 at 12:47:02 PM UTC-7, herrman...@gmail.com wrote:
> On Friday, September 2, 2016 at 10:29:11 AM UTC-7, evan wrote:
> > Some intrinsic routines, e.g. NORM2 tries to avoid overflow but not SUM.
> > Shouldn't it? Consider the program below
>
> For NORM2, there is a convenient way to avoid the overflow of the intermediates from doing the square, and also it is fairly easy to overflow when squaring.
>
> As Richard notes for SUM, I suspect that the standard doesn't require that, but it is an implementation consideration. But you say "tries". Presumably it can still overflow.
>
> Somewhat in general, if you want something more than the usual way of doing
> something, you should do it yourself. It might be nice if SUM did something
> like the Kahan summation:

I tried something like that in release 2.0 of Sun Fortran 90. It proved not to be viable. Sun's support engineers reported that we were being killed on customer benchmarks, which led to a patch that put an end to that feature.

Robert Corbett

campbel...@gmail.com

Sep 10, 2016, 12:53:55 AM
I am interested to see how Kahan's algorithm might conflict with a "smart"
compiler so this is my testing:

Chasing round-off errors with SUM can lead to a number of options with limited
practical gains.
* Kahan is one of them.
* Using a higher precision accumulator is another approach (probably simpler).

I have written a simple test of Kahan's algorithm using real*4 precision
(listed below) and compared this to a real*8 accumulator.

I have used gFortran 6.1.0 to see what smart compiler conflicts there may be
and also tested a number of different compiler options:
set options=-O1 -mavx
set options=-O2 -mavx
set options=-O3 -mavx
set options=-O3 -mavx -ffast-math
set options=-O3 -mavx -ffast-math -funroll-loops --param max-unroll-times=2

My .bat test loop is:
del %1.exe
del %1.o
gfortran %1.f90 %options% -o %1.exe
set options >> %1.log
%1 >> %1.log

My test tried 4 approaches to summing a list of real*4 values.
si = SUM intrinsic
s4 = real*4 DO loop
k4 = Kahan's algorithm for real*4
s8 = real*8 accumulator

The results I found were
-ffast-math messed with Kahan's algorithm, but
changing optimisation with -Ox doesn't (no smarts identified)

From this test case, using a real*8 accumulator appears to be the most robust
approach.

In the past when using 8086/8087 there was the uncertainty of what values were
retained in 80-bit registers.
These latest tests with -mavx appear to show no mixing of 4-byte and 8-byte
registers.

When considering other cases, the results are very sensitive to the type of
accumulated rounding error that is occurring. The error-analysis claims in the
Wiki article look wrong to me, as I find that the cases where you may want to
improve accuracy are those where the error accumulation is unusual.
Another problem is what you should make of the result, given the
accuracy of the original data.

The test case I used is summing a set of +ve random numbers in the range 0,1.
Round-off error swamps the result at about 10^7 values.
Others may be interested in using the following test or adapting to a different
set of values with different round-off characteristics.

The test code I used is:
! program to test KahanSum
!
real*4 function KahanSum (values, n)
! https://en.wikipedia.org/wiki/Kahan_summation_algorithm
   integer*4 n
   real*4 values(n)
!
   integer*4 i
   real*4 sum, c, y, t
!
   c = 0
   sum = 0
   do i = 1,n
      y = values(i) - c    ! So far, so good : c is zero
      t = sum + y          ! Alas, sum is big, Y small, so low-order digits of y are lost
      c = (t - sum) - y    ! (t-sum) cancels the high-order part of y; subtracting y recovers negative (low part of y)
      sum = t              ! Algebraically, c should always be zero. Beware overly-aggressive optimizing compilers !
   end do                  ! Next time around, the low part will be added to y in a fresh attempt.
!
   KahanSum = sum
end function KahanSum

real*4 function Sum8 (values, n)
   integer*4 n
   real*4 values(n)
!
   integer*4 i
   real*8 sum, y
!
   sum = 0
   do i = 1,n
      y = values(i)
      sum = sum + y
   end do
!
   Sum8 = sum
end function Sum8

real*4 function Sum4 (values, n)
   integer*4 n
   real*4 values(n)
!
   integer*4 i
   real*4 sum, y
!
   sum = 0
   do i = 1,n
      y = values(i)
      sum = sum + y
   end do
!
   Sum4 = sum
end function Sum4

Program Kahan_test
!
   integer*4 n, k
   real*4, allocatable :: values(:)
   real*4 sum4, sum8, KahanSum, s8, k4, s4, si
!
   call report_options
!
   do k = 2,30
      n = 2**k
      allocate ( values(n) )
      call random_number ( values )
      s8 = sum8 ( values, n )
      k4 = KahanSum ( values, n )
      s4 = sum4 ( values, n )
      si = sum ( values )
      write (*,fmt='(i10,7es11.3)') n, si, s4, k4, s8, abs(si-s8), abs(s4-s8), abs(k4-s8)
      deallocate (values)
   end do
!
end

subroutine report_options
   use ISO_FORTRAN_ENV
   write (*,*) 'Compiler: ',compiler_version ()
   write (*,*) 'Options : ',compiler_options ()
end subroutine report_options

kargl

Sep 10, 2016, 12:59:41 AM
campbel...@gmail.com wrote:

> The results I found were
> -ffast-math messed with Kahan's algorithm, but
> changing optimisation with -Ox doesn't (no smarts identified)

Using -ffast-math is quite possibly the dumbest thing for anyone
to use if she or he is concerned about numerical issues. The
math may indeed be fast, but it is typically wrong.

--
steve

campbel...@gmail.com

Sep 10, 2016, 1:27:44 AM
Steve,

All floating point calculations are "wrong" but how wrong is more important and is it tolerable. There are no absolutes with this.
I have been using -ffast-math and getting acceptable accuracy, so no it is not dumb.

kargl

Sep 10, 2016, 1:51:36 AM
Either you don't understand what -ffast-math does or you don't
understand the carefully written Kahan summation algorithm.
Yes, FP arithmetic must be done with care to get any reasonable
approximation to the "correct" answer. Using a compiler option
that by design undoes a carefully written FP algorithm, because
the option allows a compiler to violate the semantics of the
language, and then appearing to be surprised by one's findings
is, well, dumb.

If you take the time to read the GNU GCC documentation, you'll
find that none of the -Ox options turn on -ffast-math except for
-Ofast, which explicitly states that it does. If you then read about
-ffast-math, you'll find that it is explicitly stated that this option
can lead to wrong results.

--
steve

Ron Shepard

Sep 10, 2016, 1:05:11 PM
do i = 1, n
y = values(i) - c
t = sum + y
c = (t - sum) - y
sum = t
end do

Just because an option CAN lead to wrong results does not mean that it
necessarily DOES lead to wrong results for a particular code. In this
case, the results were wrong with this option, which is a useful piece
of information, and I don't think anyone following this thread is
"surprised" by this outcome.

-Ofast and -ffast-math do several things that would not affect the
results of this particular code, such as ignoring interrupts, Inf, NaN,
reciprocal approximation, and gradual underflow arithmetic, all of which
do affect performance. One thing that it enables that does affect this
code is to allow fortran parentheses to be ignored during subexpression
optimization. The references to and the evaluation of "c" within the
loop might be completely removed as the result of such an optimization.
I'm guessing that a scan of the intermediate code or of the assembler
code by someone who is fluent in those things might show this.

The Kahan summation algorithm replaces a loop with a single fp addition
with a loop that requires four additions, and those four additions
cannot be optimized away while retaining the benefit of the algorithm.
There are other ways for a compiler to screw up the algorithm too. For
example, if there are multiple functional units with different rounding
or truncation properties (such as some fp instructions done with
registers while others are done with sse/avx hardware, or some fp
operations done with 80-bit registers and others done with 64-bit
registers), then the carryover of the rounding errors from one cycle to
the next would not be performed correctly.

$.02 -Ron Shepard
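
In other words, once reassociation is allowed, c = ((sum + y) - sum) - y
simplifies algebraically to zero, so what is effectively left is a plain
sum (a sketch of the simplification Ron describes, not actual compiler
output):

   do i = 1,n
      sum = sum + values(i)   ! the correction term c has been folded away
   end do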

campbel...@gmail.com

Sep 10, 2016, 10:52:14 PM
I tested the following gFortran compile options:
set options=-O1
set options=-O1 -mavx
set options=-O2 -mavx
set options=-O3 -mavx
set options=-O3 -mavx -ffast-math
set options=-O3 -mavx -ffast-math -funroll-loops --param max-unroll-times=2

What is interesting about the testing I did was that -ffast-math was the only compile option that changed the results.
While it nullified the KahanSum algorithm approach, the use of -ffast-math actually improved the precision of the SUM intrinsic and the real*4 DO loop.
This outcome may be limited to the particular case I have selected, where all values are +ve, but it appears -ffast-math may be rounding up, which is good for this test.

I have identified few practical calculations where -ffast-math is a problem. Apart from describing -ffast-math as a dumb approach, or the optimising it applies to Kahan, can anyone suggest where this option should not be used?
For real*8 computation involving large vectors, there is a doubling of speed with options=-O3 -mavx -ffast-math. This makes it a preferred choice for most computation, and the failure of the Kahan algorithm is the only code example I have of it failing.

The testing also demonstrates that choosing a higher precision accumulator is a simpler approach for real*4 calculation, although there is a high performance price for real*8 calculation when using a real*16 accumulator. When real*10 was available via 8087, this was a viable option.

As with most cases, if overflow is a problem, the blame lies with the algorithm and not the processor.

Tim Prince

Sep 11, 2016, 7:47:41 AM
-ffast-math (along with -O3 or -O2 -ftree-vectorize) enables simd
parallel optimization of sum reduction. The batched sums usually
improve accuracy, but not by a predictable degree. Other compilers, like
ifort, batch sums more aggressively than gfortran.
gfortran should not violate parentheses even with -ffast-math unless
-fno-protect-parens is set. gcc, however, doesn't have the option to
observe parentheses under fast-math.
icc only recently introduced an option to observe parentheses while
performing default optimizations. Intel compiler users trip up
frequently over the default violation of parentheses.
Inconsistent violation of parentheses breaks Kahan's algorithm entirely,
as opposed to ignoring them in a consistent manner as Cray Fortran used
to do, which could reduce Kahan's algorithm to a correct unprotected sum.

herrman...@gmail.com

Sep 11, 2016, 8:19:57 PM
On Friday, September 9, 2016 at 10:27:44 PM UTC-7, campbel...@gmail.com wrote:
> On Saturday, September 10, 2016 at 2:59:41 PM UTC+10, kargl wrote:

(snip)

> > Using -ffast-math is quite possibly the dumbest thing for anyone
> > to use if she or he is concerned about numerical issues. The
> > math may indeed be fast, but it is typically wrong.


> All floating point calculations are "wrong" but how wrong is more important
> and is it tolerable. There are no absolutes with this. I have been using
> -ffast-math and getting acceptable accuracy, so no it is not dumb.

Some problems are much more sensitive to numerical instability than
others. Sometimes you know about the problem you are working on,
and other times you don't.

I do find it interesting that the precision we have for single and double
has stayed about the same for so many years, yet the problems are
getting bigger.

Many matrix problems are nice and stable when small, but much less so
when they get large. Some improvement in algorithms over the years
has allowed avoiding the need for even more precision.

It is probably best not to use -ffast-math until you actually know that
you need the speed, and then check to be sure that the problem
is still stable.

herrman...@gmail.com

Sep 11, 2016, 8:24:42 PM
On Friday, September 9, 2016 at 9:53:55 PM UTC-7, campbel...@gmail.com wrote:
> I am interested to see how Kahan's algorithm might conflict with a "smart"
> compiler so this is my testing:

> Chasing round-off errors with SUM can lead to a number of options
> with limited practical gains.

> * Kahan is one of them.
> * Using a higher precision accumulator is another approach (probably simpler).

I believe that the extended (temporary real) format Intel implemented in the 8087
was suggested by Kahan.

Note that the x87 form has more exponent bits, so, for the OP's problem, you
can add many huge(1.d0) without overflowing.

There are also summation algorithms that reorder (sort) the data before adding,
to minimize lost bits.

Tim Prince

Sep 12, 2016, 8:36:12 AM
On 9/11/2016 8:24 PM, herrman...@gmail.com wrote:
> On Friday, September 9, 2016 at 9:53:55 PM UTC-7, campbel...@gmail.com wrote:
>> I am interested to see how Kahan's algorithm might conflict with a "smart"
>> compiler so this is my testing:
>
>> Chasing round-off errors with SUM can lead to a number of options
>> with limited practical gains.
>
>> * Kahan is one of them.
>> * Using a higher precision accumulator is another approach (probably simpler).
>
> I believe that the extended (temporary real) format Intel implemented in the 8087
> was suggested by Kahan.
The default violation of parentheses in Intel compilers seems to be
designed for 8087 extra precision, even though 8087 is no longer
supported in the compilers for 64-bit mode, and normal default practice
is to set 53-bit precision. Cray Fortran could remove the Kahan guards
and vectorize, getting the same results as a plain vectorizable reduction.
>
> Note that the x87 form has more exponent bits, so, for the OP's problem, you
> can add many huge(1.d0) without overflowing.
Windows X64 sets x87 precision mode to 53 bits before launching a .exe
so you will see the large exponent range but not the extra precision in
expression evaluation.
>
> There are also summation algorithms that reorder (sort) the data before adding,
> to minimize lost bits.
>
Sorting also is time consuming and doesn't appear to give as good
results as Kahan sum.

herrman...@gmail.com

Sep 12, 2016, 11:39:24 AM
On Monday, September 12, 2016 at 5:36:12 AM UTC-7, Tim Prince wrote:

(snip, I wrote)

> > I believe that the extended (temporary real) format Intel
> > implemented in the 8087 was suggested by Kahan.

> The default violation of parentheses in Intel compilers seems to be
> designed for 8087 extra precision, even though 8087 is no longer
> supported in the compilers for 64-bit mode, and normal default practice
> is to set 53-bit precision.

The precision specifier doesn't apply to all operations, but I
forget now which ones it applies to.

The additional complication with the 8087 is the number of stack
registers. The original design was for a virtual stack, where it would
spill to/from memory at overflow/underflow. That never worked for
the 8087. I don't know that it was ever fixed. So, compilers always
work with only an 8 register stack.

> Cray Fortran could remove the Kahan guards
> and vectorize, getting the same results as a plain vectorizable reduction.

(I also wrote)
> > Note that the x87 form has more exponent bits, so, for the OP's problem, you
> > can add many huge(1.d0) without overflowing.

> Windows X64 sets x87 precision mode to 53 bits before launching a .exe
> so you will see the large exponent range but not the extra precision in
> expression evaluation.

Again, I forget which operations that applies to.

> > There are also summation algorithms that reorder (sort) the
> > data before adding, to minimize lost bits.

> Sorting also is time consuming and doesn't appear to give as good
> results as Kahan sum.

For the more usual case, I suspect sorting isn't best, but for cases with
exact cancelation, like the OP, it might be better.


Tim Prince

Sep 12, 2016, 11:54:32 AM
On 9/12/2016 11:39 AM, herrman...@gmail.com wrote:
> On Monday, September 12, 2016 at 5:36:12 AM UTC-7, Tim Prince wrote:
>
> (snip, I wrote)
>
>>> I believe that the extended (temporary real) format Intel
>>> implemented in the 8087 was suggested by Kahan.
>
>> The default violation of parentheses in Intel compilers seems to be
>> designed for 8087 extra precision, even though 8087 is no longer
>> supported in the compilers for 64-bit mode, and normal default practice
>> is to set 53-bit precision.
>
> The precision specifier doesn't apply to all operations, but I
> forget now which ones it applies to.
Mainly, the x87 math intrinsics aren't affected by precision
specification, so can't be speeded up by reducing precision, as divide
and sqrt can.

evan

Sep 12, 2016, 2:02:30 PM
Thank you for pointing out the Kahan algorithm.

I thought I may run into overflow problems with SUM when writing a program to compute the average of M numbers stored in the array X.

I ended up computing the SUM(X) and testing if it overflows. If not, then I divide by DBLE(M); otherwise I compute a cumulative moving average:
https://en.wikipedia.org/wiki/Moving_average#Cumulative_moving_average
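
For reference, a minimal sketch of that fallback (the function name is
made up here; IEEE_IS_FINITE is the standard test from the intrinsic
module IEEE_ARITHMETIC):

   function safe_mean (x, m) result (avg)
   use ieee_arithmetic, only : ieee_is_finite
   integer m, i
   double precision x(m), avg, s
   s = sum(x)
   if (ieee_is_finite(s)) then
      avg = s / dble(m)
   else
      ! cumulative moving average: avg_i = avg_{i-1} + (x_i - avg_{i-1}) / i
      avg = 0d0
      do i = 1,m
         avg = avg + (x(i) - avg) / dble(i)
      end do
   end if
   end function safe_mean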

Gordon Sande

Sep 12, 2016, 2:47:17 PM
On 2016-09-12 18:02:24 +0000, evan said:

> Thank you for pointing out the Kahan algorithm.
>
> I thought I may run into overflow problems with SUM when writing a
> program to compute the average of M numbers stored in the array X.

There are algorithms based on finding tentative averages which go a long way
towards solving this problem. The more common variant is the algorithm for
finding the average and variance using a tentative mean.

Thomas Koenig

Sep 12, 2016, 3:00:44 PM
Ron Shepard <nos...@nowhere.org> schrieb:

> do i = 1, n
> y = values(i) - c
> t = sum + y
> c = (t - sum) - y
> sum = t
> end do

Well, you _could_ get the effect with this if you specified y,
t and c (and possibly sum) as VOLATILE, even with -ffast-math.

Looking at the generated assembly, this seems to do something,
but I am not sure it is 100% correct, because I don't have
test data to try the algorithm on.
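
For concreteness, a sketch of what that would look like for the KahanSum
function posted earlier (whether this fully survives -ffast-math would
still need checking, as noted above):

   real(4) function KahanSum_v (values, n)
   integer n, i
   real(4) values(n)
   real(4), volatile :: sum, c, y, t
   c = 0
   sum = 0
   do i = 1,n
      y = values(i) - c
      t = sum + y
      c = (t - sum) - y   ! VOLATILE forces each intermediate to be stored and
      sum = t             ! reloaded, so the correction cannot be folded away
   end do
   KahanSum_v = sum
   end function KahanSum_v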

Richard Maine

Sep 12, 2016, 3:46:20 PM
evan <evan...@gmail.com> wrote:

> I thought I may run into overflow problems with SUM when writing a program
> to compute the average of M numbers stored in the array X.

You had some serious basis for thinking this might happen or was it just
an idle thought that it might happen? You need to be summing awfully big
values in the first place for this to come up. It's hard to hit just
based on a large number of values because you are unlikely to have more
than a few billion values to sum even in extreme cases with current
hardware. That still leaves about 30 orders of magnitude or so to go,
which has to be covered by the individual values being pretty big. Your
example reasonably uses HUGE to get something to demonstrate the
question, but you expect it to be an issue with real data? Note that if
the real data does something like use HUGE as a flag value, the average
isn't likely to make much sense anyway. An average also isn't going to
make much sense if you have something like IEEE infinities running
around.

Overflow is a different issue than loss of precision. Sure loss of
precision can come up easily enough with SUM, and that's what the Kahan
algorithm is mostly about. But running into overflow seems a lot less
likely in practice, except in cases where the whole computation is
diverging and addressing just SUM wouldn't be of much help in the end.

Yes, it can happen in theory. It just doesn't seem likely to come up a
lot in practical use. As in rare enough not to be worth the overhead of
dealing with it in an intrinsic used for lots of things. If you have
that rare a case, better to roll your own code instead of just hoping
that a general-purpose intrinsic would address your rare case.

herrman...@gmail.com

Sep 13, 2016, 9:33:29 AM
On Monday, September 12, 2016 at 12:46:20 PM UTC-7, Richard Maine wrote:
> evan <evange...@gmail.com> wrote:

> > I thought I may run into overflow problems with SUM when writing a program
> > to compute the average of M numbers stored in the array X.

> You had some serious basis for thinking this might happen or was it just
> an idle thought that it might happen? You need to be summing awfully big
> values in the first place for this to come up. It's hard to hit just
> based on a large number of values because you are unlikely to have more
> than a few billion values to sum even in extreme cases with current
> hardware.

Single precision on many machines isn't that hard to overflow, but
you should probably always sum into a double precision variable, even
when summing single precision. (The SUM function might not do that.)

> That still leaves about 30 orders of magnitude or so to go,
> which has to be covered by the individual values being pretty big. Your
> example reasonably uses HUGE to get something to demonstrate the
> question, but you expect it to be an issue with real data? Note that if
> the real data does something like use HUGE as a flag value, the average
> isn't likely to make much sense anyway. An average also isn't going to
> make much sense if you have something like IEEE infinities running
> around.

Since we don't know at all where the data comes from, it is hard
to say. If, for example, you are doing a linear least squares fit to
a 10th degree polynomial (not always a good idea), then, if I remember,
it needs up to the 19th power of the x values.

Routines I remember from years ago, and sometimes on machines
with smaller HUGE(1.D0), would compute the mean, divide all the
x values by the mean, do the polynomial fit, then rescale the results.

Summing powers of input data, it isn't hard to overflow, but the
OP didn't really explain the data source.

> Overflow is a different issue than loss of precision. Sure loss of
> precision can come up easily enough with SUM, and that's what the Kahan
> algorithm is mostly about. But running into overflow seems a lot less
> likely in practice, except in cases where the whole computation is
> diverging and addressing just SUM wouldn't be of much help in the end.

It often helps to explain the actual problem. That is, the overall problem,
not just the sum. Is it high powers of some input data?

> Yes, it can happen in theory. It just doesn't seem likely to come up a
> lot in practical use. As in rare enough not to be worth the overhead of
> dealing with it in an intrinsic used for lots of things. If you have
> that rare a case, better to roll your own code instead of just hoping
> that a general-purpose intrinsic would address your rare case.

It might be nice for SUM to do a double precision sum on single
precision data, but otherwise an ordinary sum should be fine.

Gary Scott

Sep 13, 2016, 9:37:50 AM
I recently made a change to use SUM into a double precision result
array. I was concerned whether the summation process was actually
performed in single precision. Still don't know. I guess I could test it.

Tim Prince

Sep 13, 2016, 10:24:49 AM
You must promote the operand of SUM to double. Simple enough that I
don't see the point in any additional machinery.

herrman...@gmail.com

Sep 13, 2016, 11:48:26 AM
On Tuesday, September 13, 2016 at 7:24:49 AM UTC-7, Tim Prince wrote:
> On 9/13/2016 9:37 AM, Gary Scott wrote:

(snip, I wrote)
> >> It might be nice for SUM to do a double precision sum on single
> >> precision data, but otherwise an ordinary sum should be fine.

> > I recently made a change to use SUM into a double precision result
> > array. I was concerned whether the summation process was actually
> > performed in single precision. Still don't know. I guess I could test
> > it.

> You must promote the operand of SUM to double. Simple enough that I
> don't see the point in any additional machinery.

Not so easy if it is a large array.

Note that I didn't mean that the result of SUM should be double, only
that the sum should be accumulated in a double. And the sum for double
should be accumulated in a larger type, if available.

This is a quality of implementation issue, though as previously noted
it can slow down benchmarks. People do have to realize that getting
the right answer can take longer. Getting the wrong answer fast is
not usually the best way.

Gordon Sande

Sep 13, 2016, 1:14:11 PM
This used to be known as fl_2 arithmetic: double precision accumulation
of single precision operands. There is DPROD for doing double precision
accumulation of inner products. I suppose one could always do a DPROD with 1.0
to avoid space, at the price of spurious computation. (It might even get
optimized out!)
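
A minimal sketch of that trick (DPROD is elemental and returns double
precision, so the accumulation happens in double; whether a temporary
double precision array actually gets materialized is a quality of
implementation matter):

   program dprod_sum
   implicit none
   real x(1000000)
   double precision s
   call random_number(x)
   s = sum(dprod(x, 1.0))   ! element-wise dp products, summed in dp
   print *, s
   end program dprod_sum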

Gary Scott

Sep 13, 2016, 8:18:29 PM
These are multi-hundred megabyte arrays of single precision reals summed
either column wise or row wise depending. Didn't want to make a
duplicate as these calcs happen in real time while the data is edited,
following a GUI change callback.

Thomas Koenig

Sep 14, 2016, 2:37:17 AM
herrman...@gmail.com <herrman...@gmail.com> schrieb:

> Single precision on many machines isn't that hard to overflow, but
> you should probably always sum into a double precision variable, even
> when summing single precision. (The SUM function might not do that.)

It is certainly not required by the standard, and, since the result
characteristics are of the same type and kind parameters as its
argument, it is not to be expected either.

It might be a good idea to add a KIND argument, so that

REAL(kind=sp), dimension(:) :: a
REAL(kind=dp) :: b
b = SUM(a,KIND=dp)

does the automatic conversion of each value before summing up,
to get better precision and avoid overflow.

Is this on anybody's agenda?

herrman...@gmail.com

Sep 14, 2016, 11:14:03 AM
On Tuesday, September 13, 2016 at 11:37:17 PM UTC-7, Thomas Koenig wrote:

(I wrote)
> > Single precision on many machines isn't that hard to overflow, but
> > you should probably always sum into a double precision variable, even
> > when summing single precision. (The SUM function might not do that.)

> It is certainly not required by the standard, and, since the result
> characteristics are of the same type and kind parameters as its
> argument, it is not to be expected either.

Seems to me a quality of implementation issue. There are many
things not required by the standard, that we expect anyway.
This might not be quite as high on the list. On many machines,
it isn't any slower, and on some there is extra work to reduce the
precision.

Kahan summation is enough slower, or so it seems, that
one might not expect that.

I think you lose about log2(N)/2-1 bits, from random walk in
rounding the bits at each step. Even for ordinary sized N,
that can be pretty significant for single precision, and
noticeable for larger N. That is, even if you don't have
catastrophic bit loss, there is still statistical bit loss.

> It might be a good idea to add a KIND argument, so that

> REAL(kind=sp), dimension(:) :: a
> REAL(kind=dp) :: b
> b = SUM(a,KIND=dp)

> does the automatic conversion of each value before summing up,
> to get better precision and avoid overflow.

Yes, I wondered about that, too.

> Is this on anybody's agenda?

-- glen

Pascal

Sep 14, 2016, 11:25:19 AM
On 12/09/16 19:02, evan wrote:
> Thank you for pointing out the Kahan algorithm.
>
> I thought I may run into overflow problems with SUM when writing a program to compute the average of M numbers stored in the array X.
>
> I ended up computing the SUM(X) and testing if it overflows. If not then I divide by DBLE(M) otherwise I am computing a cumulative moving average
> https://en.wikipedia.org/wiki/Moving_average#Cumulative_moving_average

It is possible to calculate the average without summing all the elements:
<https://en.wikipedia.org/wiki/Algorithms_for_calculating_variance#Online_algorithm>

<x_n> = <x_{n-1}> + ( x_n - <x_{n-1}> ) / n

Pascal

herrman...@gmail.com

Sep 14, 2016, 11:39:19 AM
On Wednesday, September 14, 2016 at 8:25:19 AM UTC-7, Pascal wrote:

(snip on summation methods)

> It is possible to calculate the average without summing all the elements:
> <https://en.wikipedia.org/wiki/Algorithms_for_calculating_variance#Online_algorithm>

> <x_n> = <x_{n-1}> + ( x_n - <x_{n-1}> ) / n

Note that this can also improve the precision of the calculation.

Something similar is used for the last step of a Newton-Raphson
square root calculation on a system with floating point base other
than two. Specifically, for IBM S/360 and S/370 with hexadecimal
floating point.

campbel...@gmail.com

Sep 15, 2016, 5:49:45 AM
Based on later posts, I have now expanded the test program I posted earlier.

This later version tests a number of approaches for summing a large set of
real*4 numbers, using:
# intrinsic SUM
# Sum_4 : DO loop with real*4 accumulator
# Sum_8 : DO loop with real*8 accumulator
# Kahan_Sum error corrector
# mov_avg : moving average calculation
# Sum_OMP : DO loop with real*4 accumulator and !$OMP

I tested using gFortran Ver 6.1.0 with the following compile options:
set options=-O1
set options=-O1 -mavx -fopenmp
set options=-O2 -mavx -fopenmp
set options=-O3 -mavx -fopenmp
set options=-O3 -mavx -ffast-math -fopenmp
set options=-O3 -mavx -ffast-math -funroll-loops --param max-unroll-times=2 -fopenmp
set options=-O3 -mavx -ffast-math -funroll-loops --param max-unroll-times=4 -fopenmp

What is interesting is how the different coding approaches interact with the
different compile options.

In summary,
# loop unrolling did not change the result (unlike !$OMP)
# -ffast-math actually improved the accuracy for most cases except for Kahan_Sum
# both !$OMP and mov_avg had a similar improvement when combined with -ffast-math
# the SUM intrinsic and SUM_4 both perform poorly and are not affected by
-ffast-math or loop unrolling
# the only uniform improvement was to use a real*8 accumulator

I should qualify this summary, by noting that the set of values are random numbers in the range 0:1. This is a well behaved set of numbers and so only
tests this type of round-off error.
I have assumed that real*8 sum gives the most accurate estimate of the sum.
I have also not carried out an error analysis to identify the significance of
the inaccuracy of the real*4 values being summed.

In conclusion, while other approaches partially worked, the use of a higher
precision accumulator is the most practical solution for this type of
round-off error.
I would be interested to read any contrary views.

The following link provides:
the test code
the windows .bat file
the run .log file
Hopefully others can reproduce the results I have provided.

https://www.dropbox.com/s/h2kamnh6bk2fjen/Kahan_Sum.zip?dl=0

Tim Prince

Sep 15, 2016, 8:05:07 AM
Who asked for a duplicate array? Promote element-wise within SUM. It
should vectorize on any simd CPU (still requiring gfortran -ffast-math
or the like).

Gary Scott

Sep 15, 2016, 9:05:01 AM
:) if I knew what this means, I could respond...lol I don't know how to
get the SUM intrinsic to "promote element wise". My return array is
double precision, but I don't know that that has any bearing on how SUM
behaves in determining the sum for each row or column.

Tim Prince

Sep 15, 2016, 10:05:12 AM
SUM(real(x,selected_real_kind(12)))
Evidently, the promotion increases latency (where there are actual
operations involved), but doesn't necessarily hurt throughput.

herrman...@gmail.com

Sep 15, 2016, 12:44:29 PM
On Thursday, September 15, 2016 at 7:05:12 AM UTC-7, Tim Prince wrote:
> On 9/15/2016 9:05 AM, Gary Scott wrote:
> > On 9/15/2016 7:02 AM, Tim Prince wrote:

(snip)
> >> Who asked for a duplicate array? Promote element-wise within SUM. It
> >> should vectorize on any simd CPU (still requiring gfortran -ffast-math
> >> or the like).

> > :) if I knew what this means, I could respond...lol I don't know how to
> > get the SUM intrinsic to "promote element wise". My return array is
> > double precision, but I don't know that that has any bearing on how SUM
> > behaves in determining the sum for each row or column.

> SUM(real(x,selected_real_kind(12)))
> Evidently, the promotion increases latency (where there are actual
> operations involved), but doesn't necessarily hurt throughput.

It is, then, a quality of implementation issue not to generate a
temporary array of the appropriate kind, to pass to sum.

In the early days of array expressions, compilers weren't so good
at optimizing them, especially more complicated expressions.

I don't know at all how compilers are with this one.

-- glen

herrman...@gmail.com

Sep 15, 2016, 12:54:22 PM
On Thursday, September 15, 2016 at 2:49:45 AM UTC-7, campbel...@gmail.com wrote:
> Based on later posts, I have now expanded the test program I posted earlier.

> This later version tests a number of approaches for summing a large set of
> real*4 numbers, using:
> # intrinsic SUM
> # Sum_4 : DO loop with real*4 accumulator
> # Sum_8 : DO loop with real*8 accumulator
> # Kahan_Sum error corrector
> # mov_avg : moving average calculation
> # Sum_OMP : DO loop with real*4 accumulator and !$OMP

There is another one that would be interesting to see, which
is the sum that is done by the FFT algorithm.

It is often said to be more accurate than a direct sum, though
that might depend statistically on the data that people use
the FFT for.

That is, sum pairs, then pairs of those pairs, then pairs of ...
With an array of O(log n) elements to store intermediates,
it seems that it should be possible to do it in one pass down the
input array. I didn't try to write that, though.
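
For what it's worth, a sketch of that one-pass scheme, using a small
array of partial sums as a binary counter (the names and the fixed
64-level limit are just for illustration):

   real*4 function tree_sum (values, n)
   integer*4 n, i, level
   real*4 values(n)
   real*4 partial(64), s
   logical occupied(64)
   occupied = .false.
   do i = 1,n
      s = values(i)
      level = 1
      do while (occupied(level))   ! merge equal-sized partial sums, like binary carries
         s = s + partial(level)
         occupied(level) = .false.
         level = level + 1
      end do
      partial(level) = s
      occupied(level) = .true.
   end do
   s = 0   ! fold whatever partial sums remain (n need not be a power of two)
   do level = 1,64
      if (occupied(level)) s = s + partial(level)
   end do
   tree_sum = s
   end function tree_sum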

(snip)

> # the only uniform improvement was to use a real*8 accumulator

> I should qualify this summary, by noting that the set of values are random
> numbers in the range 0:1. This is a well behaved set of numbers and so only
> tests this type of round-off error.
> I have assumed that real*8 sum gives the most accurate estimate of the sum.

For values of similar magnitude, that should be true until about 2**(53-24)
values are being summed.

> I have also not carried out an error analysis to identify the significance of
> the inaccuracy of the real*4 values being summed.

> In conclusion, while other approaches partially worked, the use of a higher
> precision accumulator is the most practical solution for this type of
> round-off error.
> I would be interested to read any contrary views.

-- glen

Gordon Sande

Sep 15, 2016, 1:04:00 PM
On 2016-09-15 16:54:20 +0000, herrman...@gmail.com said:

> On Thursday, September 15, 2016 at 2:49:45 AM UTC-7,
> campbel...@gmail.com wrote:
>> Based on later posts, I have now expanded the test program I posted earlier.
>
>> This later version tests a number of approaches for summing a large set of
>> real*4 numbers, using:
>> # intrinsic SUM
>> # Sum_4 : DO loop with real*4 accumulator
>> # Sum_8 : DO loop with real*8 accumulator
>> # Kahan_Sum error corrector
>> # mov_avg : moving average calculation
>> # Sum_OMP : DO loop with real*4 accumulator and !$OMP
>
> There is another one that would be interesting to see, which
> is the sum that is done by the FFT algorithm.
>
> It is often said to be more accurate than a direct sum, though
> that might depend statistically on the data that people use
> the FFT for.
>
> That is, sum pairs, then pairs of those pairs, then pairs of ...
> With an array of O(log n) elements to store intermediates,
> it seems that it should be possible to do going down the
> input array. I didn't try to write that, though.

If you just want to do a tree summation then take a look at quick sort
for an example of doing tree recursion. Converting the sort to a sum
is left as an exercise for the reader. But you will still have the
recursion overhead.

herrman...@gmail.com

Sep 15, 2016, 5:29:34 PM
On Thursday, September 15, 2016 at 10:04:00 AM UTC-7, Gordon Sande wrote:

(snip, I wrote)

> > There is another one that would be interesting to see, which
> > is the sum that is done by the FFT algorithm.

> > It is often said to be more accurate than a direct sum, though
> > that might depend statistically on the data that people use
> > the FFT for.

> > That is, sum pairs, then pairs of those pairs, then pairs of ...
> > With an array of O(log n) elements to store intermediates,
> > it seems that it should be possible to do going down the
> > input array. I didn't try to write that, though.

> If you just want to do a tree summation then take a look at quick sort
> for an example of doing tree recursion. Converting the sort to a sum
> is left as an exercise for the reader. But you will still have the
> recursion overhead.

It is usual not to write quicksort using actual recursion, but instead
with loops and a small array to keep track of the recursion points.
Since the maximum stack depth, done right, is O(log N) it doesn't
take a big array.

For this, too, it would be nice to avoid the recursion call overhead,
though I didn't actually try writing one. I had forgotten that it
is called tree summation.

-- glen

Gordon Sande

Sep 15, 2016, 7:00:03 PM
On 2016-09-15 21:29:22 +0000, herrman...@gmail.com said:

> On Thursday, September 15, 2016 at 10:04:00 AM UTC-7, Gordon Sande wrote:
>
> (snip, I wrote)
>
>>> There is another one that would be interesting to see, which
>>> is the sum that is done by the FFT algorithm.
>
>>> It is often said to be more accurate than a direct sum, though
>>> that might depend statistically on the data that people use
>>> the FFT for.
>
>>> That is, sum pairs, then pairs of those pairs, then pairs of ...
>>> With an array of O(log n) elements to store intermediates,
>>> it seems that it should be possible to do going down the
>>> input array. I didn't try to write that, though.
>
>> If you just want to do a tree summation then take a look at quick sort
>> for an example of doing tree recursion. Converting the sort to a sum
>> is left as an exercise for the reader. But you will still have the
>> recursion overhead.
>
> It is usual not to write quicksort using actual recursion, but instead
> with loops and a small array to keep track of the recursion points.
> Since the maximum stack depth, done right, is O(log N) it doesn't
> take a big array.

Even managing the simulated recursion involves overhead.

herrman...@gmail.com

Sep 15, 2016, 7:21:10 PM
On Thursday, September 15, 2016 at 4:00:03 PM UTC-7, Gordon Sande wrote:

(snip)
> > On Thursday, September 15, 2016 at 10:04:00 AM UTC-7, Gordon Sande wrote:

(snip on tree summation, where I forgot the name of it)

> >> If you just want to do a tree summation then take a look at quick sort
> >> for an example of doing tree recursion. Converting the sort to a sum
> >> is left as an exercise for the reader. But you will still have the
> >> recursion overhead.

> > It is usual not to write quicksort using actual recursion, but instead
> > with loops and a small array to keep track of the recursion points.
> > Since the maximum stack depth, done right, is O(log N) it doesn't
> > take a big array.

> Even managing the simulated recursion involves overhead.

It does, but it should be less than real recursion. Maybe too many
people are still basing buying decisions on benchmarks, though.

I suppose less overhead if you do higher than a radix 2 (binary) tree.

campbel...@gmail.com

Sep 15, 2016, 7:58:14 PM
The recursive sum actually works very well with little identifiable overhead.
I used the following, although "4" could be changed.
if ( n > 4 ) then
   k = n/2
   s = Rsum (values,k) + Rsum (values(k+1),n-k)
else
   s = sum (values)
end if

The results are comparable to sum_8, although the selected "values" are optimal for this type of calculation.
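
Presumably that fragment sits inside a recursive function along these
lines (the declarations here are a guess at the wrapper, relying on
sequence association to pass values(k+1) as the tail of the array):

   recursive function Rsum (values, n) result (s)
   integer*4 n, k
   real*4 values(n), s
   if ( n > 4 ) then
      k = n/2
      s = Rsum (values,k) + Rsum (values(k+1),n-k)
   else
      s = sum (values)
   end if
   end function Rsum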

Gary Scott

Sep 15, 2016, 9:16:30 PM
ah, of course. I would have probably thought that might be time
consuming. My intent in switching from a loop to the SUM intrinsic was
to try to get a speedup, which it did, quite a bit; I'm not fully sure
why, though. Perhaps my optimization settings were not very aggressive.

herrman...@gmail.com

Sep 15, 2016, 10:14:27 PM
On Thursday, September 15, 2016 at 6:16:30 PM UTC-7, Gary Scott wrote:
> On 9/15/2016 9:02 AM, Tim Prince wrote:

(snip on SUM, timing, and result accuracy)

> >> :) if I knew what this means, I could respond...lol I don't know how to
> >> get the SUM intrinsic to "promote element wise". My return array is
> >> double precision, but I don't know that that has any bearing on how SUM
> >> behaves in determining the sum for each row or column.
> > SUM(real(x,selected_real_kind(12)))
> > Evidently, the promotion increases latency (where there are actual
> > operations involved), but doesn't necessarily hurt throughput.

> ah, of course. I would have probably thought that might be time
> consuming. My intent in switching from a loop to the SUM intrinsic was
> to try to get a speedup, which it did quite a bit, unsure fully why
> though, perhaps my optimization settings were not very aggressive.

Compilers are usually pretty good at optimizing DO loops, if the options
are turned on. In the beginning, and for some years after array expressions
were added, they weren't all that good at optimizing the more complicated
array expressions. They might be better now.

But partly it is that complicated array expressions, as written,
can be much harder to evaluate.

Say you want to find the first element of an array x whose sine
is greater than 0.7. You might:

n = findloc(sin(x) .gt. 0.7, .true., dim=1)

(Maybe there is a better array expression, that is the one
that I thought of first.)

With a DO loop, I can be pretty sure that it will exit the loop, and not
calculate any more sines, after the first one is found. I am much less
sure that the array expression will do that.
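
For comparison, the DO loop version would be something like this
(illustrative only), and it clearly stops computing sines at the first
hit:

   n = 0
   do i = 1, size(x)
      if (sin(x(i)) .gt. 0.7) then
         n = i
         exit   ! no further sines are evaluated
      end if
   end do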

If the optimizer does figure that out, I might make something
more complicated that it doesn't figure out.

-- glen

Tim Prince

Sep 16, 2016, 8:35:48 AM
In the basic gfortran vectorized case, double precision sum could take
twice as long as single, due to the vector width.

robin....@gmail.com

Sep 17, 2016, 9:16:32 AM
On Friday, September 16, 2016 at 2:44:29 AM UTC+10, herrman...@gmail.com wrote:
> On Thursday, September 15, 2016 at 7:05:12 AM UTC-7, Tim Prince wrote:
> > On 9/15/2016 9:05 AM, Gary Scott wrote:
> > > On 9/15/2016 7:02 AM, Tim Prince wrote:
>
> (snip)
> > >> Who asked for a duplicate array? Promote element-wise within SUM. It
> > >> should vectorize on any simd CPU (still requiring gfortran -ffast-math
> > >> or the like).
>
> > > :) if I knew what this means, I could respond...lol I don't know how to
> > > get the SUM intrinsic to "promote element wise". My return array is
> > > double precision, but I don't know that that has any bearing on how SUM
> > > behaves in determining the sum for each row or column.
>
> > SUM(real(x,selected_real_kind(12)))
> > Evidently, the promotion increases latency (where there are actual
> > operations involved), but doesn't necessarily hurt throughput.
>
> It is, then, a quality of implementation issue not to generate a
> temporary array of the appropriate kind, to pass to sum.

As the kind is being changed, a temporary array will be created.

If you want to avoid that, write your own SUM function,
using a dp accumulator and default real data.

robin....@gmail.com

Sep 17, 2016, 9:21:06 AM
On Friday, September 16, 2016 at 7:29:34 AM UTC+10, herrman...@gmail.com wrote:
> On Thursday, September 15, 2016 at 10:04:00 AM UTC-7, Gordon Sande wrote:
>
> (snip, I wrote)
>
> > > There is another one that would be interesting to see, which
> > > is the sum that is done by the FFT algorithm.
>
> > > It is often said to be more accurate than a direct sum, though
> > > that might depend statistically on the data that people use
> > > the FFT for.
>
> > > That is, sum pairs, then pairs of those pairs, then pairs of ...
> > > With an array of O(log n) elements to store intermediates,
> > > it seems that it should be possible to do going down the
> > > input array. I didn't try to write that, though.
>
> > If you just want to do a tree summation then take a look at quick sort
> > for an example of doing tree recursion. Converting the sort to a sum
> > is left as an exercise for the reader. But you will still have the
> > recursion overhead.
>
> It is usual not to write quicksort using actual recursion, but instead
> with loops and a small array to keep track of the recursion points.
> Since the maximum stack depth, done right, is O(log N) it doesn't
> take a big array.

As quick sort can degenerate, a large array could be needed.

Gordon Sande

Sep 17, 2016, 10:55:17 AM
On 2016-09-17 13:20:46 +0000, robin....@gmail.com said:

> On Friday, September 16, 2016 at 7:29:34 AM UTC+10, herrman...@gmail.com wrote:
>> On Thursday, September 15, 2016 at 10:04:00 AM UTC-7, Gordon Sande wrote:
>>
>> (snip, I wrote)
>>
>>>> There is another one that would be interesting to see, which
>>>> is the sum that is done by the FFT algorithm.
>>
>>>> It is often said to be more accurate than a direct sum, though
>>>> that might depend statistically on the data that people use
>>>> the FFT for.
>>
>>>> That is, sum pairs, then pairs of those pairs, then pairs of ...
>>>> With an array of O(log n) elements to store intermediates,
>>>> it seems that it should be possible to do going down the
>>>> input array. I didn't try to write that, though.
>>
>>> If you just want to do a tree summation then take a look at quick sort
>>> for an example of doing tree recursion. Converting the sort to a sum
>>> is left as an exercise for the reader. But you will still have the
>>> recursion overhead.
>>
>> It is usual not to write quicksort using actual recursion, but instead
>> with loops and a small array to keep track of the recursion points.
>> Since the maximum stack depth, done right, is O(log N) it doesn't
>> take a big array.
>
> As quick sort can degenerate, a large array could be needed.

Always stacking the longer segment is the standard fix for keeping the stack
at log(N). If you don't, you might stack N segments of size one in the worst case.