
negative zeros and dot_product / matmul


Thomas Koenig

May 16, 2010, 3:07:10 PM
Consider the following program:

program main
implicit none
real, dimension(1) :: a, b
real, dimension(1,1) :: aa, bb
a(1) = -1.0
b(1) = 0.0
print *,a(1)*b(1)
print *,dot_product(a,b)
aa(1,1) = -1.0
bb(1,1) = 0.0
print *,matmul(aa,bb)
end program main

Assuming that the processor supports signed zeros, is the following
result OK?

-0.0000000
0.0000000
0.0000000

robin

May 16, 2010, 11:01:51 PM
"Thomas Koenig" <tko...@netcologne.de> wrote in message news:hspfou$ofo$1...@newsreader5.netcologne.de...

Have you tried multiplying the result of a(1)*b(1) by, say, 1000,
or of printing 15 digits for a(1)*b(1)?


FX

May 17, 2010, 2:24:36 AM
> Have you tried multiplying the result of a(1)*b(1) by, say, 1000, or of
> printing 15 digits for a(1)*b(1)?

You are misled. The result really is a zero with a minus sign, and not a
very small negative number.

--
FX

robin

May 17, 2010, 6:06:00 AM
"FX" <cou...@alussinan.org> wrote in message news:hsqnf4$3v9$1...@nef.ens.fr...

|> Have you tried multiplying the result of a(1)*b(1) by, say, 1000, or of
| > printing 15 digits for a(1)*b(1)?
|
| You are misled.

No. It has not yet been determined that there is no compiler bug.

| The result really is a zero with a minus sign, and not a
| very small negative number.

You don't know that until you try what I suggested above.


m_b_metcalf

May 17, 2010, 6:20:39 AM

The standard states: "Processors that distinguish between positive and
negative zeros shall treat them as equivalent ... as actual arguments
to intrinsic functions other than SIGN ..."

Thus, the result seems correct (and I haven't a clue what Robin is
wittering on about).

Regards,

Mike Metcalf

m_b_metcalf

May 17, 2010, 6:29:12 AM
On May 17, 12:20 pm, m_b_metcalf <michaelmetc...@compuserve.com>
wrote:
> Mike Metcalf

Sorry, I take back my comment (misread the -1.0 as -0.0). But I still
don't understand Robin's remarks!

Tobias Burnus

May 17, 2010, 6:29:52 AM
On 05/16/2010 09:07 PM, Thomas Koenig wrote:
> Consider the following program:
> a(1) = -1.0
> b(1) = 0.0

> print *,a(1)*b(1)
> print *,dot_product(a,b)

> aa(1,1) = -1.0
> bb(1,1) = 0.0
> print *,matmul(aa,bb)

> Assuming that the processor supports signed zeros, is the following
> result OK?

Well, in terms of numerics, small deviations from the exact result can
be expected (cf. [for instance] the Goldberg paper); in this case the
deviation "(-0.0)-(0.0)" is zero and thus one can be happy.

> -0.0000000
> 0.0000000
> 0.0000000

I think the output also shows how the calculation is done; depending
on how it is done, the sign is lost (e.g. "0 + (-0)" - should this be
+0 or -0?). If the intrinsic is inlined and not handled via a call to
the run-time library, the chance is higher that the sign comes out as
expected, since the compiler can then fold the implicit loops of
dot_product and matmul into a simple multiplication and assignment.
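
For illustration, here is a minimal sketch (not any particular
compiler's actual library code) of how a run-time dot_product that
starts its accumulator at +0.0 loses the sign under the default
round-to-nearest mode:

program acc_sign
  implicit none
  real :: a(1), b(1), s
  integer :: i
  a(1) = -1.0
  b(1) = 0.0
  s = 0.0                ! accumulator starts at +0.0
  do i = 1, size(a)
    s = s + a(i)*b(i)    ! (+0.0) + (-0.0) rounds to +0.0
  end do
  print *, s             ! 0.0000000 - the sign is lost
  print *, a(1)*b(1)     ! -0.0000000 (if the processor prints the sign)
end program acc_sign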

Using gfortran and g95, I get the -/+/+ result, with NAG f95 -/-/+, and
with ifort (and the option "-assume minus0"!) I get -/-/-.* I think all
variants of the output are standard-conforming and - for real-world
code - one is doing something wrong if one worries about such
deviations.

Tobias

* Several compilers have an option for handling negative zeros
(gfortran: -fsign-zero); the option is needed not only for I/O but
also, and more importantly, for the SIGN intrinsic, as some Fortran 77
programs rely on a non-signed zero.
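
For example (a sketch; what it prints depends on the compiler and, for
gfortran, on -fsign-zero):

program sign_zero
  implicit none
  ! Under Fortran 2003 rules, SIGN(A,B) returns -|A| when B is a
  ! negative zero and the processor distinguishes signed zeros;
  ! under Fortran 77 rules the result is +|A|.
  print *, sign(1.0, -0.0)   ! -1.0 with signed zeros, 1.0 without
end program sign_zero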

FX

May 17, 2010, 6:35:03 AM
> | The result really is a zero with a minus sign, and not a
> | very small negative number.
>
> You don't know that until you try what I suggested above.

Well, the possibility of a compiler bug is not the question Thomas asked.
He asked if a compiler giving this output would be considered
standard-conforming.

And also, I can argue that even after trying what you suggest, you still
have no further information, because you could be experiencing a bug in
the I/O library :)

--
FX

robin

May 17, 2010, 7:35:31 AM
"FX" <cou...@alussinan.org> wrote in message news:hsr64n$107t$1...@nef.ens.fr...

|> | The result really is a zero with a minus sign, and not a
| > | very small negative number.
| >
| > You don't know that until you try what I suggested above.
|
| Well, the possibility of a compiler bug is not the question Thomas asked.

He doesn't have to. It's a possibility.

| He asked if a compiler giving this output would be considered
| standard-conforming.

I know what he asked.


glen herrmannsfeldt

May 17, 2010, 8:12:57 AM
m_b_metcalf <michael...@compuserve.com> wrote:
> On May 16, 9:07 pm, Thomas Koenig <tkoe...@netcologne.de> wrote:
>> Consider the following program:

>> program main
>>   implicit none
>>   real, dimension(1) :: a, b
>>   real, dimension(1,1) :: aa, bb
>>   a(1) = -1.0
>>   b(1) = 0.0
>>   print *,a(1)*b(1)
>>   print *,dot_product(a,b)
>>   aa(1,1) = -1.0
>>   bb(1,1) = 0.0
>>   print *,matmul(aa,bb)
>> end program main

>> Assuming that the processor supports signed zeros, is the following
>> result OK?

>>   -0.0000000
>>    0.0000000
>>    0.0000000

> The standard states; "Processors that distinguish between positive and
> negative zeros shall treat them as equivalent ... as actual arguments
> to intrinsic functions other than SIGN ..."

I read that one, and didn't even consider how it might apply here.

It seems to me that the standard at various points is trying
to guarantee that (-0.0).EQ.(+0.0), which is true, as far as
I know, on all processors that allow for a signed zero.
That is what I think of when they say "equivalent."
To me, "equivalent" does not mean bitwise identical.

I am not sure which processors generate (-0.0) as the product.
Most likely not all that generate one from -(0.0). (That is,
the unary negation operator.)

It seems to me that the natural way to write either dot_product
or matrix_multiply is to initialize a variable to zero and then
sum into that variable, which in this case gives (0.0)+(-0.0)
as the result.

The question left, then, as I see it, is what the I/O library
should print given a negative zero.



> Thus, the result seems correct (and I haven't a clue what Robin is
> wittering on about).

He often uses the "answer first, read question later" approach,
which seems to apply in this case. I suppose some machines might
give a very small, non-zero, result in this case. That would
seem to be very surprising, but maybe not impossible. The product,
in some sense, has zero significant digits.

This reminds me of a feature of the IBM Stretch that allowed one
to specify at run time whether to shift in zero or one when doing
floating point post-normalization. The idea was that one could
run a program both ways and compare the result. In some, but
not all, cases that would show up errors due to loss of
significance. It might even be possible that (-1.0) times (0.0)
would return (plus or minus) epsilon(1.0) on Stretch, though
that would surprise me.

-- glen

Ron Shepard

May 17, 2010, 11:21:06 AM
In article <hsrbs9$6us$1...@speranza.aioe.org>,
glen herrmannsfeldt <g...@ugcs.caltech.edu> wrote:

> It seems to me that the natural way to write either dot_product
> or matrix_multiply is to initialize a variable to zero and then
> sum into that variable, which in this case gives (0.0)+(-0.0)
> as the result.

If my memory is correct, matmul() is not defined in terms of
dot_product() in the standard, it is defined with sum(). I don't
remember how dot_product is defined, but it is possible that a
compiler might evaluate sum() and matmul() differently than
dot_product() in this respect. For example, one or the other might
treat the size()=1 case differently. Or one or the other might be
evaluated as

sum = a(1)*b(1)
do i = 2, n
   sum = sum + a(i)*b(i)
enddo

which could explain some of the differences observed in some of the
compilers. The above loop would preserve the negative zero, whereas
running the loop from 1 would destroy the negative zero.
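
(If matmul() is indeed specified via sum(), then element (i,j) of
matmul(aa,bb) is in effect

   cc(i,j) = sum(aa(i,:) * bb(:,j))

so its sign-of-zero behavior reduces to that of sum().)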

> The question left, then, as I see it, is what the I/O library
> should print given a negative zero.

Yes, particularly since list-directed I/O was used. To see the
bits, you might need to use a Z format, which although common is
nonstandard for real variables.
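
A standard-conforming alternative is to TRANSFER the value to an
integer and print that with Z (a sketch which assumes default REAL and
default INTEGER are both 32 bits, and a processor with signed zeros):

program show_bits
  implicit none
  real :: x
  x = -0.0
  ! Z editing of an INTEGER is standard; TRANSFER reinterprets the
  ! bit pattern without conversion.
  print '(z8.8)', transfer(x, 0)   ! 80000000 for IEEE negative zero
end program show_bits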

> [...] The product, in some sense, has zero significant digits.

I'm not sure I agree with this. I would say that the product itself
has full precision (if the result is either positive or negative
zero). I guess the sum (0.0)+(-0.0) might be said to have zero
significant digits, but this does not seem to be a useful convention
either. For example, if you were to track the significant digits
through several expressions to determine the accuracy of the final
result, then the convention that (0.0)+(-0.0), and all subsequent
operations, have no significant digits isn't the correct answer in a
practical sense.

> This reminds me of a feature of the IBM Stretch that allowed one
> to specify at run time whether to shift in zero or one when doing
> floating point post-normalization. The idea was that one could
> run a program both ways and compare the result. In some, but
> not all, cases that would show up errors due to loss of
> significance. It might even be possible that (-1.0) times (0.0)
> would return (plus or minus) epsilon(1.0) on Stretch, though
> that would surprise me.

The CRAY computers also allowed the least significant bit to be
wrong for floating point multiply (and the result depended on the
order the two operands were loaded into registers), although I think
multiplication by zero was always correct.

$.02 -Ron Shepard

steve

May 17, 2010, 1:03:21 PM
On May 17, 3:29 am, Tobias Burnus <bur...@net-b.de> wrote:
> On 05/16/2010 09:07 PM, Thomas Koenig wrote:
>
> > Consider the following program:
> >   a(1) = -1.0
> >   b(1) = 0.0
> >   print *,a(1)*b(1)
> >   print *,dot_product(a,b)
> >   aa(1,1) = -1.0
> >   bb(1,1) = 0.0
> >   print *,matmul(aa,bb)
> > Assuming that the processor supports signed zeros, is the following
> > result OK?
>
> Well, in terms of numerics, small deviations from the exact result can
> be expected (cf. [for instance] the Goldberg paper); in this case the
> deviation "(-0.0)-(0.0)" is zero and thus one can be happy.
>
> >   -0.0000000
> >    0.0000000
> >    0.0000000
>
> I think it also shows how the calculation is done; depending how the
> calculation is done, the sign is lost (e.g. "0 + (-0)" - should this be
> +0 or -0?).

If one follows IEEE 754, then this is well-defined:

When the sum of two operands with opposite signs (or the
difference of two operands with like signs) is exactly zero,
the sign of that sum (or difference) shall be + in all rounding
direction modes except roundTowardNegative; in that mode, the
sign of an exact zero sum (or difference) shall be -. However,
x + x = x - (-x) retains the same sign as x even when x is zero.
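
A quick sketch showing both cases (assuming the compiler provides the
Fortran 2003 IEEE modules and honors the run-time rounding mode for
this expression):

program zero_sum_sign
  use, intrinsic :: ieee_arithmetic
  implicit none
  real :: p, m
  p = 0.0
  m = -0.0
  call ieee_set_rounding_mode(ieee_nearest)
  print *, p + m   ! +0.0: an exact zero sum is + in this mode
  call ieee_set_rounding_mode(ieee_down)
  print *, p + m   ! -0.0 under roundTowardNegative
end program zero_sum_sign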

--
steve

glen herrmannsfeldt

May 17, 2010, 2:11:07 PM
Ron Shepard <ron-s...@nospam.comcast.net> wrote:
> In article <hsrbs9$6us$1...@speranza.aioe.org>,
> glen herrmannsfeldt <g...@ugcs.caltech.edu> wrote:

>> It seems to me that the natural way to write either dot_product
>> or matrix_multiply is to initialize a variable to zero and then
>> sum into that variable, which in this case gives (0.0)+(-0.0)
>> as the result.

> If my memory is correct, matmul() is not defined in terms of
> dot_product() in the standard, it is defined with sum(). I don't
> remember how dot_product is defined, but it is possible that a
> compiler might evaluate sum() and matmul() differently than
> dot_product() in this respect. For example, one or the other might
> treat the size()=1 case differently. Or one or the other might be
> evaluated as

> sum = a(1)*b(1)
> do i = 2, n
>    sum = sum + a(i)*b(i)
> enddo

Yes, I might call that the unnatural way to write it.
Also, if done inline with dimensions known at compile time,
it could easily be sum=a(1)*b(1).



> which could explain some of the differences observed in some of the
> compilers. The above loop would preserve the negative zero, whereas
> running the loop from 1 would destroy the negative zero.

>> The question left, then, as I see it, is what the I/O library
>> should print given a negative zero.

> Yes, particularly since list-directed I/O was used. To see the
> bits, you might need to use a Z format, which although common is
> nonstandard for real variables.

Well, there is IEEE_IS_NEGATIVE() to test for negative values,
including negative zero.
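
For example (a minimal sketch):

program neg_test
  use, intrinsic :: ieee_arithmetic
  implicit none
  real :: z
  z = -0.0
  print *, ieee_is_negative(z)   ! T for negative zero
  print *, z < 0.0               ! F, because -0.0 == +0.0
end program neg_test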


>> [...] The product,
>> in some sense, has zero significant digits.

> I'm not sure I agree with this. I would say that the product itself
> has full precision (if the result is either positive or negative
> zero).

I should ask in a physics newsgroup. It seems that zero can have
either zero or infinite significant digits.

> I guess the sum (0.0)+(-0.0) might be said to have zero
> significant digits, but this does not seem to be a useful convention
> either. For example, if you were to track the significant digits
> through several expressions to determine the accuracy of the final
> result, then the convention that (0.0)+(-0.0), and all subsequent
> operations have no significant digits isn't the correct answer in a
> practical sense.

Well, consider x = 123.456 - 123.456.

Now how many digits does x have?



>> This reminds me of a feature of the IBM Stretch that allowed one
>> to specify at run time whether to shift in zero or one when doing
>> floating point post-normalization. The idea was that one could
>> run a program both ways and compare the result. In some, but
>> not all, cases that would show up errors due to loss of
>> significance. It might even be possible that (-1.0) times (0.0)
>> would return (plus or minus) epsilon(1.0) on Stretch, though
>> that would surprise me.

> The CRAY computers also allowed the least significant bit to be
> wrong for floating point multiply (and the result depended on the
> order the two operands were loaded into registers), although I think
> multiplication by zero was always correct.

CRAY was good at fast and close enough, where others went for slow
and always right. In physics, most quantities have a relative error,
that is, an error that scales with the size of the quantity. Floating
point is well designed for such quantities, and CRAY was designing
for physicists.

I didn't look to see how Stretch did multiply, but after a floating
point multiply with normalized operands there is a potential one-digit
post-normalization. If, under the "shift in ones" option, Stretch's
multiply did that, you would get something close to epsilon(1.0) from
1.0*0.0, but also consider the case 1.0*(1.0-1.0).

-- glen

robin

May 17, 2010, 6:50:55 PM
"glen herrmannsfeldt" <g...@ugcs.caltech.edu> wrote in message news:hsrbs9$6us$1...@speranza.aioe.org...

| He often uses the "answer first, read question later" approach,
| which seems to apply in this case. I suppose some machines might
| give a very small, non-zero, result in this case. That would
| seem to be very surprising, but maybe not impossible. The product,
| in some sense, has zero significant digits.

The product has about k significant digits,
where k is the precision of default REAL.

Another instance of your "answer first, read question later" approach.

