Hello all,
I am running a simple matrix multiplication program (see attached file).
I get inconsistent results when I compile it with clang and Polly, depending on whether Polly's parallelization is enabled.
Without parallelization (single core):
Command: ./llvm-project/build/bin/clang -O3 -mllvm -polly matmul.c
Result: sum = 433588338688.000000
With parallelization:
Command: ../llvm-project/build/bin/clang -O3 -mllvm -polly -mllvm -polly-parallel -lgomp -mllvm -polly-num-threads=4 matmul.c
Result: sum = 315474345984.000000