I figured out a part of the problem.
I re-compiled the GPUOcelot without llvm. Examples like Clock & BandwidthTest work fine now (no more "segmentation fault").
But this does not solve all the "segmentation fault" problems. Some examples still have this kind of fault.
Furthermore, how could I solve the race problem when I run ScalarProd.
The error is: "
terminate called after throwing an instance of 'hydrazine::Exception'
what(): [PC 37] [thread 1] [cta 0] st.shared.f32 [%r27], %r18 - Shared memory race condition, address 0x4 was previously read by thread 0 without a memory barrier in between.
Near tests/cuda4.1sdk/tests/scalarProd/scalarProd_kernel.cu:65:1
"
Best regards,
Alex
On Thursday, April 18, 2013 8:10:00 PM UTC+8, Alexander Donn wrote:
Hi,When I add "x86Trace: true" in configure.ocelot,
I got a "segmentation fault" at the end of running some examples (Clock & BandwidthTest) in cuda4.1sdk folder.
I got X86TraceGenerator.cpp:1829 ... X86TraceGenerator ... Assertion `0' failed when running SimplePrintf.
However, applications like VectorAdd & DeviceQuery work fine.
When I remove "x86Trace: true" in configure.ocelot, everything works fine.
Can anyone help me?
Thanks and Best Regards,
Alex