Regarding shared memory on kernels

Alberto de Sá Cavalcanti de Albuquerque

Dec 19, 2011, 3:56:51 PM
to gpuo...@googlegroups.com, Fernando Magno Quintao Pereira
Dear guys,

I am trying to use Ocelot to determine the amount of shared memory that each kernel declares statically. For instance, the NVIDIA SDK's transpose benchmark starts like:

.entry _Z9transposePfS_ii(...) {
  .shared .align 4 .b8 __cuda___cuda_local_var_84432_32_non_const_block16[1088];
  ...
}

Is there a way to use Ocelot to know that the amount of memory allocated to the static array is 1088 bytes?

Sincerely yours,
Alberto de Sá Cavalcanti de Albuquerque

Gregory...@gatech.edu

Dec 20, 2011, 2:34:18 AM
to gpuo...@googlegroups.com, Fernando Magno Quintao Pereira
Dear Alberto,

Ocelot exposes this information through the sharedMemorySize() method of the ExecutableKernel class, so any trace generator should be able to read it at runtime.
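
A minimal sketch of that idea, written so it compiles without Ocelot's headers. KernelStandIn and SharedMemoryRecorder are hypothetical stand-ins invented for illustration: only the sharedMemorySize() accessor comes from the description above. In Ocelot itself the kernel object would be the ExecutableKernel handed to your trace generator, and the exact hook signatures are an assumption that should be checked against the headers of your Ocelot version.

```cpp
#include <cassert>
#include <map>
#include <string>

// Hypothetical stand-in for Ocelot's ExecutableKernel. Only the
// sharedMemorySize() accessor mentioned above is modeled.
struct KernelStandIn {
    std::string name;
    unsigned int staticSharedBytes;

    unsigned int sharedMemorySize() const { return staticSharedBytes; }
};

// Hypothetical recorder that remembers the shared memory size of every
// kernel it is shown. In Ocelot this logic would sit in a trace generator's
// per-kernel initialization hook (assumption, not verified here).
class SharedMemoryRecorder {
public:
    void initialize(const KernelStandIn& kernel) {
        bytesByKernel_[kernel.name] = kernel.sharedMemorySize();
    }

    // Returns 0 for kernels that were never recorded.
    unsigned int bytesFor(const std::string& kernelName) const {
        std::map<std::string, unsigned int>::const_iterator it =
            bytesByKernel_.find(kernelName);
        return it == bytesByKernel_.end() ? 0u : it->second;
    }

private:
    std::map<std::string, unsigned int> bytesByKernel_;
};

// Example: record the transpose kernel from the question, whose static
// shared array is declared with 1088 bytes.
unsigned int exampleUsage() {
    KernelStandIn transpose = {"_Z9transposePfS_ii", 1088u};
    SharedMemoryRecorder recorder;
    recorder.initialize(transpose);

    unsigned int bytes = recorder.bytesFor(transpose.name);
    assert(bytes == transpose.sharedMemorySize());
    return bytes;
}
```

With a real trace generator, the stand-in struct goes away and the recorder's initialize step would receive Ocelot's own kernel object instead.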

Regards,

Gregory Diamos

