Dmitry N. Mikushin
unread,Nov 20, 2012, 9:16:48 PM11/20/12Sign in to reply to author
Sign in to forward
You do not have permission to delete messages in this group
Either email addresses are anonymous for this group or you need the view member email addresses permission to view the original message
to asf...@googlegroups.com
Dear colleagues,
Here comes another challenge in our favourite sport: looks like in
sm_35 (Kepler K20) NVIDIA completely changed the opcodes:
sm_30:
- /*0010*/ /*0x9400dc042c000000*/ S2R R3, SR_CTAid_X;
- /*0018*/ /*0x84001c042c000000*/ S2R R0, SR_Tid_X;
- /*0020*/ /*0x1030dc036000c000*/ SHL R3, R3, 0x4;
- /*0028*/ /*0x00311c0348000000*/ IADD R4, R3, R0;
- /*0030*/ /*0x3c41dc231a0ec000*/ ISETP.GT.AND P0, pt,
R4, 0xf, pt;
- /*0038*/ /*0x000001e780000000*/ @P0 EXIT;
- /*0048*/ /*0x00011de428004005*/ MOV R4, c [0x0] [0x140];
- /*0050*/ /*0xa0411c034801c000*/ IADD R4.CC, R4, 0x28;
sm_35:
+ /*0008*/ /*0x089c000664c03c00*/ MOV R1, c [0x0] [0x44];
+ /*0010*/ /*0x129c000e86400000*/ S2R R3, SR37;
+ /*0018*/ /*0x109c000286400000*/ S2R R0, SR33;
+ /*0020*/ /*0x021ffc0db7c00c00*/ SHF.L R3, RZ, 0x4, R3;
+ /*0028*/ /*0x001c0c12e0800000*/ IADD R4, R3, R0;
+ /*0030*/ /*0x079c101db3481c00*/ ISETP.GT.AND P0, PT,
R4, 0xf, PT;
+ /*0038*/ /*0x0000003c18000000*/ @P0 EXIT;
+ /*0048*/ /*0x281c001264c03c00*/ MOV R4, c [0x0] [0x140];
+ /*0050*/ /*0x141c1011c0840000*/ IADD R4.CC, R4, 0x28;
So, unless it is not a bug in cuobjdump, there is a need to re-run some tools...
- D.