Groups
Groups
Sign in
Groups
Groups
asfermi
Conversations
About
Send feedback
Help
asfermi
1–30 of 50
Discussion group for issues related to asfermi
Mark all as read
Report group
0 selected
Serge Clause
7/25/22
Hello, friends! I interested in GPU programming in assembly
Hello, friends! I am interested in GPU programming in assembly language. Some advice on where to
unread,
Hello, friends! I interested in GPU programming in assembly
Hello, friends! I am interested in GPU programming in assembly language. Some advice on where to
7/25/22
Matt Seil
,
Dmitry N. Mikushin
2
4/13/14
Hello all!
Please consider drawing attention of your undergrad students to this work-in-progress tool: http://
unread,
Hello all!
Please consider drawing attention of your undergrad students to this work-in-progress tool: http://
4/13/14
Dmitry N. Mikushin
9/12/13
SCHI encoding
Dear all, asfermi wiki http://code.google.com/p/asfermi/wiki/OpcodeExecution#SCHI states SM30's
unread,
SCHI encoding
Dear all, asfermi wiki http://code.google.com/p/asfermi/wiki/OpcodeExecution#SCHI states SM30's
9/12/13
Sun HuanHuan
, …
Hou Yunqing
8
7/23/13
the "same" cubins behaving total differently.
Hi Jayvant, I can't see exactly what went wrong. You can try the following: 1. Make sure your
unread,
the "same" cubins behaving total differently.
Hi Jayvant, I can't see exactly what went wrong. You can try the following: 1. Make sure your
7/23/13
Hou Yunqing
, …
Dmitry N. Mikushin
52
7/23/13
Fwd: GTX 680 (sm_30)
I've looked through a part of the 32-bit libcublas (the entire library took too long to
unread,
Fwd: GTX 680 (sm_30)
I've looked through a part of the 32-bit libcublas (the entire library took too long to
7/23/13
Hou Yunqing
, …
Everett Fominyen
5
7/18/13
need gtx680
Hi Everett, yes that would be most appreciated. Thanks a lot! Yunqing On Thu, Jul 18, 2013 at 6:51 PM
unread,
need gtx680
Hi Everett, yes that would be most appreciated. Thanks a lot! Yunqing On Thu, Jul 18, 2013 at 6:51 PM
7/18/13
Hou Yunqing
, …
Junjie Lai
7
7/15/13
sponsorship proposal
Hi Yunqing, Yes, I just started my career in NV. As in my paper done during my PhD work, "
unread,
sponsorship proposal
Hi Yunqing, Yes, I just started my career in NV. As in my paper done during my PhD work, "
7/15/13
anup holey
, …
Hou Yunqing
6
7/4/13
Understanding BRK instruction
HI Anup, if your problem goes away with -g flag, it surely is an nvcc bug. You can file a bug report
unread,
Understanding BRK instruction
HI Anup, if your problem goes away with -g flag, it surely is an nvcc bug. You can file a bug report
7/4/13
Peng Di
,
HuanHuan
2
5/17/13
any news about the control opcode in sm_30?
What do you mean by saying cuobjdump shows more instructions by sass??? What cuobjdump compares to?
unread,
any news about the control opcode in sm_30?
What do you mean by saying cuobjdump shows more instructions by sass??? What cuobjdump compares to?
5/17/13
Hou Yunqing
, …
HuanHuan
9
5/16/13
Sponsorship?
YQ, they don't release theirs, and you can do yours. and ask them for him. for example, rCUDA(www
unread,
Sponsorship?
YQ, they don't release theirs, and you can do yours. and ask them for him. for example, rCUDA(www
5/16/13
Dmitry N. Mikushin
,
Sun HuanHuan
2
2/23/13
Interpretation of compute capability bits in CUBIN ELF flags (e_flags)
Hi Dmitry, Thank you for your detailed observation. So I guess that 0x14 is 20 in decimal? and I can
unread,
Interpretation of compute capability bits in CUBIN ELF flags (e_flags)
Hi Dmitry, Thank you for your detailed observation. So I guess that 0x14 is 20 in decimal? and I can
2/23/13
Dmitry N. Mikushin
, …
Hou Yunqing
27
11/30/12
ISA decoding questions
Oops one error in the previous email. The second MOV[.S] reg0, reg1 [,cst]; should be MOV[.S] reg0, c
unread,
ISA decoding questions
Oops one error in the previous email. The second MOV[.S] reg0, reg1 [,cst]; should be MOV[.S] reg0, c
11/30/12
Dmitry N. Mikushin
,
Sun HuanHuan
6
11/21/12
sm_35 uses completely different set of opcodes
I want to do it. But I don't know C++. So I cannot do. I have to watch only. I am feeling sorry.
unread,
sm_35 uses completely different set of opcodes
I want to do it. But I don't know C++. So I cannot do. I have to watch only. I am feeling sorry.
11/21/12
Dmitry N. Mikushin
,
Hou Yunqing
6
11/9/12
Bizarre CUBIN kernels loading success/failure
Hi Yunqing, Strange, but when I'm trying to change the region you suggested (and also another one
unread,
Bizarre CUBIN kernels loading success/failure
Hi Yunqing, Strange, but when I'm trying to change the region you suggested (and also another one
11/9/12
Dmitry N. Mikushin
,
Hou Yunqing
3
11/6/12
[patch] Adding names for section-symbols, cleaning up first symbols layout
Hi Yunqing, Not yet, I'm implementing my own tool to merge two ELF objects. Maybe it would
unread,
[patch] Adding names for section-symbols, cleaning up first symbols layout
Hi Yunqing, Not yet, I'm implementing my own tool to merge two ELF objects. Maybe it would
11/6/12
Jianbin Fang
, …
swp...@gmail.com
8
8/29/12
asfermi vs. ptxas
Thanks! I get it. Well.... Actually, I'm looking for the open source assembler like ptxas that
unread,
asfermi vs. ptxas
Thanks! I get it. Well.... Actually, I'm looking for the open source assembler like ptxas that
8/29/12
Dmitry N. Mikushin
,
Hou Yunqing
2
8/13/12
Using AsFermi for KernelGen on Kepler
Hi D., Do you mean you haven't tried dynamically loading ptxas-scheduled instructions? I guess it
unread,
Using AsFermi for KernelGen on Kepler
Hi D., Do you mean you haven't tried dynamically loading ptxas-scheduled instructions? I guess it
8/13/12
Dmitry N. Mikushin
8/13/12
AsFermi meeting in Singapore
Dear Yunqing & colleagues, In case you are in Singapore, in would be great pleasure to meet you
unread,
AsFermi meeting in Singapore
Dear Yunqing & colleagues, In case you are in Singapore, in would be great pleasure to meet you
8/13/12
Dmitry N. Mikushin
,
Hou Yunqing
5
8/6/12
Please help to debug wrong behavior on Kepler
Hi D., Good to see that! Actually I was thinking of helping when I saw the first email, but my
unread,
Please help to debug wrong behavior on Kepler
Hi D., Good to see that! Actually I was thinking of helping when I saw the first email, but my
8/6/12
Dmitry N. Mikushin
2
8/2/12
Support for switching between .nv.constant2 (for Fermi) and .nv.constant3 (for Kepler)
Pardon, also r756 contains the central change in helperCubin.cpp: http://code.google.com/p/asfermi/
unread,
Support for switching between .nv.constant2 (for Fermi) and .nv.constant3 (for Kepler)
Pardon, also r756 contains the central change in helperCubin.cpp: http://code.google.com/p/asfermi/
8/2/12
Dmitry N. Mikushin
2
7/28/12
GTX 680M: Error 700 while executing a simplest kernel
Thank you! Yes, it works, if args are started with 0x140: !Kernel kernel !Param 8 3 S2R R0,
unread,
GTX 680M: Error 700 while executing a simplest kernel
Thank you! Yes, it works, if args are started with 0x140: !Kernel kernel !Param 8 3 S2R R0,
7/28/12
Jianbin Fang
,
Hou Yunqing
2
5/15/12
typo error in test demo code
Hi Jianbin, Yes you probably misunderstood that test. And it's not meant to be understood anyway.
unread,
typo error in test demo code
Hi Jianbin, Yes you probably misunderstood that test. And it's not meant to be understood anyway.
5/15/12
Jianbin Fang
5/15/12
data reading latency
Hi guys, Currently, I would like to know how many cycles it takes to read a byte from global memory/
unread,
data reading latency
Hi guys, Currently, I would like to know how many cycles it takes to read a byte from global memory/
5/15/12
Jianbin Fang
,
Hou Yunqing
2
5/14/12
instruction meanings
Hi Jianbin, Here's for the special registers: http://code.google.com/p/asfermi/wiki/
unread,
instruction meanings
Hi Jianbin, Here's for the special registers: http://code.google.com/p/asfermi/wiki/
5/14/12
Jianbin Fang
, …
Dmitry N. Mikushin
4
4/27/12
unable to execute cubin file
Yes, it is because of the mismatch (versions of Driver was lower than that of SDK, and it should be
unread,
unable to execute cubin file
Yes, it is because of the mismatch (versions of Driver was lower than that of SDK, and it should be
4/27/12
Hou Yunqing
, …
Dmitry N. Mikushin
6
4/25/12
Re: about cache strategies
Hmm.. I just thought it's some quick work so I did it.. Not sure if recursion needs non-zero
unread,
Re: about cache strategies
Hmm.. I just thought it's some quick work so I did it.. Not sure if recursion needs non-zero
4/25/12
Jianbin Fang
,
Hou Yunqing
2
4/25/12
about directives in asfermi
Directives have nothing to do with the ISA itself. They are things that tell asfermi what to do. I
unread,
about directives in asfermi
Directives have nothing to do with the ISA itself. They are things that tell asfermi what to do. I
4/25/12
Sun HuanHuan
4
3/31/12
weird problem
please check this!! And thanks to hyq. :) https://groups.google.com/forum/?fromgroups#!topic/asfermi/
unread,
weird problem
please check this!! And thanks to hyq. :) https://groups.google.com/forum/?fromgroups#!topic/asfermi/
3/31/12
Sun HuanHuan
3/28/12
Powerful new instruction shfl
Hi! Kepler or ptx 3.0 introduced new instruction shfl which is VERY powerful. Huan
unread,
Powerful new instruction shfl
Hi! Kepler or ptx 3.0 introduced new instruction shfl which is VERY powerful. Huan
3/28/12
Sun HuanHuan
,
Hou Yunqing
11
3/27/12
Good news!
Corrections. As many instructions have a throught put of 32*n + 8 form. So i guess integer mul/mad is
unread,
Good news!
Corrections. As many instructions have a throught put of 32*n + 8 form. So i guess integer mul/mad is
3/27/12