Does TFRT support multiple CUDA streams?

Haibin

Sep 11, 2020, 8:53:17 PM
to TensorFlow Runtime
If I understand correctly, native TF uses only one CUDA stream per device for computation. Does TFRT use multiple CUDA streams, or will it in the future?

Thanks,
Haibin 

Idan Mintz

Sep 15, 2020, 12:23:31 AM
to TensorFlow Runtime
Hi Haibin,

TFRT uses a single stream when executing in eager (op-by-op) mode. In graph execution mode, TFRT exposes kernels that enable multi-stream computation, but it is up to the model creator and/or their compiler to assign ops to streams. The TF MLIR compiler will be capable of doing stream assignment.
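
To make "stream assignment" concrete, here is a minimal sketch of what such a pass could look like. It is a toy greedy heuristic written for illustration only, not the algorithm used by TFRT or the TF MLIR compiler. The function name `assign_streams`, the graph encoding, and the `num_streams` parameter are all hypothetical. Each op continues the stream of an input that is currently the tail of that stream; otherwise it is placed round-robin. Any dependency that crosses streams is recorded, since it would need an explicit synchronization (for example, a CUDA event wait).

```python
def assign_streams(ops, num_streams):
    """Toy greedy stream assignment (illustrative assumption, not TFRT's pass).

    ops: dict mapping op name -> list of input op names, listed in
         topological order (every input appears before its consumer).
    Returns (stream_of, waits):
      stream_of: dict mapping op name -> stream index
      waits: list of (consumer, producer) pairs whose dependency crosses
             streams and would therefore need an explicit event wait.
    """
    stream_of = {}
    last_on_stream = {}  # stream index -> most recent op placed on it
    waits = []
    next_stream = 0      # round-robin counter for ops that start a new chain

    for op, inputs in ops.items():
        chosen = None
        # Prefer a stream whose most recent op is one of our inputs:
        # in-stream ordering then satisfies that dependency for free.
        for dep in inputs:
            s = stream_of[dep]
            if last_on_stream.get(s) == dep:
                chosen = s
                break
        if chosen is None:
            chosen = next_stream % num_streams
            next_stream += 1

        stream_of[op] = chosen
        last_on_stream[chosen] = op

        # Dependencies on ops in other streams need cross-stream sync.
        for dep in inputs:
            if stream_of[dep] != chosen:
                waits.append((op, dep))

    return stream_of, waits


# Diamond-shaped graph: a feeds b and c, which both feed d.
graph = {"a": [], "b": ["a"], "c": ["a"], "d": ["b", "c"]}
stream_of, waits = assign_streams(graph, num_streams=2)
print(stream_of)
print(waits)
```

On this diamond graph with two streams, `b` and `c` land on different streams and can overlap, at the cost of two cross-stream waits. With `num_streams=1` everything lands on stream 0 and no waits are needed, which corresponds to the single-stream behavior described for eager mode.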

Thanks,
Idan