AWS FPGA F1 DRAM Latency

76 views
Skip to first unread message

Rafael Ulises Luzius Pizarro Solar

unread,
Mar 18, 2022, 5:36:59 PM3/18/22
to FireSim
Hello guys,

Firesim guarantees cycle accuracy simulation in a context where an actual FPGA DRAM takes multiple FPGA cycles.

Did you measure the latency between the AXI request departing from the module that abstracts DRAM and the data arriving (~pure AXI realm)?

Best,
Rafael

David Biancolin

unread,
Mar 20, 2022, 8:41:48 PM3/20/22
to FireSim
Hey Rafael,

As you might expect, that's a moving target that depends on the relative speed of the clock domains, and the load on the memory system. But on F1, with a 90 MHz simulator clock, a ballpark figure is about 40 cycles.

- David

Rafael Pizarro Solar

unread,
Mar 24, 2022, 6:07:36 AM3/24/22
to FireSim
Hello David,

Thank you for your response.

This ballpark figure is from the memory request coming out of the L2 cache to the data coming back or from the request going out AXI domain ( -> DDR controllers) and data coming back?

- Rafael

David Biancolin

unread,
Mar 24, 2022, 4:47:45 PM3/24/22
to fir...@googlegroups.com
I'm talking just about the latencies in the FPGA memory system.

The latencies seen by the L2 is configurable and deterministic. That depends on what FASED model you select, and what diplomatic widgets you have between the L2 and the AXI4Port coming off your SoC. 

There's a workload we provide under deploy/workloads called ccbench that will measure these latencies for you for your design. I'd encourage you to run that. 

- David 

--
You received this message because you are subscribed to a topic in the Google Groups "FireSim" group.
To unsubscribe from this topic, visit https://groups.google.com/d/topic/firesim/MGBuW7yiRKY/unsubscribe.
To unsubscribe from this group and all its topics, send an email to firesim+u...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/firesim/0f1b82f5-0b66-4902-a963-4f580ed85a44n%40googlegroups.com.

David Biancolin

unread,
Mar 24, 2022, 4:50:18 PM3/24/22
to fir...@googlegroups.com
Just to clarify further, your simulation will not observe the 40 cycle figure i mentioned. For example, if you choose a Latency-Bandwidth Pipe your target (the simulated design) will deterministically observe the latency you requested -- which can range from 1 cycle to many thousands of cycles. This paper describes how that works in detail: https://people.eecs.berkeley.edu/~biancolin/papers/fased-fpga19.pdf

Reply all
Reply to author
Forward
0 new messages