I also don't see any references to VBUS/VBUSP, but based on the context
they are internal PRU buses.
To answer your original question, access to anything outside the PRU
block (the two PRU cores, data memories, and local peripherals) requires
communicating over the SoC's internal interconnect fabric. You can
perform zero wait state writes to these resources (at least until you
saturate the posted write logic), but reads will stall the PRU until
data is returned from the far end.
I characterized the performance when accessing the GPIO registers from
the PRU, and got results similar to your DDR memory timings:
https://github.com/machinekit/machinekit/blob/master/src/hal/drivers/hal_pru_generic/pru_generic.p#L137-L165
Note that all timings are approximate. The exact number of PRU cycles
it will take to complete a write or read will depend on things like
internal bus utilization, various clock crossing latencies (which by
nature will have a varying amount of latency) and how quickly the far
end can respond. The DDR DRAM controller in particular needs to
schedule the read request and there are many factors that can cause the
read latency to vary.
--
Charles Steinkuehler
cha...@steinkuehler.net