I know I have a bad habit of replying to myself, but I figure it's polite to close the loop :)
I have confirmed that there is a disk IOPS issue. I used SaltStack to send the following command to all the workers at once (race!):
`fio --randrepeat=1 --ioengine=libaio --direct=1 --gtod_reduce=1 --name=test --filename=test --bs=4k --iodepth=64 --size=4G --readwrite=randrw --rwmixread=75 | grep iops`
The results (below) clearly show wildly variable IOPS performance: read IOPS range from 969 (worker-01-p) to 13401 (worker-02-p), roughly a 14x spread.
So what looked like a Nameko/Kombu issue (those were the only errors being thrown) was actually more fundamental :)
worker-02-p:
TERM environment variable not set.
/bin/sh: 1: cd: can't cd to ~
read : io=3071.7MB, bw=53605KB/s, iops=13401, runt= 58677msec
write: io=1024.4MB, bw=17876KB/s, iops=4469, runt= 58677msec
worker-05-p:
TERM environment variable not set.
/bin/sh: 1: cd: can't cd to ~
read : io=3071.7MB, bw=13852KB/s, iops=3463, runt=227064msec
write: io=1024.4MB, bw=4619.5KB/s, iops=1154, runt=227064msec
worker-07-p:
TERM environment variable not set.
/bin/sh: 1: cd: can't cd to ~
read : io=3071.7MB, bw=7588.3KB/s, iops=1897, runt=414510msec
write: io=1024.4MB, bw=2530.6KB/s, iops=632, runt=414510msec
worker-06-p:
TERM environment variable not set.
/bin/sh: 1: cd: can't cd to ~
read : io=3071.7MB, bw=6205.5KB/s, iops=1551, runt=506908msec
write: io=1024.4MB, bw=2069.3KB/s, iops=517, runt=506908msec
worker-03-p:
TERM environment variable not set.
/bin/sh: 1: cd: can't cd to ~
read : io=3071.7MB, bw=6203.6KB/s, iops=1550, runt=507030msec
write: io=1024.4MB, bw=2068.8KB/s, iops=517, runt=507030msec
worker-08-p:
TERM environment variable not set.
/bin/sh: 1: cd: can't cd to ~
read : io=3071.7MB, bw=6049.2KB/s, iops=1512, runt=519969msec
write: io=1024.4MB, bw=2017.3KB/s, iops=504, runt=519969msec
worker-04-p:
TERM environment variable not set.
/bin/sh: 1: cd: can't cd to ~
read : io=3071.7MB, bw=6021.4KB/s, iops=1505, runt=522370msec
write: io=1024.4MB, bw=2007.2KB/s, iops=501, runt=522370msec
worker-01-p:
TERM environment variable not set.
/bin/sh: 1: cd: can't cd to ~
read : io=3071.7MB, bw=3877.2KB/s, iops=969, runt=811256msec
write: io=1024.4MB, bw=1292.1KB/s, iops=323, runt=811256msec
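
In case it helps anyone repeating this, here is a rough sketch of how the output above could be summarised rather than eyeballed. It assumes the fio 2.x-style `read : ... iops=N, runt=Nmsec` lines and the `hostname:` headers exactly as pasted above; the function names (`parse_salt_fio`, `read_iops_spread`) are just ones I made up for illustration, and newer fio versions print a different summary format that this regex will not match.

```python
import re

# Matches fio summary lines in the format pasted above, e.g.
#   read : io=3071.7MB, bw=53605KB/s, iops=13401, runt= 58677msec
LINE_RE = re.compile(r"^\s*(read|write)\s*:.*?\biops=(\d+),\s*runt=\s*(\d+)msec")


def parse_salt_fio(text):
    """Return {host: {"read": iops, "write": iops, "runt_ms": ms}}.

    Assumes each host's block starts with a line like "worker-02-p:".
    Unrelated lines (e.g. the TERM / cd warnings) are ignored.
    """
    results = {}
    host = None
    for line in text.splitlines():
        stripped = line.strip()
        match = LINE_RE.match(line)
        if match:
            if host is None:
                continue  # fio line seen before any host header; skip it
            kind, iops, runt = match.groups()
            results[host][kind] = int(iops)
            results[host]["runt_ms"] = int(runt)
        elif stripped.endswith(":") and " " not in stripped:
            # Host header: a single token followed by a colon
            host = stripped[:-1]
            results[host] = {}
    return results


def read_iops_spread(results):
    """Ratio of the fastest to the slowest host's read IOPS."""
    reads = [r["read"] for r in results.values() if "read" in r]
    return max(reads) / min(reads)


# Two of the blocks from the output above, copied verbatim.
SAMPLE = """\
worker-02-p:
TERM environment variable not set.
/bin/sh: 1: cd: can't cd to ~
read : io=3071.7MB, bw=53605KB/s, iops=13401, runt= 58677msec
write: io=1024.4MB, bw=17876KB/s, iops=4469, runt= 58677msec
worker-01-p:
TERM environment variable not set.
/bin/sh: 1: cd: can't cd to ~
read : io=3071.7MB, bw=3877.2KB/s, iops=969, runt=811256msec
write: io=1024.4MB, bw=1292.1KB/s, iops=323, runt=811256msec
"""

results = parse_salt_fio(SAMPLE)
# Print hosts from slowest to fastest by read IOPS
for name, stats in sorted(results.items(), key=lambda kv: kv[1]["read"]):
    print(f"{name}: read={stats['read']} write={stats['write']} "
          f"runt={stats['runt_ms'] / 1000:.0f}s")
print(f"read IOPS spread: {read_iops_spread(results):.1f}x")
```

Feeding it the full Salt output instead of `SAMPLE` should give one line per worker, sorted slowest first, which makes the outliers a bit easier to spot.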