I think this does require low latencies for both network and processing.
With local simple 'echo single query parameter service as XML', I was
able to get 10k/second couple of years ago already, on then modern
single-core machine. So I have no trouble believing that such
solid-state services (i.e. no DB lookups, and at most simple SSD
reads) could reach 20k-30k request rates on multi-core machines.
But I would think that anything with higher latency (calls to other
servers, DB, longer reads, access across WAN) would probably be less
likely to get such rates.
FWIW, service I am currently building is doing 3k/second without
breaking sweat and is throttled by client set up (for now). It does do
file and/or local-bdb access, but probably would cap out somewhere
slightly below 10k.
Not sure if above helps, but thought this is a good discussion to have. :-)
-+ Tatu +-