I'm working with Ray, so I'll fill in some more details. First, we're on v0.7.0.
With just a simple GET request, no data payload and no work, we get more or less the performance we expect.
In the controller where the large POST request is handled (a few different test samples, but the largest is ~83KByte) and does no work and immediately returns an "OK", I'm seeing some significant scaling penalty in terms of throughput. Here are some average throughput for a sample that is 54KByte using JMeter Summary Report, HTTPClient4, with a 2nd server acting as a Remote JMeter server.
1 Thread: ~76.6/sec
2 Thread: ~111.9/sec
4 Thread: ~170.6/sec
As you can see, for such a simple accept data and return immediately, this should have at least close to linearly scaling, while the results above clearly do not.
JVM (1.7.0_17):
~12GB memory
-XX:+UseConcMarkSweepGC
-XX:SurvivorRatio=16
-XX:PermSize=128m
-XX:MaxPermSize=256m
-XX:NewSize=512m
Server (both machines, JMeter Remote & DropWizard Application instance):
Xeon E5462, 4C8T
32GB memory
CentOS, kernel 2.6.32-358.el6.x86_64
--Aaron