Hi Anton.
On Tue, 2013-06-18 at 04:22 +0400, Anton Vodonosov wrote:
> I wanted to try the upload from my machine, but the directory
> cl-test-grid/work-dir/agent/test-runs/20130615153704-sbcl-1.1.8.57-d5c8232-dirty-linux-x64/
> is not available online anymore, because your test system runs another test run now.
Sorry, I did some experiments of my own in the meantime, which caused the
workspace to be deleted. Thanks for helping to figure this out.
> I can say now that my theory about 30 seconds timeout is not true. I saw in the agent.log
> that the network connection problem occurred 4 seconds after the request was started.
>
> Currently to me the theory that your virtual machines have some network configuration
> problems look the most probable.
>
> Do you have any news about this problem?
I did three experiments:
1. I started SBCL on one of the virtual machines in the workspace
in which the failed upload occurred. I connected to this SBCL
via SLIME/SWANK, changed the batch size to 50 and performed a
manual upload. That worked.
2. After that, I tried automated runs with batch size 50 and it
worked once on one machine, but did not work on the other
machines.
3. After that, I kept the batch size 50 and added a 30 second delay
between individual upload batches, but it still did not
work on the machine that had failed before. At the time of writing, the
previously successful machine is still running with this
configuration.
Every failed upload ended with "Connection reset by peer" after fewer
than 300 files (6 batches at my batch size of 50).
Next, I will try a 60 second delay between individual upload batches.
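For reference, here is a minimal sketch of the batching-with-delay scheme described above. It is written in Python for brevity and is not the agent's actual Common Lisp code; `split_into_batches`, `upload_in_batches`, `upload_batch`, `batch_size` and `delay_seconds` are hypothetical names chosen for illustration, not cl-test-grid functions.

```python
import time
from typing import Callable, List, Sequence


def split_into_batches(files: Sequence[str], batch_size: int) -> List[List[str]]:
    """Split the file list into consecutive batches of at most batch_size items."""
    if batch_size <= 0:
        raise ValueError("batch_size must be positive")
    return [list(files[i:i + batch_size]) for i in range(0, len(files), batch_size)]


def upload_in_batches(
    files: Sequence[str],
    upload_batch: Callable[[List[str]], None],  # hypothetical: performs one upload request
    batch_size: int = 50,
    delay_seconds: float = 60.0,
    sleep: Callable[[float], None] = time.sleep,
) -> int:
    """Upload files batch by batch, pausing between batches.

    Returns the number of batches sent. Any exception raised by
    upload_batch (for example a connection reset) propagates to the caller.
    """
    batches = split_into_batches(files, batch_size)
    for index, batch in enumerate(batches):
        upload_batch(batch)
        # Pause only between batches, not after the last one.
        if index < len(batches) - 1:
            sleep(delay_seconds)
    return len(batches)
```

With a batch size of 50, 300 files correspond to 6 batches, so with the planned 60 second delay a run that gets that far includes 5 pauses.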
Kind regards,
Jan