On Mon, May 26, 2008 22:55, Philip Papadopoulos wrote:
> On Mon, May 26, 2008 at 12:00 AM, Jimmy Hedman <
jimmy....@southpole.se>
> wrote:
>
>> On Tue, May 20, 2008 10:56, Jimmy Hedman wrote:
>> > Hi,
>> > We have encountered a strange problem with SGE. sge_execd dies after
>> > about 11 minutes from boot with the message: 'commlib error: got read
>> > error (closing "
test.southpole.se/qmaster/1")'. We have tried both
>> Rocks
>> > 4.3 and 5. It is only after boot, if we restart sgeexecd it keeps
>> > running.
>> > Any ideas what this could be?
>> More info on this. It seems to be only direct after a reinstallation. If
>> I
>> reboot the machines it works fine.
>>
> Can you find anything the SGE logs? This is strange behavior.
")'). The only thing the master says
is that I can re-compile SGE if I like to have more than 1004 clients.
We had different set of Rolls on Rocks 4.3 vs Rocks V. I did first suspect
didn't have that roll on Rocks V I'm pretty sure it's not the problem.