On Thu, May 24, 2012 at 4:16 PM, Kyungyong Lee <
kl...@acis.ufl.edu> wrote:
> Hello Peter,
>
> As I inspect it, all machines seem to be claimed by "C208052244.ipop".
> Is it a machine that you are running your jobs? I think it would be
> worth to spend time to inspect status of both the claimed/idle
> machines and submit VM (C208052244.ipop).
>
Yes, C208052244 is my submit machine. I just dumped a huge batch of
jobs to run, so I don't really want to reboot anything or restart
condor, and I'm not exactly a condor guru, so that's what I do when I
have problems :)
It seems the number of claimed/idle nodes has gone down from 48
yesterday to 34 now. But of the ones that remain, none of them are
machines that I admin. These are the hostnames of my machines:
C000034052.ipop
C018192220.ipop
C026205156.ipop
C033082209.ipop
C050172050.ipop
C113208164.ipop
C142212012.ipop
C152162005.ipop
C180202002.ipop
C185203074.ipop
C208052244.ipop
C212138053.ipop
C251167027.ipop
And these are the ones with claimed/idle nodes:
$ condor_status -const 'State == "Claimed" && Activity == "Idle"' -f
'%s\n' Machine | sort -u
C026214087.ipop
C052013004.ipop
C077078154.ipop
C108079196.ipop
C123040101.ipop
C135095238.ipop
C135096034.ipop
C153238212.ipop
C175010111.ipop
C252172134.ipop
I don't really know how to manage condor on machines I can't
personally ssh into :)
-Pete