Error in SiteDirector after update to 7.3.4


Luisa Arrabito

Oct 6, 2021, 10:32:13 AM
to diracgrid-forum

Dear all,

I've just updated our servers to version 7.3.4, but I get errors in the SiteDirector agent:

2021-10-06 14:22:43 UTC WorkloadManagement/SiteDirector VERBOSE: pilotOptions: -S CTA-cert -l CTA -e CTA -N ce14.pic.es -Q condor -n LCG.PIC.es --userEnvVariables DIRACSYSCONFIG:::pilot.cfg
2021-10-06 14:22:43 UTC WorkloadManagement/SiteDirector ERROR: Exception during pilot modules files compression
Traceback (most recent call last):
  File "/opt/dirac/versions/v2.0a5-1633444362/Linux-x86_64/lib/python3.9/site-packages/DIRAC/WorkloadManagementSystem/Agent/SiteDirector.py", line 1024, in _writePilotScript
    pilotFilesCompressedEncodedDict = getPilotFilesCompressedEncodedDict(pilotFiles,
  File "/opt/dirac/versions/v2.0a5-1633444362/Linux-x86_64/lib/python3.9/site-packages/DIRAC/WorkloadManagementSystem/Utilities/PilotWrapper.py", line 288, in getPilotFilesCompressedEncodedDict
    with open(pf, "r") as fd:
FileNotFoundError: [Errno 2] No such file or directory: 'Core/scripts/dirac-install.py'
2021-10-06 14:22:43 UTC WorkloadManagement/SiteDirector ERROR: Agent exception while calling method <bound method SiteDirector.execute of <DIRAC.WorkloadManagementSystem.Agent.SiteDirector.SiteDirector object at 0x7fec4bbba0d0>>
Traceback (most recent call last):
  File "/opt/dirac/versions/v2.0a5-1633444362/Linux-x86_64/lib/python3.9/site-packages/DIRAC/Core/Base/AgentModule.py", line 345, in am_secureCall
    result = functor(*args)
  File "/opt/dirac/versions/v2.0a5-1633444362/Linux-x86_64/lib/python3.9/site-packages/DIRAC/WorkloadManagementSystem/Agent/SiteDirector.py", line 326, in execute
    result = self.submitPilots()
  File "/opt/dirac/versions/v2.0a5-1633444362/Linux-x86_64/lib/python3.9/site-packages/DIRAC/WorkloadManagementSystem/Agent/SiteDirector.py", line 445, in submitPilots
    res = self._submitPilotsToQueue(pilotsToSubmit, ce, queueName)
  File "/opt/dirac/versions/v2.0a5-1633444362/Linux-x86_64/lib/python3.9/site-packages/DIRAC/WorkloadManagementSystem/Agent/SiteDirector.py", line 705, in _submitPilotsToQueue
    executable = self.getExecutable(queue,
  File "/opt/dirac/versions/v2.0a5-1633444362/Linux-x86_64/lib/python3.9/site-packages/DIRAC/WorkloadManagementSystem/Agent/SiteDirector.py", line 919, in getExecutable
    executable = self._writePilotScript(workingDirectory=workingDirectory,
  File "/opt/dirac/versions/v2.0a5-1633444362/Linux-x86_64/lib/python3.9/site-packages/DIRAC/WorkloadManagementSystem/Agent/SiteDirector.py", line 1030, in _writePilotScript
    localPilot = pilotWrapperScript(pilotFilesCompressedEncodedDict=pilotFilesCompressedEncodedDict,
UnboundLocalError: local variable 'pilotFilesCompressedEncodedDict' referenced before assignment


Do you know if it's a bug or some configuration/installation issue?

Thank you,
Luisa

Andrei Tsaregorodtsev

Oct 6, 2021, 10:43:45 AM
to diracgrid-forum
The problem is this:
2021-10-06 14:22:43 UTC WorkloadManagement/SiteDirector ERROR: Exception during pilot modules files compression

It is due to: FileNotFoundError: [Errno 2] No such file or directory: 'Core/scripts/dirac-install.py'

That file, I guess, is no longer there in v7r3. So the SiteDirector default for it should be updated.
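
The second traceback (the UnboundLocalError) looks like a follow-on effect of the first one: the failure is logged, execution continues, and the result variable is then used although it was never assigned. Below is a minimal sketch of that pattern, not the actual SiteDirector code; the function names are hypothetical stand-ins.

```python
import logging

log = logging.getLogger("sketch")


def compress_pilot_files(paths):
    """Hypothetical stand-in for the compression step: reads every file."""
    encoded = {}
    for path in paths:
        # Raises FileNotFoundError if a listed file is missing
        with open(path, "r") as fd:
            encoded[path] = fd.read()
    return encoded


def write_pilot_script(paths):
    """Hypothetical stand-in: logs a failure but keeps going."""
    try:
        encoded = compress_pilot_files(paths)
    except Exception:
        # The error is only logged, so 'encoded' stays unassigned
        log.exception("Exception during pilot modules files compression")
    # If the try block failed, this line raises UnboundLocalError
    return len(encoded)
```

Calling write_pilot_script with a path that does not exist first logs the FileNotFoundError and then raises UnboundLocalError, which matches the order of the two errors in the agent log.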

  Andrei

Luisa Arrabito

Oct 6, 2021, 10:52:03 AM
to diracgrid-forum
Yes, I saw that.
So is this a bug, or can I fix it with some configuration?
Thank you,
Luisa

Andrei Tsaregorodtsev

Oct 6, 2021, 11:00:31 AM
to diracgrid-forum
It looks like it is not needed any more, but I would like Federico to comment. You can try emptying the pilotFiles list (line 1022) and try again.

Andrei 

Federico Stagni

Oct 6, 2021, 12:15:11 PM
to Andrei Tsaregorodtsev, diracgrid-forum
Hi,
First, Core/scripts/dirac-install.py is still there: https://github.com/DIRACGrid/DIRAC/blob/rel-v7r3/Core/scripts/dirac-install.py but I see you are using a python3 server, am I right?

It also seems to me that you are trying to submit python2 pilots using the python3 server. You should not do that; you probably have a configuration that should not be there. What do you have in the CS?

Cheers,
Federico



Luisa Arrabito

Oct 7, 2021, 3:44:11 AM
to diracgrid-forum
Hi Federico,

First, I don't see dirac_install.py at:


Then, yes, we have a python3 server, and from the doc I understood that SiteDirector would submit python3 pilots by default, so I did not add the Python3Pilots option in the CS.

Looking at the code, however, it seems that the Python3Pilots flag has to be set to True explicitly; otherwise it defaults to False.

After I set it to True, pilots got submitted.
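
To make the effect of the missing option concrete, here is a small sketch of a flag that defaults to False when it is absent. The option name Python3Pilots comes from this thread; the dictionary lookup and both helper functions are hypothetical and are not DIRAC's API. The link between the python2 path and dirac-install.py is an assumption based on the error above.

```python
# Hypothetical stand-in for reading agent options from the CS; not DIRAC's API.
TRUE_VALUES = {"true", "yes", "y", "1"}


def python3_pilots_enabled(agent_options):
    """Return the Python3Pilots flag, defaulting to False when it is absent."""
    raw = agent_options.get("Python3Pilots", False)
    return str(raw).strip().lower() in TRUE_VALUES


def extra_pilot_files(agent_options):
    """Assumed behaviour: only the python2 path bundles dirac-install.py."""
    if python3_pilots_enabled(agent_options):
        return []
    return ["Core/scripts/dirac-install.py"]
```

With an empty option set the helper returns the dirac-install.py path, which a python3 installation may not ship; with Python3Pilots set to True the list is empty.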

Thank you,

Luisa

Luisa Arrabito

Oct 7, 2021, 4:42:57 AM
to diracgrid-forum
Hi again,

Now pilots get submitted, but they fail because they cannot configure themselves properly:


Executing: /scratch/condor/dir_31450/DIRAC_bS0Uh2pilot/diracos/bin/dirac-configure -S CTA-cert -C dips://does.not.matter.org:9162 -e CTA -o /LocalSite/ReleaseProject=CTA -o /LocalSite/ReleaseVersion= -O pilot.cfg

Could you please tell me where I have to specify the Configuration Service?

In my CS I have:


Thank you,

Luisa

Luisa Arrabito

Oct 7, 2021, 5:02:20 AM
to diracgrid-forum
I think the problem is that the pilot location being used is not the right one.

It seems that it fails to get the files from here:

2021-10-07 07:59:02 UTC ERROR    https://ccdcta-cert.in2p3.fr/pilot unreacheable (this is normal!)
2021-10-07 07:59:02 UTC ERROR    HTTP Error 404: Not Found

so it gets them from:
Trying file:/cvmfs/dirac.egi.eu/pilot/
2021-10-07 07:59:02 UTC INFO     Checking 'pilot.tar' for checksum

Did anything change in v7r3 in this respect?
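
The log above suggests that the pilot tries a list of locations in order and falls back to the next one when a download fails. A minimal sketch of that fallback logic follows; it is not the pilot's real code, and the function name and the injected fetch callable are hypothetical.

```python
def fetch_first_available(locations, fetch):
    """Try each location in order and return the first one that works.

    'fetch' is any callable that returns the payload or raises OSError
    (urllib's HTTPError is a subclass of OSError).
    """
    errors = []
    for location in locations:
        try:
            return location, fetch(location), errors
        except OSError as exc:
            # Remember the failure and move on to the next location
            errors.append((location, str(exc)))
    raise RuntimeError("no pilot location reachable: %r" % errors)
```

In this thread the first location (https://ccdcta-cert.in2p3.fr/pilot) answered with a 404, so the files were taken from file:/cvmfs/dirac.egi.eu/pilot/ instead, which would explain why the pilot did not pick up the local configuration.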

Luisa Arrabito

Oct 7, 2021, 7:28:18 AM
to diracgrid-forum
I have found the solution.

I had to create a symlink to the pilot directory:

cd /opt/dirac/pro/Linux-x86_64
ln -s /opt/dirac/webRoot
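
For anyone who wants to script this step, here is a rough Python equivalent of the two commands above. The helper name link_into is hypothetical; it is written so that it can be run repeatedly without failing when the link already exists.

```python
import os
import tempfile


def link_into(directory, target):
    """Create directory/<basename of target> -> target, like 'cd directory && ln -s target'."""
    link_path = os.path.join(directory, os.path.basename(target.rstrip("/")))
    if os.path.islink(link_path) and os.readlink(link_path) == target:
        return link_path  # link already in place, nothing to do
    os.symlink(target, link_path)
    return link_path


# Demonstration in a temporary directory, so no real installation is touched.
# On the server the call would be:
#   link_into("/opt/dirac/pro/Linux-x86_64", "/opt/dirac/webRoot")
with tempfile.TemporaryDirectory() as tmp:
    web_root = os.path.join(tmp, "webRoot")
    platform_dir = os.path.join(tmp, "Linux-x86_64")
    os.mkdir(web_root)
    os.mkdir(platform_dir)
    print(link_into(platform_dir, web_root))
```

If a different file or link with the same name already exists in the directory, os.symlink raises FileExistsError instead of overwriting it.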

I will write another post to ask for some clarifications about the 7.3 server installation.

Thank you