I have been trying to submit jobs for Gems/Simics following the
directions on the wiki and although the jobs are submitted correctly
and the condor_q reports that they are running (R), they never finish.
For example I have created a simple simics configuration doing
nothing... just waiting in prompt ...and I submit a job to simulate
1000cycles. When executing it on my client VM it takes like 20-30secs
altogether.
I even put the files on the /mnt/local of my virtual machine and
executed from another client and it completed correctly in it.
I have all the files in /mnt/ganfs/C026090208/test
mod_opal_commands.py
mod_ruby_commands.py
mod_ruby_commands.pyc
opal.so
ruby.so
simics_condor_submit
simics_wrapper.sh
start_up_ruby.script
In theory if someone execute simics_wrapper.sh in a VM with simics it
is running fine.
This is my simics_condor_submit
*****************************************************************************************************************
# Condor submit script for the Simics Archer tutorial example
# Simics always runs in vanilla mode
universe = vanilla
# what will execute remotely is a "wrapper" script - which prepares the
# Simics workspace and runs Simics itself
executable = simics_wrapper.sh
# Specify requirements for job - the job will run on a machine that:
# 1) has the Simics module installed (you need this for all Simics
jobs in Archer), and
# 2) has and a minimum of 512MB RAM (you can change this according to your job)
Requirements = HasArcherSimics == TRUE && Memory >= 1024
# set up output, error and log files
log = simics.$(Cluster).$(Process).log
error = simics.$(Cluster).$(Process).err
output = simics.$(Cluster).$(Process).out
# specify files to transfer from/to remote machine
should_transfer_files = yes
when_to_transfer_output = on_exit
transfer_input_files = start_up_ruby.script, ruby.so,
mod_opal_commands.py, mod_ruby_commands.py
transfer_output_files = screen_dump_1000.out
#error checking
on_exit_remove = (ExitBySignal == False) && (ExitCode == 0)
#queue submits the job to the queue
Queue
*********************************************************************************************************************
and my simics_wrapper.sh
********************************************************************************************************************
griduser@C026090208:/mnt/local/test$ cat simics_wrapper.sh
#!/bin/sh
# This script sets up a Simics workspace for execution on a remote or local
# Archer grid appliance
# This script is based on the Archer simics tutorial; to run your own
# simulation, you will need to change it top manage your own files
# Create and change into the Simics workspace directory, also storing it in
# the tgt_wrk_spc variable
mkdir new-workspace
cd new-workspace
tgt_wrk_spc=`pwd`
# Go to the Simics installation directory and run workspace setup script
cd /opt/virtutech/simics-3.0.31/bin
./workspace-setup $tgt_wrk_spc
# Go back to workspace directory
cd $tgt_wrk_spc
# Setup directories for the TLB modules used in the tutorial:
mkdir x86-linux
mkdir x86-linux/lib
mkdir x86-linux/lib/python
# note that we are within the new-workspace subdirectory; files transferred
# by Condor are one level up (../). Copy those to the right place.
cp ../ruby.so x86-linux/lib
cp ../mod_ruby_commands.py x86-linux/lib/python
cp ../mod_opal_commands.py x86-linux/lib/python
./simics -c /mnt/ganfs/C123175188/abisko-8cpu-after-boot.config
-no-win -batch-mode -stall -x ../start_up_ruby.script >
../screen_dump_1000.out
********************************************************************************************************
And the condor_q output:
********************************************************************************************************
-- Submitter: C026090208.ipop : <5.26.90.208:9501> : C026090208.ipop
ID OWNER SUBMITTED RUN_TIME ST PRI SIZE CMD
17.0 griduser 10/16 08:35 0+15:30:30 R 0 26.9 simics_wrapper.sh
18.0 griduser 10/16 08:35 0+15:30:30 R 0 26.9 simics_wrapper.sh
19.0 griduser 10/16 08:35 0+15:30:30 R 0 26.9 simics_wrapper.sh
20.0 griduser 10/16 08:35 0+15:30:30 R 0 26.9 simics_wrapper.sh
21.0 griduser 10/16 08:35 0+15:30:30 R 0 26.9 simics_wrapper.sh
22.0 griduser 10/16 08:35 0+15:30:30 R 0 26.9 simics_wrapper.sh
6 jobs; 0 idle, 6 running, 0 held
********************************************************************************************************
ANY suggestions are welcome.
Thanks,
Dimitris
UT-Austin