We backup/spool to disk volumes and copy/migrate jobs to tape. We have one tape drive.
If we have Concurrent jobs allowed so we can backup more than one host at a time to disk we run into issues with Consolidate jobs failing when jobs were migrated to the tape drive. If we only allow 1 job at a time it works as no second job tries to grab the tape drive.
list joblog jobid=23422
2020-02-13 21:56:23 myth-dir JobId 23422: Start Virtual Backup JobId 23422, Job=sch-hp-desktop-Users-Pictures.2020-02-13_21.56.19_07
2020-02-13 21:56:23 myth-dir JobId 23422: Consolidating JobIds 22736,23413
2020-02-13 21:56:23 myth-dir JobId 23422: Bootstrap records written to /var/lib/bareos/myth-dir.restore.2.bsr
2020-02-13 21:56:23 myth-dir JobId 23422: Connected Storage daemon at myth..com:9103, encryption: PSK-AES256-CBC-SHA
2020-02-13 21:56:23 myth-dir JobId 23422: Using Device "FileStorage" to read.
2020-02-13 21:56:23 myth-dir JobId 23422: Created new Volume "AI-Consolidated-1886" in catalog.
2020-02-13 21:56:23 myth-dir JobId 23422: Using Device "FileStorage3" to write.
2020-02-13 21:56:23 myth-sd JobId 23422: stored/
acquire.cc:159 Changing read device. Want Media Type="LTO5" have="File"
device="FileStorage" (/mnt/bacula)
2020-02-13 21:56:23 myth-sd JobId 23422: Releasing device "FileStorage" (/mnt/bacula).
2020-02-13 21:56:23 myth-sd JobId 23422: F
atal error: stored/acquire.cc:219 No suitable device found to read Volume "DA1391L5" 2020-02-13 21:56:23 myth-sd JobId 23422: Releasing device "Tand-LTO5" (/dev/nst1).
2020-02-13 21:56:24 myth-dir JobId 23422: Error: Bareos myth-dir 19.2.5 (03Feb20):
Build OS: Linux-3.10.0-1062.9.1.el7.x86_64 ubuntu Ubuntu 16.04 LTS
<snip>
SD Errors: 1
SD termination status: Fatal Error
Accurate: yes
Bareos binary info:
bareos.org build: Get official binaries and vendor support on
bareos.com Termination: *** Backup Error ***
So when it swaps to the tape (which is busy with another Consolidate VIrtualFull this job fails.
Maximum Concurrent Jobs = 1
Is there a way to make the job hold and wait for the drive to free up?
I can use
Reschedule on Error = yes
Reschedule Times = 5
Reschedule Interval = 1800
But this causes very long pauses as many consolidation jobs are very short/fast, but can be long if it includes full.