Good day,
When benchmarking my backup with and without spooling it seems that with spooling the jobs take longer than without spooling, even though the efficiencies of the resources have been increased.
My thinking is that this is due to the fact that Bareos first spools data, then despools it, then spools it again and so on - causing 100% network usage, followed by 100% tape usage alternating.
My tape bandwidth and network bandwidth are about the same, around 1 Gbit/s.
Instead, I would like to propose a change where the spooling process creates two spool files (each bound to 50% of Maximum Spool Size), Spool-A and Spool-B.
When Spool-A is filled up, Bareos-SD starts to despool that file while continuing to spool data to Spool-B.
If Spool-B fills up while Spool-A is running, the spooling is paused until the spooling of Spool-A is done. At that moment Spool-B is despooled and Spool-A is used to spool data.
This should make sure that the slowest part is always being used 100% during the backup job, which should be a significant improvement if you are running a single large job.
If you are running multiple jobs in parallel then this solution will do little to help you as you will spool and despool onto different spool files and possibly multiple drives.
Thoughts?
Regards,