rsync, especially when using ssh, can incur significant overhead. If
you are on a private network, you can disable encryption (--rsh=rsh)
and get better speed up.
What all are you archiving? The whole run directory? Does the run
directory contain images? cifs? bcls? qseqs? Just the GERALD
directory? If you are archiving the whole run directory, you might
want to think about archiving less, e.g., a BAM with all the sequences,
qualities, and alignments.
--
David Dooling
The Genome Center at Washington University
http://genome.wustl.edu/
Agreed. If you have to use ssh, you can get slightly better data
rates by usnig the arcfour cipher, instead of the default AES or 3des.
There's also HPN-SSH[1] which does a number of things in order to
speed up SSH transfers, including adding a "no-encryption" cipher
(authentication is *ALWAYS* encrypted; passwords are never sent in the
clear).
[1] http://www.psc.edu/networking/projects/hpn-ssh/
> What all are you archiving? The whole run directory? Does the run
> directory contain images? cifs? bcls? qseqs? Just the GERALD
> directory? If you are archiving the whole run directory, you might
> want to think about archiving less, e.g., a BAM with all the sequences,
> qualities, and alignments.
If it takes more than a few hours, you may have network or disk IO
problems. The most recent analysis directory we have is about 73G.
You should be able to transfer this on a clean network quickly. I
just did a quick test using a 3,511MB file, reading from an rsync
export (not using SSH for tunneling), and maxed out at about 40MB/sec,
and averaged around 30MB/s (both systems involved were busy doing
other things; this was not a "clean" benchmarking envrionment). For a
73G directory, you are looking at about 250 seconds for pure data
transfer, plus a few minutes for rsync to walk the directory tree.
All told, well under an hour. :)
--
Jesse Becker
Every cloud has a silver lining, except for the mushroom-shaped ones,
which come lined with strontium-90.
For a 50G run, the Basecalls/Bustard directory should be about 500 GB.
If it is taking 2 days, you are getting about 24 Mbps from your 1 Gbps
link. You have a bottleneck somewhere and the most likely culprits
have already be posted on this thread. Someone at your site will need
to do some sleuthing.
--
You received this message because you are subscribed to the Google Groups "solexa" group.
To post to this group, send email to sol...@googlegroups.com.
To unsubscribe from this group, send email to solexa+un...@googlegroups.com.
For more options, visit this group at http://groups.google.com/group/solexa?hl=en.
> --
> You received this message because you are subscribed to the Google Groups "solexa" group.
> To post to this group, send email to sol...@googlegroups.com.
> To unsubscribe from this group, send email to solexa+un...@googlegroups.com.
> For more options, visit this group at http://groups.google.com/group/solexa?hl=en.
>
>
--