I've been running juicer pre for 9 days now, and I'm wondering what "normal run time" would be with my set of data.
I've got a deNovo Pac Bio assembly for my reference which is in 9,205 pieces. The goal was to see if adding in a lane for HiC would give us a better assembly.
I'm running on a GPU, using 16 cores, and the protocol I was looking at said to run pre with a -q 1 and then follow-up with a -q 30.
I'm still running the -q 1 side of things.
Because it's been 9 days, I'm a bit reluctant to kill the job without checking in here first ;)
inter.hic is currently at 1.7G and was written to less than a minute ago (as was my SLURM output)
Sequenced Read Pairs: 145,477,454
Normal Paired: 128,307,544 (88.20%)
Chimeric Paired: 3,926,707 (2.70%)
Chimeric Ambiguous: 9,924,800 (6.82%)
Unmapped: 3,318,403 (2.28%)
Ligation Motif Present: 5,990,072 (4.12%)
Alignable (Normal+Chimeric Paired): 132,234,251 (90.90%)
Unique Reads: 119,080,348 (81.85%)
PCR Duplicates: 13,012,416 (8.94%)
Optical Duplicates: 141,487 (0.10%)
Library Complexity Estimate: 625,661,246
Intra-fragment Reads: 9,385,779 (6.45% / 7.88%)
Below MAPQ Threshold: 101,281,196 (69.62% / 85.05%)
Hi-C Contacts: 8,413,373 (5.78% / 7.07%)
Ligation Motif Present: 684,188 (0.47% / 0.57%)
3' Bias (Long Range): 54% - 46%
Pair Type %(L-I-O-R): 25% - 25% - 25% - 25%
Inter-chromosomal: 3,698,383 (2.54% / 3.11%)
Intra-chromosomal: 4,714,990 (3.24% / 3.96%)
Short Range (<20Kb): 4,298,451 (2.95% / 3.61%)
Long Range (>20Kb): 414,058 (0.28% / 0.35%)
(P.S. I only have easy access to the one GPU node, so starting the quality-30 cut-off run while leaving the other running isn't realistic right now :( )