juicer result - .hic files are empty

334 views
Skip to first unread message

Benedikt

unread,
Jun 9, 2020, 1:46:37 AM6/9/20
to 3D Genomics
I am trying to run juicer and everything seems to run to completion without error, however the .hic files contain no information. The file size of the inter.hic file is ~15kb (attached). When I try to load it into juicebox, I receive an error "Error loading [object File]: RangeError: Offset is outside the bounds of the DataView".

Here is the content of inter.txt:

Experiment description: Juicer version 1.6; BWA 0.7.16a-r1181; 8 threads; splitsize 90000000; openjdk version "1.8.0_91"; Juicer Tools Version 1.22.01; /work/project/ladsie_002/juicer/scripts/juicer.sh -d ./6-juicer -p ./5-phase/output/phased.0.fasta.fai -y ./6-juicer_old/P10_assembly_MboI.txt -z ./5-phase/output/phased.0.fasta                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                 
Sequenced Read Pairs:  198.719.564
 Normal Paired: 155.223.372 (78.11%)
 Chimeric Paired: 26.033.901 (13.10%)
 Chimeric Ambiguous: 8.872.717 (4.46%)
 Unmapped: 8.589.574 (4.32%)
 Ligation Motif Present: 103.013.744 (51.84%)
Alignable (Normal+Chimeric Paired): 181.257.273 (91.21%)
Unique Reads: 149,494,961 (75,502,505.56%)
PCR Duplicates: 31,732,449 (16,026,489.39%)
Optical Duplicates: 29,863 (15,082.32%)
Library Complexity Estimate: 455,145,782
Intra-fragment Reads: 1,981,486 (997129.62% / 1.33%)
Below MAPQ Threshold: 60,192,008 (30290011.52% / 40.26%)
Hi-C Contacts: 87,321,467 (43942183.18% / 58.41%)
 Ligation Motif Present: 42,374,626  (21323892.53% / 28.35%)
 3' Bias (Long Range): 81% - 19%
 Pair Type %(L-I-O-R): 25% - 25% - 25% - 25%
Inter-chromosomal: 43,902,142  (22092573.94% / 29.37%)
Intra-chromosomal: 43,419,325  (21849609.25% / 29.04%)
Short Range (<20Kb): 18,595,620  (9357746.37% / 12.44%)
Long Range (>20Kb): 24,823,618  (12491819.10% / 16.60%)


The intermediate files are reasonably sized, for example merged_nodups.txt is about 45GB. I have no idea where to look for the problem, any help is appreciated.
inter.hic

Neva Durand

unread,
Jun 9, 2020, 7:26:41 AM6/9/20
to Benedikt, 3D Genomics
Is there no output in the debug folder around hic file creation? I am unsure "fai" would work for chrom.sizes though it might be fine.

It's clear that the file isn't correct, it's too small.

--
You received this message because you are subscribed to the Google Groups "3D Genomics" group.
To unsubscribe from this group and stop receiving emails from it, send an email to 3d-genomics...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/3d-genomics/cb157adb-0c59-4223-b3a9-0b74cd5881c1o%40googlegroups.com.


--
Neva Cherniavsky Durand, Ph.D.
Pronouns: she, her, hers
Assistant Professor, Aiden Lab

Benedikt

unread,
Jun 9, 2020, 8:31:55 AM6/9/20
to 3D Genomics
I missed the error in the debug folder, my bad. It says during hic creation:

Picked up _JAVA_OPTIONS: -Xmx49152m -Xms49152m
java.lang.RuntimeException: No reads in Hi-C contact matrices. This could be because the MAPQ filter is set too high (-q) or because all reads map to the same fragment.
at juicebox.tools.utils.original.MatrixZoomDataPP.mergeAndWriteBlocks(MatrixZoomDataPP.java:276)
at juicebox.tools.utils.original.Preprocessor.writeMatrix(Preprocessor.java:970)
at juicebox.tools.utils.original.Preprocessor.writeBody(Preprocessor.java:659)
at juicebox.tools.utils.original.Preprocessor.preprocess(Preprocessor.java:425)
at juicebox.tools.clt.old.PreProcessing.run(PreProcessing.java:139)
at juicebox.tools.HiCTools.main(HiCTools.java:94)

I am unsure what causes this error, the stats look okay and the merged_nodups.txt contains 45GB. Could this be because of the fai file?

I can chop off the unneeded columns from the fai file to create a true chrom.sizes and run it again, no problem. But fai used to work fine in the past (if I remember correctly). Is there anything else you would recommend checking before I give it another try?

Thank you for your help!
To unsubscribe from this group and stop receiving emails from it, send an email to 3d-ge...@googlegroups.com.

Neva Durand

unread,
Jun 9, 2020, 9:27:49 AM6/9/20
to Benedikt, 3D Genomics
I would check that the sequence names in the fai file match exactly the sequence names in the merged_nodups file (fields 2 and 6).

If you download a new jar, you will see that Pre no longer throws out intrafragment reads by default, which might be the other source of the problem.

To unsubscribe from this group and stop receiving emails from it, send an email to 3d-genomics...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/3d-genomics/370054b3-e4d6-4d9d-83b1-4703103bc468o%40googlegroups.com.
Reply all
Reply to author
Forward
0 new messages