hicpro2juicebox.sh: Could not find chromosome sizes file for: susScr3

591 views
Skip to first unread message

Sarah Djebali

unread,
May 26, 2016, 12:41:02 PM5/26/16
to 3D Genomics

Dear all,


I am trying to visualize hicpro results in juicebox for the pig genome.


For this I converted the hicpro output into a format that juicebox can read following the instructions here:

https://github.com/nservant/HiC-Pro/blob/master/doc/UTILS.rst


After installing the juicebox command line tools, I did the following:

module load bioinfo/Java8
/work2/project/fragencode/tools/hicpro/HiC-Pro_2.7.7/bin/utils/hicpro2juicebox.sh -i ~/fragencode/results/hic/sus_scrofa/testrun20160422.v2.7.7/Pig1/hicpro/output/hic_results/data/liver/liver_allValidPairs -g susScr3 -r /work2/project/fragencode/data/species/sus_scrofa/Sscrofa10.2.84/restrictionfragments_HindIII.bed -j /usr/local/bioinfo/src/Juicebox/juicebox-1.4/out/artifacts/Juicebox_clt_jar/Juicebox.jar


and I got the following error message:
Could not find chromosome sizes file for: susScr3


I also tried with the -c option but it gave me the same error.


Do you know how I can pass the chromosome size file for this genome?


Best regards,
Sarah Djebali

Neva Durand

unread,
May 26, 2016, 1:30:00 PM5/26/16
to Sarah Djebali, 3D Genomics
Hi Sarah,

We suggest using our pipeline, Juicer, for creating Hi-C maps.  There's a lot of documentation on it here:


If you have already created a paired read file, you can use the Juicebox command line tools:


Scroll down and click the tab for Command Line Tools, then Creating a Hi-C File.  

To use a genome that hasn't yet been included in the standard distribution, simply use UCSC's chrom.sizes file in place of the genome ID.  See the previous thread:  


Best
Neva



--
You received this message because you are subscribed to the Google Groups "3D Genomics" group.
To unsubscribe from this group and stop receiving emails from it, send an email to 3d-genomics...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/3d-genomics/271bb2bb-5137-45a4-937b-3893a21af6d8%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.



--
Neva Cherniavsky Durand, Ph.D.
Staff Scientist, Aiden Lab

Lila Rieber

unread,
Jul 2, 2016, 11:41:40 PM7/2/16
to 3D Genomics
You need to have the S scrofa chromosome sizes file in the same directory as the reference genome. The format of the file name must be [reference genome name].chrom.sizes. For example, if your reference genome is in ../references/s_scrofa.fasta, the sizes file must be ../references/s_scrofa.fasta.chrom.sizes. Here's the file format for the human genome: https://genome.ucsc.edu/goldenpath/help/hg19.chrom.sizes.  I don't know of a file of this for S scrofa, but you could make it manually. 
Reply all
Reply to author
Forward
0 new messages