Hello,
I've been working on a genome assembly for a diploid plant genome of about 4Gbp. I've generated an initial assembly using PacBio hifi reads (25X) and hifiasm. The result was about 3100 contigs. Then I ran 3d-dna pipeline using Omni-c HiC reads, about 100X. Resulting assembly.FINAL.fasta file had over 9000 scafolds, but N50 and largest contig were improved compared to initial assembly.
I've loaded the final assembly and heat map into JuiceBox, images are attached. The heatmap looks ok to me, I just need to work manually on finishing the assembly.
I have several questions though.
- why does FINAL.fasta file have many more scaffolds that the initial assembly
- my JuiceBox is running very slow, even slowing down the server significantly, is that due to a large number of scaffolds in the final assembly
- from the zoomed image it seems that some chromosomes are made up of many small scaffolds, is there any way to merge them
- can I move chromosomes around instead of scaffolds, that would be much more convenient
- Also, where can I check the number of generated chromosomes, I'm not sure where to find that number
I actually really like JuiceBox, nice tool.
Best regards,
Krešimir Križanović