Hi,
Thank you so much for juicer, 3D pipeline and giving option to edit the contact map in order to correct the misalignment.
I started genome assembly for a highly heterozygous plant genome of genome size 420Mb using with HiFi sequence dataset (raw read N50 as 25Kb). Using a total of 37x genome coverage and HiFiasm program, followed by pruge_dup based analysis, contig level genome assembly was achieved within 54 contigs with contig N50 as 36Mb. This suggest that the genome is good and it must have included the repeats with several chromosomes should be gapless.
I next used Arima HiC kit1, and HiC reads were mapped using juicer. I followed this with scaffolding using 3D pipeline and it is still running. I have two questions-
1. It has been over 1 week that the lastz based allignment is going. I was able to get .rawchrom.fasta. I am curious if this is normal and if I can speed up the process.
2. Since genome was well assembled, I checked the assembly.0.hic file using juicerbox. Contact map looked good overall, but I am confused on two sites.
Overall HiC contact map looks like shown below-
The plant that I am working on has eight chromosomes, and I could see these eight chromosomes, and the rest seems repeats that were not aligned or seem to be with any pseudo molecule. Here, if we see chromosome 2 (number 2 from top), you will be able to see as If there is a strange shape, and I am not sure if I need to correct this one although I feel that it could be repeat. Looking at higher resolution, you can see two blocks-
If I removed these that are shown through green lines, assembly looks like this-
The black line just shows the section that got excluded.
I am just worried if I am trying to artificially exclude repeats which should be there. Since this assembly was generated using HiFi datasets and I used stringent conditions, is it good to accept these structures as real or shall i exclude them.
I am using jucierbox ver 1.11.08, and 3D pipeline were downloaded on 18th July 2021. My server has 1Tb ram, 128 cores.
Kindly let me know if you need any information in order to help me out. Thank you so much in advance,
with best regards
Amit