Hi,
I've been reading many posts here and am still unsure how I can tweek parameters towards a better assembly. In annex I have my 0 iteration of the assembly ( 1- an overview and 2- a zoomed picture).
It seems as if there is still possible scaffolding, but small misassemblies (zoom) probably disrupt the connection patterns between scaffolds. I loaded bed and wig files but am not sure how to take decisions from here. It seems to find all mismatches very well but it even finds them where it should not, based on drops in the depletion score.
I understand that the -r serves to correct misassemblies, but it seems to pick them up and throw them in a garbage bin. Increasing this parameter makes the garbage area bigger and bigger, and many contigs remain away from scaffolds where they belong.
The last two annexed pictures are the raw-chromossome version of a default run (-r 3) and a -r 6 run (same color scale: 0-500-1000). Can you honestly evaluate which is better? It seems very subjective to me.
I'm not sure more -r rounds are good. Unless I can somehow better recover excluded contigs, perhaps with the seal stage?
Maybe I should be content with this and move to manual curation.
Resume of points:
- 3d-dna could scaffold more?
- Finding misassemblies is too sensitive?
- More rounds are exuding good contigs from assembly?
Kind regards,
Ricardo
PS: I really admire you guys, great work!