Hello Leszek,
I’m using Redundans to help improve some de novo assembled genomes (assembled using Platanus with 250 bpPE+500bpPE+2kbMP+6kbMP+15kbMP libraries), I ran redundans on the scaffold.fa file generated by Platanus.
My first two attempts – HN_red and HH_red generated good results, this was using the older version of Redundans in early 2017, I emailed you the log files. Redundans seemed to have worked very effectively in removing redundant scaffolds, though I don’t have the contigs.reduced.fa.hist.png file to look at the identity histogram. The reduced genomes had improved N50, expected genome size and reduced scaffold number.
My recent attempt was on a different assembly (Hmiss_redundans), and used same Platanus parameters as the above. After running redundans on the scaffold.fa, the resulting genome size was smaller than expected, a very big reduction in scaffold numbers and a good increase in N50. The contigs. reduced.fa.hist.png file (attached) shows that majority of scaffolds that were removed were not identical. Maybe I can fix it by increasing the –identity? But the other concern is the high variability seen in all the libraries which was not seen in my earlier attempts, I’m not sure if it is because of the newer version or the quality of the libraries was indeed bad. I emailed you both the Hmiss_redundans log file and the Hmiss_ reduced.fa.hist.png file.
I will appreciate any suggestions you could provide.
Thanks,
Sumitha