Hi Charles,
You don't need to combine the read files. You can specify all pairs of files in a single command, ie.
transabyss --SS --pe
replicate_01_R1.fq
replicate_01_R2.fq
replicate_02_R1.fq replicate_02_R2.fq ... replicate_12_R1.fq replicate_12_R2.fq ...
Hope that helps!
Ka Ming
To unsubscribe from this group and stop receiving emails from it, send an email to trans...@googlegroups.com.
Hi Charles,
Yes, there is a difference. If you use the `--SS` option for non-strand specific data, then the assembly will be less contiguous and you will also see more duplicated sequences. That is why it is not turn on by default.
The `--useblat` option is better at removing duplicated sequences during the initial assembly steps, but it is quite slow. If redundant sequences is an issue, you can use `transabyss-merge` after the assembly has completed.
Ka Ming
To view this discussion on the web visit https://groups.google.com/d/msgid/trans-abyss/53c15451-5d92-49e0-b450-b6798ab3e3af%40googlegroups.com.
Hi Charles,
There are benefits in removing low coverage sequencing errors early in the assembly process. Since you are already using multiple k-mer sizes and merging the assemblies with `transabyss-merge`, the differences are less noticeable.
Ka Ming