How did you deal with possible PCR or optical duplicate
reads in your metagenomic sample? If you remove them
("dedup"), what software do you use?
If I use Picard Tools MarkDuplicates with single-end reads,
it seems to remove identical reads that map to the same
location, and these reads may not all be duplicates. I am
concerned that "dedup'ing" with MarkDuplicates will remove
too many reads, affecting abundance estimates. On the other
hand if duplicates are left in, that also effects abundance
estimates.
How do you handle this issue of possible duplicates?
--
You received this message because you are subscribed to the Google Groups "MetaPhlAn-users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to metaphlan-users+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.
Thanks a lot for the reply (and also for the promptness)!