Superfocus unclassifed reads

6 views
Skip to first unread message

lka...@gmail.com

unread,
Oct 11, 2017, 3:28:48 PM10/11/17
to Edwards Lab Tools
Hello,

I have been trying several pipelines to functionally annotate metagenomes. I have a dataset of ~200 metagenomes with a range 0.5-5 million per sample. While I expect a portion of the reads to be hypothetical or unknown, I am finding that a very large proportion of reads (65-70%) are not classified. I am now trying SUPER-FOCUS with the different aligners but I would like to know if this is typical of what you see in other samples (Like the HMP or coral reef samples in your paper) and if using the -k 1 flag is the only/easiest way to find the reads that are not aligning so I can pull them for downstream analysis?

Many thanks.

geni....@helix.com

unread,
Oct 13, 2017, 11:26:13 AM10/13/17
to Edwards Lab Tools
Hi there,

Yes, this is the easiest way now (set -k 1) and write a simple script to retrieve unmapped reads from the input.

I will add in my list of future features to retrieve unmapped reads.

Thanks

Reply all
Reply to author
Forward
0 new messages