Dear all,
So far, I have used the defaults from the human-gut-profiler.ngl, whenever I performed gene catalog mappings. However, now I wonder whether 45bp is not relatively short if I'm using 2x150bp as input.
igc_mapped_post = select(igc_mapped) using |mr|:
mr = mr.filter(min_match_size=45, min_identity_pc=95, action={drop})
if not mr.flag({mapped}):
discard
Has anyone ever performed any benchmarking on this? It would be pretty cool if we could discuss that in this community.
Cheers,
Ulrike
p.s. using GMGC as a reference now, just posted the original code snippet