Combining contigs from separate assemblies

526 views
Skip to first unread message

Zoey Werbin

unread,
May 24, 2021, 10:46:28 AM5/24/21
to Anvi'o
Hi Anvi'o community,

I have about 50 metagenomes that were assembled separately into contigs. I'd like to carry out a similar binning workflow to the one described in this post, except without the co-assembly. To do this, it seems I need to concatenate the FASTA files with each sample's contigs (into a single contigs database), then map each sample to the concatenated contigs database. Then, I'll use the workflow described in the blog post (merging each sample's profile, then clustering contigs). 

However, this 2016 thread mentions that when combining contig files, "multiple scaffolds might overlap (i.e., same part of a genome). It would dilute mapping signal and render the analysis noisy and difficult." Does the anvi-cluster-contigs function essentially fix this problem? Or could something like anvi-dereplicate-genomes be used to remove almost-duplicate contigs? 

Thank you for any pointers! 
Zoey

A. Murat Eren

unread,
May 24, 2021, 5:06:28 PM5/24/21
to Anvi'o
Hi Zoey,

I think the best practice would be to reconstruct genomes from your individual samples, then pool all genomes together and de-replicate them to generate a final non-redundant genomic collection, then use that collection to recruit reads from all samples to study their distribution across samples and population structures.


Best wishes,
--

A. Murat Eren (Meren) | he/him
http://merenlab.org :: twitter


--
Anvi'o Paper: https://peerj.com/articles/1319/
Project Page: http://merenlab.org/projects/anvio/
Code Repository: https://github.com/meren/anvio
---
You received this message because you are subscribed to the Google Groups "Anvi'o" group.
To unsubscribe from this group and stop receiving emails from it, send an email to anvio+un...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/anvio/c021b1bf-473f-4eef-94ee-74ba4cd23472n%40googlegroups.com.
Reply all
Reply to author
Forward
0 new messages