Minimal size and multiple datasets

134 views
Skip to first unread message

Davide Cittaro

unread,
May 21, 2021, 5:39:39 AM5/21/21
to delly-users
I'm trying to use delly on a large number of low coverage samples and I get this error:

Sample has not enough data to estimate library parameters! File: …

I now filtered files smaller than 500kb, but I'm curious: which is the smallest dataset size allowed for a delly run?
Given that the samples available for the analysis may influence it, is it expected to have some difference for the following approaches?

- multisample calling
- single sample calling repeated multiple times
- single sample calling on a merge of multiple samples (provided I remove read groups as I get "Warning: Multiple sample names (@RG:SM) present in the BAM file!" otherwise)

Thanks
d

Tobias Rausch

unread,
May 25, 2021, 3:32:36 AM5/25/21
to Davide Cittaro, delly-users
Hi,

Delly requires about 10,000 properly aligned pairs for every sample. That's the same for single-sample or multi-sample calling, it is a per-sample quality check. You can, of course, pool samples (merge BAMs) then Delly regards this as one sample.

Best, Tobias
 

--
You received this message because you are subscribed to the Google Groups "delly-users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to delly-users...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/delly-users/0f5dd86a-8641-4917-9a1b-50bae634d509n%40googlegroups.com.
Reply all
Reply to author
Forward
0 new messages