using more than one set of paired-end read files to run GetOrganelle

17 views
Skip to first unread message

Oscar Alejandro Perez Escobar

unread,
Aug 13, 2024, 12:59:45 PM8/13/24
to GetOrganelle
Dear GetOrganelle team,

I have a two sets of paired end read files for a series of samples and I am wondering if there is any way to run GetOrganelle using as input multiple paired-end read files? For example, would the argument "-1 Sample1_ABC_R1.fq,Sample1_DEF_R1.fq -2 Sample1_ABC_R2.fq,Sample1_DEF_R2.fq" be valid? 
My read files are huge (these are produced from old herbarium specimens) and I am intending to use them all to retrieve complete or partial plastid and nuclear genomic regions. 
Also, any advice on how to optimise the analysis for degraded or contaminated samples is most welcome (I am open to consider a collaboration in this project!).

Best wishes,

Oscar

JianJun Jin

unread,
Aug 15, 2024, 6:11:42 PM8/15/24
to Oscar Alejandro Perez Escobar, GetOrganelle
Hi Oscar,

Thanks for reaching out. Currently (GetOrganelle 1.7.7.1~1.8.0), you can use -1 Sample1_ABC_R1 -2 Sample1_ABC_R2 -u Sample1_DEF_R1.fq,Sample1_DEF_R2.fq. For most samples, using the first pair as a pair and the rest as unpaired like this, is generally good enough.

For genome assembly using short reads, more is not better. For organelle genomes, GetOrganelle will estimate the no. reads to use by estimating the target coverage and limiting it to be smaller than a bound (default: 500). You may change the value by using `--reduce-reads-for-coverage`.


Hope this helps.

JianJun


--
You received this message because you are subscribed to the Google Groups "GetOrganelle" group.
To unsubscribe from this group and stop receiving emails from it, send an email to getorganelle...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/getorganelle/19550295-732d-403f-89c9-d84e4338bb9dn%40googlegroups.com.
Reply all
Reply to author
Forward
0 new messages