Dear John,
The process_radtags program works fine on double-digest RAD data. Data
from the GBS protocol is not double-digest RAD data.
Your comment exactly reverses the problem you are encountering. The
process_radtags program is very simple with respect to cutsites. It
looks at the sequence on the single- and paired-end reads, and looks for
the remainder cut site sequence that should be present. If it doesn't
find the expected sequence, it tells you so. If it finds the sequence
with a sequencing error as well, it will correct the sequencing error
(with the -r flag).
You say that when you disable this check, 99.8% of your reads pass "QC",
but you have now disabled QC effectively, so your statement is
meaningless. Well, the phred scores of your sequenced nucleotides are
high enough to avoid having the reads discarded, so that is good.
The question you should ask yourself is not, "Why does process_radtags
not work very well?" And instead ask the question, "Why do my reads not
have the cut sites I expect them to have, based on the molecular
protocol that was used?"
The latter question is more meaningful with respect to your biological
data analysis.
julian
Juncong Yan wrote on 3/7/19 2:52 PM:
> Hi Paula
> I am doing the same job currently and meet the same issue you have. It
> looks process_radtags does not work very well on ddRAD. the code "
> process_radtags -P -p ./Rawdata/ -o ./taged_data/ -b
> ./barcodes/barcode_1 -c -q -r --index_null --renz_1 mspI --renz_2
> nlaIII" I used lead to 87.4% of RAD cutsite not found drop. The way I
> found to solve this issue is using " process_radtags -P -p ./Rawdata/ -o
> ./taged_data/ -b ./barcodes/barcode_1 -c -q -r --index_null --renz_1
> mspI --renz_2 nlaIII --disable_rad_check". after that 99.8% reads passed QC.
> John
>
>
> 在 2019年3月6日星期三 UTC+13上午2:21:09,Paula Reyes写道:
>
> Hello,
> I have data paired-end and double digest (apeKI and pstI). I run
> process_radtags with 384 samples and I found that the average
> percentage of RAD cutsite not found drops is 70%. I think the
> process_radtags discards reads that do not have the RAD cutsite at
> both ends. I followed this thread
> (
https://groups.google.com/forum/#!topic/stacks-users/Sa8X4aArDHA
> <
https://groups.google.com/forum/#!topic/stacks-users/Sa8X4aArDHA>)