Hello,
I am running process_radtags 2.53 on a set of sequences obtained from an Illumina NovaSeq 6000 platform at Genome Quebec. They were able to demultiplex the reads already (I'm just doing quality filtering), so I assume that the barcodes were successfully read by the company. The problem is as follows:
I developed my rad tags using a double digest approach with paired-end combinatorial indexed barcodes. However, process_radtags will not read my files as paired (only as single-end). It was able to read both the sequence files (which are fastq.gz format) and the barcode file I'm using, so I went forward (note: this was a test to get process_radtags to read the files, so I did not include any cleaning or rescue functions). All of the reads returned as ambiguous barcodes, which is odd given that the samples were already demultiplexed from pooled libraries and the barcode file was formatted accordingly.
Here are some examples of the file names for my read files:
NS.1518.002.Msp1_4---Pst1_7.121419-31_R1.fastq.gz
NS.1518.002.Msp1_4---Pst1_7.121419-31_R2.fastq.gz
When I tried this again adding in the -c -q -r functions, I got the error message (Segmentation fault). My current line of code is as follows:
process_radtags -p ./Da_Data/ -b ./Barcodes/Sample_Barcode_Match_Tab.csv -o ./Prophecy/ -c -q -r --index_index --renz_1 pstI --renz_2 mspI
Does anyone know what the issue might be? Thank you!
-Matt Penney
Acadia University