Thank you for answering Julian!
I will definitely contact the service provide about this matter.
I tried demultiplexing with this command:
process_radtags -i gzfastq -1 ~/Sara_raw/Undetermined_S0_L001_R1_001.fastq\ \(2\).gz -2 ~/Sara_raw/Undetermined_S0_L001_R2_001.fastq\ \(2\).gz -o ./ -b ~/Sara_raw/barcodes_1st_run.txt -c -q -r --null_index --renz-1 aseI --renz-2 nsiI
This is what I got back:
Processing paired-end data.
Using Phred+33 encoding for quality scores.
Found 1 paired input file(s).
Searching for single, index barcode.
Loaded 96 barcodes (8bp).
Will attempt to recover barcodes with at most 1 / 1 mismatches.
Processing file 1 of 1 [Undetermined_S0_L001_R1_001.fastq (2).gz]
Reading data from:
/home/sara/Sara_raw/Undetermined_S0_L001_R1_001.fastq (2).gz and
/home/sara/Sara_raw/Undetermined_S0_L001_R2_001.fastq (2).gz
Processing pairs of RAD-Tags...1M...2M...3M...4M...5M...6M...7M...8M...
17130308 total reads; -17116252 ambiguous barcodes; -13855 ambiguous RAD-Tags; +14125 recovered; -0 low quality reads; 201 retained reads.
Closing files, flushing buffers...done.
17130308 total sequences
17116252 barcode not found drops (99.9%)
0 low quality read drops (0.0%)
13855 RAD cutsite not found drops (0.1%)
201 retained reads (0.0%)
Details logged: './process_radtags.Sara_raw.log'
For a summary, execute one or more:
stacks-dist-extract ./process_radtags.Sara_raw.log total_raw_read_counts
stacks-dist-extract --pretty ./process_radtags.Sara_raw.log per_barcode_raw_read_counts
process_radtags is done.
Did I use the correct barcode option? Just for the reference, the first few lines in my 1st run barcode file look like this (they are not revcomp):
ATTACTCG AGGCTATA 1-1_47-3
TCCGGAGA AGGCTATA 1-5_47-5
CGCTCATT AGGCTATA 1-10_47-6
GAGATTCC AGGCTATA 1-15_49-2
ATTCAGAA AGGCTATA 21-1_49-3
GAATTCGT AGGCTATA 21-2_49-5
CTGAAGCT AGGCTATA 21-5_49-6
TAATGCGC AGGCTATA 21-8_15-1
CGGCTATG AGGCTATA 2-1_15-3
TCCGCGAA AGGCTATA 2-5_15-6
TCTCGCGC AGGCTATA 2-10_15-8
AGCGATAG AGGCTATA 2-20_28-2
ATTACTCG GCCTCTAT 20-4_28-4
TCCGGAGA GCCTCTAT 20-6_28-7
CGCTCATT GCCTCTAT 20-8_28-8
GAGATTCC GCCTCTAT 20-10_28-9
ATTCAGAA GCCTCTAT 40-3_24-3
GAATTCGT GCCTCTAT 40-5_24-4
CTGAAGCT GCCTCTAT 40-6_24-9
TAATGCGC GCCTCTAT 40-7_24-10
CGGCTATG GCCTCTAT 40-8_27-1
TCCGCGAA GCCTCTAT 6-2_1-1-K2
TCTCGCGC GCCTCTAT 6-10_13-5-K2
AGCGATAG GCCTCTAT 6-15_16-1-K2
ATTACTCG AGGATAGG 6-20
TCCGGAGA AGGATAGG 7-1
CGCTCATT AGGATAGG 7-10
GAGATTCC AGGATAGG 7-14
ATTCAGAA AGGATAGG 7-18
GAATTCGT AGGATAGG 3A-1
CTGAAGCT AGGATAGG 3A-5
TAATGCGC AGGATAGG 3A-10
CGGCTATG AGGATAGG 3B-1
TCCGCGAA AGGATAGG 3B-5
TCTCGCGC AGGATAGG 3B-9
AGCGATAG AGGATAGG 4-1
ATTACTCG TCAGAGCC 4-5
...
Should first and second columns be in original shape as I got them from the service or should I do revcomps of i7 and i5?
In the second part of demultiplexing I have to demultiplex the first 24 out of 96 samples again, and here is an example of a barcode file for the second round (they are revcomps):
name of the file is: barcodes_1-1_47-3
GAGTT 1-1
GTAGG 47-3
Is there anything else I can do or try to successfully demultifplex my data?
Any help would be greatly appreciated!
Best,
Sara