Hi Tony and Anya,
Tony I tried out your suggestions,starting with the extract barcodes and then joining the pairs (all info below). I checked out the barcodes file and in some cases it looks like there may not have been a barcode so instead it stripped off the first 8 bp of the primer. Additionally, when I tried joining the pairs a significant fraction remain unjoined and those that are joined are 400bp long for a region that should be about 300bp (515F/805R).
extract_barcodes.py -f /home/qiime/Desktop/Alex_16s_qiime/R1.fastq -r /home/qiime/Desktop/Alex_16s_qiime/R2.fastq -c barcode_paired_end -m /home/qiime/Desktop/Alex_16s_qiime/fasta-qual-mapping-files_from_MRDNA/validate_mapping_file_output/16s_mapping_corrected.txt --attempt_read_reorientation --bc1_len 8 --bc2_len 0 -o /home/qiime/Desktop/Alex_16s_qiime/trial_2/processed_seqs_removedbarcodes
Traceback (most recent call last):
File "/usr/local/bin/extract_barcodes.py", line 175, in <module>
main()
File "/usr/local/bin/extract_barcodes.py", line 171, in main
opts.attempt_read_reorientation, disable_header_match)
File "/usr/local/lib/python2.7/dist-packages/qiime/extract_barcodes.py", line 79, in extract_barcodes
forward_primers, reverse_primers = get_primers(header, mapping_data)
File "/usr/local/lib/python2.7/dist-packages/qiime/extract_barcodes.py", line 520, in get_primers
raise IndexError(("Mapping file is missing ReversePrimer field."))
IndexError: Mapping file is missing ReversePrimer field.
Made a new mapping file with ReversePrimer includes
validated it using the validate file.py
retry extract barcodes with new mapping file...
extract_barcodes.py -f /home/qiime/Desktop/Alex_16s_qiime/R1.fastq -r /home/qiime/Desktop/Alex_16s_qiime/R2.fastq -c barcode_paired_end -m /home/qiime/Desktop/Alex_16s_qiime/trial_2/validate_mapping_file_output/new_mappingfile_corrected.txt --attempt_read_reorientation --bc1_len 8 --bc2_len 0 -o /home/qiime/Desktop/Alex_16s_qiime/trial_2/processed_seqs_removedbarcodes
count_seqs.py -i fastq -o ./count_seqs.txt
barcodes file: 4335142
reads 1:281.1002 +/- 14.7983 4335142
reads 2: 289.3880 +/- 14.8808 4335142
Then tried joining the pairs
join_paired_ends.py -f /home/qiime/Desktop/Alex_16s_qiime/trial_2/processed_seqs_removedbarcodes/reads1.fastq -r /home/qiime/Desktop/Alex_16s_qiime/trial_2/processed_seqs_removedbarcodes/reads2.fastq -b /home/qiime/Desktop/Alex_16s_qiime/trial_2/processed_seqs_removedbarcodes/barcodes.fastq -o /home/qiime/Desktop/Alex_16s_qiime/trial_2/fastq-join_joined
check all resulting files with counts seqs...
216338 : /home/qiime/Desktop/Alex_16s_qiime/trial_2/fastq-join_joined/fastqjoin.join.fastq (Sequence lengths (mean +/- std): 400.4612 +/- 59.1968)
4118804 : /home/qiime/Desktop/Alex_16s_qiime/trial_2/fastq-join_joined/fastqjoin.un1.fastq (Sequence lengths (mean +/- std): 280.5051 +/- 14.9417)
4118804 : /home/qiime/Desktop/Alex_16s_qiime/trial_2/fastq-join_joined/fastqjoin.un2.fastq (Sequence lengths (mean +/- std): 288.7916 +/- 15.0263)
Not sure where to go from here....
Thanks again,
Theresa