Dear all,
I would like to analize illumina data (fastq) with the qiime pipeline.
However, in the tutorial for preparing the data I realized that for the split_libraries_fastq.py I need an extra fastq file for my barcodes.
I have found the extract_barcodes.py protocol. My problem with the protocol is that my sequences do not all start with the barcode.
Instead I have e.g. ACACTAG, NACACTGA, NNACAGATC or NNNACAGCTA and so on with N being the basepairs that are in front of the barcodes.
How could I best resolve this? How could I efficiently remove basepairs in front of the barcodes?
I would be really happy about any suggestions!
Heike