Hi,
I have a fasta and a qual file from the sequencing center as they joined the paired end reads together for us (MiSeq reads).
I ran the split_libraries.py command qiime@qiime-VirtualBox:~$ split_libraries.py -f illuminaseq.fasta -q illuminaseq.qual -m mappingfile.txt -o split_library_output -b
The output produces:
"Duplicate ID found in FASTA/qual file: %s" % label
ValueError: Duplicate ID found in FASTA/qual file: M02542:8:000000000-A7F0J:1:1102:17729:24017 1:N:0:1
I ran the grep -c command ~$ grep -c "M02542:8:000000000-A7F0J:1:1102:17729:24017" illuminaseq.fasta with the result being "2".
Is there a way to remove one of these IDs from the fasta file?
Thanks