I have a big FASTAQ file containing sequences from 333 samples. From this FASTAQ file, I would like to extract sequences from 108 of these samples.
I found this function “extract_seqs_by_sample_id.py” that looked promising because it allowed me to filter the sequences based on information from the Mapping file:
extract_seqs_by_sample_id.py -i inseqs.fasta -o outseqs_by_mapping_field.fasta -m map.txt -s "Treatment:Fast"
However, this function works only with FASTA file and not with FASTAQ file.
Does an equivalent function exist for FASTAQ file?
Thanks a lot.