It seems that shortbred_quantify won't recognize my fasta files. I was able to run the same markers with the wgs.fns example file, but they won't work with my data. Can someone tell what the problem is? Below are the first two lines of the file I tried followed by the output from shortbred:
(python2) barichd@JOAN9000:~/mouse/shortbred$ head -2 ../trimmomatic/output_1P.fa
>D00420:134:HC2LTBCXY:2:1101:1249:2139 1:N:0:CGTACTAG+GTAAGGAG
CCTATACCACATCGGACGGTCAGCGGTTTCGGGATAACCGATTACTCGATGAAAAATATTTAAAAACCAATATGGAGGAATTATTTGCTTATGAATATGAAGCAGTTAAAACAAATCAATCAAATACAAACGATAACCGAGCAGACAGTAG
(python2) barichd@JOAN9000:~/mouse/shortbred$ shortbred_quantify.py --markers ShortBRED_CARD_2017_markers.faa --wgs ~/mouse/trimmomatic/output_1P.fa --results outputresults.txt --tmp output_quantify
Tested usearch. Appears to be working.
('Using this version of usearch: ', 'v9.2.64')
Treating input as a wgs file...
usearch v9.2.64_i86linux32, 4.0Gb RAM (528Gb total), 56 cores
(C) Copyright 2013-16 Robert C. Edgar, all rights reserved.
http://drive5.com/usearchLicense:
bus...@kenyon.edu00:00 40Mb 100.0% Reading ShortBRED_CARD_2017_markers.faa
00:00 6.8Mb 100.0% Masking (fastnucleo)
00:00 48Mb 100.0% Word stats
00:00 48Mb 100.0% Alloc rows
00:00 49Mb 100.0% Build index
00:00 36Mb Buffers (3237 seqs)
00:00 53Mb 100.0% Seqs
List of files in WGS set:/home/barichd/mouse/trimmomatic/output_1P.fa
List of files in WGS set (after unpacking tarfiles):/home/barichd/mouse/trimmomatic/output_1P.fa
Working on file 1 of 1
Traceback (most recent call last):
File "/home/barichd/miniconda3/envs/python2/bin/shortbred_quantify.py", line 524, in <module>
for seq in SeqIO.parse(streamWGS, strFormat):
File "/home/barichd/miniconda3/envs/python2/lib/python2.7/site-packages/Bio/SeqIO/__init__.py", line 659, in parse
raise ValueError("Unknown format '%s'" % format)
ValueError: Unknown format 'unknown'
(python2) barichd@JOAN9000:~/mouse/shortbred$
Thanks.