python standardized_DIAMOND_analysis_counter.py -I Pre_NR50_BLAST_AQ_m8_results -D AQprotein.faa -F
Here is the error:
....
25M lines processed so far in 38.51 seconds.
Traceback (most recent call last):
File "standardized_DIAMOND_analysis_counter.py", line 93, in <module>
RefSeq_hit_count_db[splitline[1]] += 1
IndexError: list index out of range
I'll attach a screen shot of the top portion up a file that ran just fine ("Pre_NR17...") and one that is giving the index out of range error ("Pre_NR50..."). They look identical to me, but maybe I am missing something.
Thank you so much!
Cara
--
You received this message because you are subscribed to the Google Groups "SAMSA bioinformatics group" group.
To unsubscribe from this group and stop receiving emails from it, send an email to samsa-bioinformatic...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/samsa-bioinformatics-group/d6ad7647-a052-46b6-afef-f6409eb1538en%40googlegroups.com.
Now reading through the m8 results infile.
1M lines processed so far in 1.55984115601 seconds.
2M lines processed so far in 3.12057709694 seconds.
3M lines processed so far in 4.67945504189 seconds.
4M lines processed so far in 6.19663715363 seconds.
5M lines processed so far in 7.71381211281 seconds.
6M lines processed so far in 9.30063414574 seconds.
7M lines processed so far in 10.8132519722 seconds.
Traceback (most recent call last):
File "standardized_DIAMOND_analysis_counter.py", line 93, in <module>
RefSeq_hit_count_db[splitline[1]] += 1
IndexError: list index out of range
python DIAMOND_results_filter.py -I Pre_NR_17_RefSeq_annot_Flavobacterium_function.tsv -SO fibronectin -D /home/slowshare/fiorec/fiorec_share/samsa2/full_databases/RefSeq_bac.dmnd
I have tried the m8 file as the input file and the tsv file, nothing seems to work since I know that "fibronectin" is in the annotated results. I am trying to get the sequences that correspond to this annotation. The output of the above code says there are zero matches to 'fibronectin'. Am I using the script incorrectly? I also tried different formatted database because I did not see anything in the script about what format the database should be in (.dmnd, .faa, etc?).
I appreciate your time.
Cara
To view this discussion on the web visit https://groups.google.com/d/msgid/samsa-bioinformatics-group/44001345-7eb7-4142-a495-7dbfad3111b2n%40googlegroups.com.