Jimmy,
As you suggested, I replace the value in the "protein" with the first
word from the "protein_descr" in each search_hit entry. But when I run
PeptideProphetParser, it still gives the error message:
ERROR: NAN probability density detected. Please alert the
developer !!!
Is there anything else I need to fix?
Thanks!
Ping
Here is the example of the result after I replace protein/
protein_descr
for not decoy string, it looks like this:
<search_hit hit_rank="1" peptide="YPSRPLPPPPPFGLGFVPPPPPPYGPGR"
peptide_prev_aa="R" peptide_next_aa="I" protein="gi|73975149|ref|
XP_862242.1|" num_tot_proteins="3" num_matched_ions="15"
tot_num_ions="54" calc_neutral_pep_mass="2949.569"
massdiff="0.59985990298992" is_rejected="0" protein_descr="gi|
73975149|
ref|XP_862242.1| PREDICTED: hypothetical protein XP_857149 isoform 2
[Canis familiaris]" num_tol_term="2" num_missed_cleavages="0">
<alternative_protein protein="gi|73975151|ref|XP_862273.1|"
protein_descr="gi|73975151|ref|XP_862273.1| PREDICTED: hypothetical
protein XP_857180 isoform 3 [Canis familiaris]"/>
<alternative_protein protein="gi|73975153|ref|XP_862298.1|"
protein_descr="gi|73975153|ref|XP_862298.1| PREDICTED: hypothetical
protein XP_857205 isoform 4 [Canis familiaris]"/>
<search_score name="pvalue" value="0.000002382663488"/>
<search_score name="expect" value="0.028320338214958"/>
</search_hit>
For decoy string, it looks like this:
<search_hit hit_rank="2" peptide="QESARYSAKVTVAGLEESATEAQQQIR"
peptide_prev_aa="K" peptide_next_aa="S" protein="decoy_2098
2" num_tot_proteins="1" num_matched_ions="17" tot_num_ions="52"
calc_neutral_pep_mass="2949.481" massdiff="0.9728599071700
05" is_rejected="0" protein_descr="decoy_20982">
<search_score name="pvalue" value="0.000014221885248"/>
<search_score name="expect" value="0.168173793062284"/>
</search_hit>