Hi,
Lately, I have been trying to couple the Comet + percolator with other protein inference tools such as PIA and Epifany. However I have hit a roadblock due, it seems, on the way comet writes out the variable modification and in particular the "Gln->pyroGlu (Q)" at the peptide N-terminus. The Q(-17.xxxx) seems to be not at the N-terminus of the peptide but in the middle of the sequence:
The pin output file, for example shows something like this:
/home/ubuntu/1a7d100b-b01e-441e-8acf-2a78fd92c151.comet_6_3_5 -1 6 897.565009 897.564753 1.609438 0.000000 0.013894 4.543652 0.652256 32.644783 0.1190 898.572285 8 0 0 1 0 0 0 0 1 1 2 5.902633 0.000000 0.000000 K.KLSKIAQ[-17.0265]K.Y decoy_sp|Q9NP80|PLPL8_HUMAN decoy_sp|Q9NP80-2|PLPL8_HUMAN
where the sequence
K.RRAGQEQ[-17.0265]R has pyroGly mod in the middle of the peptide, and the pep.xml output reports it like this:
<search_hit hit_rank="5" peptide="KLSKIAQK" peptide_prev_aa="K" peptide_next_aa="Y" protein="decoy_sp|Q9NP80|PLPL8_HUMAN" num_tot_proteins="5" num_matched_ions="5" tot_num_ions="42" calc_neutral_pep_mass="897.564753" massdiff="0.000256" num_tol_term="2" num_missed_cleavages="2" num_matched_peptides="366">
...
<modification_info modified_peptide="KLSKIAQ[111]K">
<mod_aminoacid_mass position="7" mass="111.032029" variable="-17.026549" source="param"/>
Percolator has no issue with the modification, but from this point on I need to convert the pep.xml into idXML, and the OpenMS conversion has trouble with it. I assume this is correct, but, just to understand, why does this happen? Is comet shuffling the peptide sequence that has an n-terminal modification, leaving the mod "attached" to the original amino acid?
Just for reference, I'm using Comet version "2019.01 rev. 5", the parameters file is attached.
thanks,
Peppe